Abstract
Mutations in the Plasmodium falciparum chloroquine resistance transporter (PfCRT) confer resistance to several antimalarial drugs such as chloroquine (CQ) or piperaquine (PPQ), a partner molecule in current artemisinin-based combination therapies. As a member of the Drug/Metabolite Transporter (DMT) superfamily, the vacuolar transporter PfCRT may translocate substrate molecule(s) across the membrane of the digestive vacuole (DV), a lysosome-like organelle. However, the physiological substrate(s), the transport mechanism and the functional regions of PfCRT remain to be fully characterized. Here, we hypothesized that identification of evolutionary conserved sites in a tertiary structural context could help locate putative functional regions of PfCRT. Hence, site-specific substitution rates were estimated over Plasmodium evolution at each amino acid sites, and the PfCRT tertiary structure was predicted in both inward-facing (open-to-vacuole) and occluded states through homology modeling using DMT template structures sharing <15% sequence identity with PfCRT. We found that the vacuolar-half and membrane-spanning domain (and especially the transmembrane helix 9) of PfCRT were more conserved, supporting that its physiological substrate is expelled out of the parasite DV. In the PfCRT occluded state, some evolutionary conserved sites, including positions related to drug resistance mutations, participate in a putative binding pocket located at the core of the PfCRT membrane-spanning domain. Through structural comparison with experimentally-characterized DMT transporters, we identified several conserved PfCRT amino acid sites located in this pocket as robust candidates for mediating substrate transport. Finally, in silico mutagenesis revealed that drug resistance mutations caused drastic changes in the electrostatic potential of the transporter vacuolar entry and pocket, facilitating the escape of protonated CQ and PPQ from the parasite DV.
Subject terms: Phylogeny, Protein analysis, Protein function predictions, Protein structure predictions, Molecular evolution
Introduction
Resistance to antimalarial drugs continues to threaten malaria control. Formerly used as first-line treatment for uncomplicated Plasmodium falciparum malaria in most endemic areas, chloroquine (CQ) has seen its efficacy reduced in the late 60 s onwards after a decade of use. CQ resistance (CQR) was initially reported in Southeast Asia and South America, and then in Western Pacific and Africa1. The spread of CQR has led to successive antimalarial drug replacements and since 2000 to the progressive adoption of artemisinin-based combination therapies (ACTs). CQ remains highly effective to treat malaria caused by the other human-infecting Plasmodium parasite species2.
During the parasite intraerythrocytic development, hemoglobin from the host cytosol is internalized by the parasite and delivered into its digestive vacuole (DV), an acidic lysosome-like organelle. Degradation of hemoglobin in the DV generates toxic Fe2+-heme molecules that spontaneously oxidize to form membrane-disrupting Fe3+-heme3. To avoid this, the Fe3+-heme molecules are biocrystallized in the DV into non-toxic, chemically inert crystals known as hemozoin.
Quinoline-containing antimalarial drugs such as CQ, but also piperaquine (PPQ), amodiaquine and pyronaridine, are thought to primarily interfere with the conversion of toxic heme to non-toxic hemozoin crystal4. This process has been especially characterized for CQ, which is found in the DV at high concentrations5–7, leading to the accumulation of toxic Fe2+-heme and parasite death8.
The main genetic determinant of CQR is the Plasmodium falciparum chloroquine resistance transporter gene (pfcrt)9,10. Pfcrt encodes a 424 amino acid protein (PfCRT), consisting of 10 transmembrane helices (TMs)10, which is essential to the parasite intraerythrocytic development11. At least six independent origins of CQR mutations were reported12, with various mutant pfcrt haplotypes harboring different sets of 4 to 10 non-synonymous mutations such as the Asian Dd2 haplotype that later spread to Africa1. Mutant PfCRT molecules have acquired the ability to expel CQ out of the DV13. All CQR haplotypes carry the key K76T mutation that removes a positive charge in TM 1, suggesting a charge-dependent transport mechanism as CQ is di-protonated in the acidic DV10,13. The K76T mutation is always accompanied by additional mutations which may increase the CQR level and/or attenuate the fitness cost of resistance14,15. PfCRT remains a protein of utmost clinical importance since both ancient and novel PfCRT mutations are associated with altered in vitro antimalarial activity of some ACT drugs (PPQ, amodiaquine, lumefantrine, and artemisinin derivatives) and with ACT treatment failures13,16–21. For most of these drugs, the PfCRT-dependent pharmaco-modulation might operate through different mechanisms than for CQ19,22,23.
After two decades of research, the physiological role of PfCRT in the parasite biology is still questioned. Its secondary structure, location in the DV membrane and belonging to the Drug/Metabolite Transporter (DMT) superfamily support its participation in metabolite transport10,24. PfCRT was reported as a proton-coupled transporter recognizing some cationic substrates25, a Fe2+/Fe3+ transporter26, and playing a role in glutathione homeostasis27,28. Earlier studies have also reported PfCRT as a Cl− channel mediator, a proton pump regulatory and an activator of Na+/H+ exchangers29–31.
Like DMT proteins32, PfCRT may presumably operate by a mechanism known as the alternate access model33,34. In such a model, the binding of substrate, maintained inside the transporter through a so-called occluded state, triggers a structural transition between two other conformations, the inward- and outward-facing states, thereby inducing the translocation of the substrate across the membrane. In the case of PfCRT, the binding site(s) of physiological substrate(s) and the different structural states of the transporter remain to be fully elucidated. Consistent with this, a PfCRT mutant structure was recently determined in an inward-facing state by cryo-electron microscopy (named here PfCRTcryo-EM, open-to-DV) and also predicted by modeling in an outward-facing state (PfCRTOF, open-to-cytoplasm; which were published during the revision process of this work)35.
Here, we hypothesized that combining structural and evolutionary information could help identify putative functional regions of PfCRT and also their link with drug resistance. We reasoned that codon or amino acid sites that are important for PfCRT function are especially conserved. First, we investigated the variability of the conservation level across the CRT-coding DNA sequence and phylogeny over Plasmodium evolution. To get a finer map of evolutionary constraints in a structural context, we also built models of the tertiary structure of PfCRT by homology modeling, based on high-resolution 3D structures of DMT proteins containing 10 TMs32,36. Two different PfCRT conformations were predicted: an inward-facing state (PfCRTIF model; i.e. open-to-DV) and an occluded state (PfCRTOC model; i.e. both vacuolar and cytoplasmic gates closed) in which an open cavity and a binding pocket were identified, respectively. Combined evolutionary and structural analyses identified several slowly evolving amino acid sites located in the open cavity (for PfCRTIF) and binding pocket (for PfCRTOC). Through a comparative structural approach with other experimentally-characterized DMTs, we further identified several amino acid sites of the binding pocket that may be involved in the physiological transport activity of PfCRT. Finally, we explored the location and structural/physicochemical impacts of CQR and PPQ resistance (PPQR) mutations through a series of in silico mutagenesis from the PfCRTOC and PfCRTIF models.
Results
CRT sequence dataset and phylogenetic tree analysis
To evaluate the selective regime acting on the coding-sequence and protein product of the crt gene, we first retrieved orthologous CRT sequences from public databases. Non-Plasmodium species have not been included in the study because we cannot confirm that PfCRT homologous protein(s) in those organisms are truly orthologous to PfCRT. As we had enough evolutionary information with CRTs from Plasmodium, we choose this smaller dataset to avoid introducing any putative bias in our evolutionary analyses37. Among the retrieved 24 Plasmodium CRT sequences, only that of the bird-infecting P. relictum appeared incomplete with an absence of twenty amino acid positions in the C-terminal region. The other sequences showed high conservation in length, ranging from 423 to 425 amino acids (Supplementary Table S1), resulting in the presence of few indels (8 amino acid positions) in the CRT multiple sequence alignment that were removed from subsequent analyses (Supplementary Fig. S1). The phylogenetic tree produced from the curated crt sequence alignment was highly congruent with the acknowledged phylogeny of Plasmodium species38, with four main monophyletic groups (bird parasites, primate parasites ‘Plasmodium’, rodent parasites and primate parasites ‘Laverania’) supported by high bootstrap values (≥90%; Supplementary Fig. S2).
Variable levels of conservation across the crt gene sequence and phylogeny
The selective pressures acting on the CRT-coding DNA sequence were searched for by estimating the non-synonymous to synonymous substitution rate ratio ω (=dN/dS) across lineages (branch model) and codon sites (random-site models) using the codeml program (ω > 1 indicates positive selection whereas ω < 1 reflects purifying selection)39. The entire crt gene was found to evolve under a variable selective regime across Plasmodium phylogeny as the free-ratio model (that allows ω to vary along branches in the crt phylogenetic tree) had better fit to the data than the one-ratio null model M0 (which supposes only one ω value for all branches; p = 1.85 × 10−32; Table 1). Although purifying selection (ω < 1) was pervasive across the whole phylogeny, some specific branches showed evidence of positive selection (ω > 1 in P. vinckei vinckei or in the common ancestor of P. fragile, P. knowlesi and P. coatneyi), suggesting few adaptive changes during Plasmodium evolution (Supplementary Fig. S3). The substitution rate ratio ω was also found to be heterogeneously distributed across crt codon sites as assessed by the M0:M3 comparison (p = 1.45 × 10−133; Table 1), but no specific codon site was detected to evolve under positive selection since the positive selection models M2a and M8 provided no better fits to the data than the neutral models M1a and M7, respectively (p = 1.00 in both cases; Table 1 and Supplementary Table S2). Under the best-fitted model M3, the sequence encoding the cytoplasmic N- and C-terminal extremities of CRT were found to be much less conserved than most of the remaining protein-coding DNA sequence (also noticeable by visual inspection of the CRT multiple sequence alignment provided in Supplementary Figure S1). Those extremities contained several sites that were found phosphorylated in PfCRT40, including T416 which notwithstanding appeared highly conserved. Altogether, the data indicate that much of the CRT protein has evolved under purifying selection over Plasmodium evolution.
Table 1.
Models | 2Δℓ | df | p value | Conclusion |
---|---|---|---|---|
M0:free-ratio | 260.20 | 44 | 1.85 × 10−32 | ω heterogeneous among lineages of the phylogenetic tree |
M0:M3 | 623.24 | 4 | 1.45 × 10−133 | ω heterogeneous across codon sites |
M1a:M2a | 0.00 | 2 | 1.00 | No codon sites detected to evolve under positive selection |
M7:M8 | 0.00 | 2 | 1.00 | No codon sites detected to evolve under positive selection |
Partitions: cytoplasmic-half (n = 146 codon sites) versus vacuolar-half (n = 196 codon sites) sides | ||||
A:B | 14.59 | 1 | 1.33 × 10−4 | Different nucleotide substitution rates among partitions |
B:D | 19.71 | 2 | 5.24 × 10−5 | Heterogeneity of evolutionary parameters among partitions |
C:E | 18.87 | 2 | 7.97 × 10−5 | Heterogeneity of evolutionary parameters among partitions |
Partitions: non-TMs (n = 116 codon sites) versus TMs (n = 226 codon sites) | ||||
A:B | 58.32 | 1 | 2.22 × 10−14 | Different nucleotide substitution rates among partitions |
B:D | 12.56 | 2 | 1.87 × 10−3 | Heterogeneity of evolutionary parameters among partitions |
C:E | 10.77 | 2 | 4.58 × 10−3 | Heterogeneity of evolutionary parameters among partitions |
ω, non-synonymous (dN) to synonymous (dS) substitution rate ratio (dN/dS); 2∆ℓ, log-likelihood ratio of the two tested models; df, degrees of freedom; TMs, transmembrane helices; non-TMs, loops connecting two TMs. 2Δℓ was compared to a chi-squared table to determine the significance of the likelihood ratio tests. For the two series of partitioning tests shown here, codon sites were partitioned with the PfCRTIF model and we discarded the poorly conserved N- and C-terminal regions, both located in the cytoplasm (this alteration was conservative). Similar results were obtained when codon sites were partitioned with the PfCRTOC model (data not shown).
Identification of DMT template structures to predict PfCRT tertiary structure by homology modeling
In order to get a finer map of evolutionary constraints in a structural context, we aimed to produce a tertiary structure of PfCRT by homology modeling. By reviewing the literature, we found that, since 2016, several high-resolution 3D structures of DMTs containing 10 TMs have been determined by X-ray diffraction in different conformational states: the nucleotide sugar GDP-mannose transporter Vrg4 resolved in an inward-facing state (PDB ID: 5oge)36, the triose-phosphate/phosphate translocator TPT resolved in an occluded state (in complex with 3-phosphoglycerate; PDB ID: 5y79)32, and the aromatic amino acids and exogenous toxic exporter YddG (PDB ID: 5i20)41. Phyre242 and HHpred43 interactive servers identified these same three DMT structures as the current best structures to model PfCRT, with an e-value < 5.4 e−21 and a confidence criterion (which represents the probability that the match between the sequence and the template arises from a true relationship42) of 98.7% for the least acceptable of these three DMT structures (YddG; Supplementary Tables S3 and S4). Despite a low sequence identity between PfCRT and these transporters, reaching a maximum of 14% for the PfCRT-TPT comparison (Supplementary Table S5), homology modeling remains still feasible44 and the PfCRT tertiary structure may be predicted in both inward-facing and occluded states.
As an initial validity control, we evaluated the variability in sequence and structure across the three candidate DMT templates which structures were experimentally and therefore independently obtained. Of note, in the phylogeny of the DMT superfamily, they are located in different branches45. Accordingly, the paired sequence identities for these three template transporters were low, reaching for example only 13% between TPT and Vrg4 (Supplementary Table S5). However, a visual inspection of paired structural superposition revealed a very similar fold (Supplementary Fig. S4). To quantify this, we computed for paired DMT structures the Cɑ-based root-mean-square deviation (RMSD, which measures to what extent a given residue in a protein of interest changes its position compared to the structurally aligned residue from another protein). When the TMs-connecting loops were included, the RMSD ranged from 5.3 Ångström (Å) for Vrg4-YddG to 7.6 Å for TPT-YddG, and respectively diminished to 4.5 Å and 5.4 Å after exclusion of the loops (Supplementary Table S5). Hence, these results indicated that the structures have a similar fold but a different degree of intra/extracellular gate openness. We also noted that these RMSD values are consistent with those observed for other transporter superfamilies. For example, we retrieved from the Protein Data Bank three members of the Major Facilitator Superfamily (MFS), LacY (PDB ID: 2y5y)46, GlpT (PDB ID: 1pw4)47 and EmrD (PDB ID: 2gfp)48 which, despite exhibiting low paired sequence identity (~15%), shared a similar fold and arrangement of 12 TMs48, with Cɑ-based RMSDs including TMs-connecting loops ranging from 5.9 Å (LacY-GlpT) to 8.9 Å (GlpT-EmrD; Supplementary Fig. S5). Therefore, we assumed that PfCRT may share a similar fold to the aforementioned 10 TMs-containing DMT proteins, at least regarding the membrane spanning region.
Predicting different conformational states of PfCRT tertiary structure by homology modeling
Having identified robust DMT template structures, we modeled wild-type (WT) PfCRT tertiary structure in two different conformational states: an inward-facing state (PfCRTIF) and an occluded state (PfCRTOC), using respectively Vrg4 and TPT as templates (Supplementary Table S6)32,36. We discarded the YddG template because it had a lower sequence identity with PfCRT than Vrg4 (both in inward-facing state; Supplementary Table S5)32,41. We also discarded the PfCRT positions 269–313 corresponding to the long vacuolar loop connecting TM 7 and TM 8 and the cytoplasmic N- and C-terminal extremities (positions 1–55 and 399–424, respectively) because of insufficient amino acid coverage in DMT templates.
In homology modeling, alignment of the target-template sequences (here PfCRT-Vrg4 and PfCRT-TPT) is a critical step. Since hydrophobicity, which reflects the transmembrane topology, is typically conserved in TMs during evolution, the hydropathy profile can contain similar global features even in very distantly related proteins49. Consequently, we first produced a superfamily-averaged hydropathy profile alignment with AlignMe50 using the CRT multiple sequence alignment and an alignment of different members of the DMT superfamily. This profile-profile (CRTs-DMTs) alignment pinpointed some conserved amino acid positions that should be aligned in each final target-template alignment (Supplementary Figs. S6 and S7). We then produced a target-template alignment using the AlignMe PST algorithm which is designed for distantly related proteins (i.e. with a sequence identity <15%)50, and manually optimized it in order to align the conserved amino acid positions identified from the profile-profile alignment (see the final alignments for PfCRT-Vrg4 and PfCRT-TPT in Supplementary Figs. S6 and S7 respectively).
After the refinement and minimization steps, we checked the quality of the two built PfCRT models using local stereochemistry (MolProbity). For comparison, we used as controls several high-resolution 3D structures: the two DMT template and the PfCRTcryo-EM structure35. The two PfCRT models exhibited similar quality indices than these experimentally-determined control structures as suggested by Ramachandran plots (Table 2). General (ERRAT51, Prosa II52) and transmembrane-specialized (QMEANBrane53) 3D quality metrics also provided high confidence in structure conformation, energies and non-bonded interactions between atoms of the PfCRTOC and PfCRTIF models, when compared to the templates and other highly refined structures including PfCRTcryo-EM from the Protein Data Bank (Table 2 and Supplementary Fig. S8). Finally, directional atomic contacts were evaluated to check the structures for correctness using the fine packing quality control implemented in the What IF server54. For both PfCRT models, the average Z-score for all contacts was close to 0, confirming the normality of the local environment of amino acids, similarly to Vrg4 and TPT template structures and PfCRTcryo-EM (Table 2). We then found that the paired PfCRTIF-Vrg4 and PfCRTOC-TPT structures had a Cɑ-based RMSD of 5.2 Å and 3.0 Å, respectively (3.0 Å and 2.5 Å when TMs-connecting loops were excluded; Supplementary Table S5). These RMSD values were similar to those obtained with paired DMT template structures (Supplementary Table S5). In all, different quality metrics and structural comparisons indicated that the PfCRTIF and PfCRTOC models can be reliably used for further structural analyses.
Table 2.
Proteina | ProSA IIb | MolProbityc | ERRATd [0-100] | QMEAN Branee [0-1] | Fine packing quality controlf | |||||
---|---|---|---|---|---|---|---|---|---|---|
Poor rotamers, n (%) | Favored rotamers, n (%) | Ramachandran outliers, n (%) | Ramachandran favored, n (%) | Bad bonds, n (%) | Bad angles, n (%) | |||||
Vrg4 | −1.76 | 21/268 (7.84) | 218/268 (81.34) |
2/298 (0.67) |
278/298 (93.29) |
0/2415 (0.00) | 4/3278 (0.12) | 92.32 | 0.80 | 0.47 |
TPT | −3.17 | 5/251 (1.99) | 239/251 (95.22) |
0/303 (0.00) |
300/303 (99.01) |
0/2423 (0.00) | 0/3313 (0.00) | 96.61 | 0.84 | 0.85 |
PfCRTCryo-EM | −2.75 |
3/312 (0.96) |
289/312 (92.63) |
0/346 (0.00) |
334/346 (96.53) |
0/2876 (0.00) |
0/3898 (0.00) |
89.10 | 0.81 | −0.60 |
PfCRTIF | −2.72 | 4/279 (1.43) | 269/279 (96.42) |
0/301 (0.00) |
287/301 (95.35) |
0/2550 (0.00) | 2/3446 (0.06) | 98.70 | 0.79 | 0.24 |
PfCRTOC | −2.11 | 0/272 (0.00) | 270/272 (99.26) |
0/294 (0.00) |
287/294 (97.62) |
4/2487 (0.16) | 3/3366 (0.09) | 99.01 | 0.81 | 0.43 |
aThe experimentally-determined tertiary structures of two DMTs (Vrg4 and TPT) were used as templates for homology modeling of PfCRT structure.PfCRTcryo-EM corresponds to a PfCRT mutant structure recently determined in an inward-facing state by cryo-electron microscopy. PfCRTIF and PfCRTOC correspond to PfCRT wild type models in inward-facing and occluded states, respectively. bProSA II generates an overall quality score compared with the ones from all experimentally determined protein chains available in the Protein Data Bank. cMolProbity indicates the accuracy of a macromolecular structure model and diagnoses local problems by all-atom contact analysis, complemented by updated versions of covalent-geometry and torsion-angle criteria. dERRAT analyzes the statistics of non-bonded interactions between different atom types, then plots error values as a function of the position of a 9-residue sliding window, calculated by a comparison with statistics of non-bonded atom-atom interactions from highly refined structures. eQMEANBrane is a structure quality assessment tool dedicated to membrane proteins. It determines a local quality score per residue (the mean per-residue quality score is shown), by combining statistical potentials trained on membrane protein structures with a per-residue weighting scheme. fThe fine packing quality control, implemented in the What IF server, checks the normality of the local environment of amino acids. Values are expressed as Z-scores: a value < −2 or >2 indicates a wrong model or an incorrect structure; a Z-score close to 0 represents a high quality structure.
To evaluate the accuracy of the PfCRTIF and PfCRTOC models, we compared their secondary and tertiary structural features with those of PfCRTcryo-EM. Our two models revealed symmetry-related structural repeats consisting of 10 TM bundle (with TM 1 to TM 5 and TM 6 to TM 10 distributed in anticlockwise and clockwise manners, respectively), approximately 30 Å in length, similarly to PfCRTcryo-EM (Fig. 1A,B). In PfCRTcryo-EM, two additional helices (JM 1 and JM 2) were identified, respectively located in the N-terminal region and the long vacuolar loop connecting TM 7 and TM 8 (two regions discarded in our modeling study; Fig. 2A). In the PfCRTIF model, the structure was basket-shaped, similarly to Vrg4 and PfCRTcryo-EM, with a deep cavity of ~3,200 Å3 opened at the vacuolar side and composed of amino acids from most TM helices except TM 5 and TM 10 (Fig. 1A). In the PfCRTOC model, a pocket of ~1,050 Å3 was observed in the core of the structure (Fig. 1B; the 41 pocket positions are listed in Supplementary Table S7), which corresponds to the substrate-binding pocket in TPT32. A structural comparison of PfCRTIF and PfCRTOC revealed a prominent difference in the orientation of TM 3 and TM 4 of ~30° at the vacuolar face (Fig. 1C), suggesting that these helices undergo rocker-switch movements to open and close the vacuolar gate. By estimating the electrostatic surface potential on the models (which have WT PfCRT haplotype), we noted that the vacuolar gate is slightly electropositive whilst the cavity and putative binding pocket are globally neutral (Fig. 1D). TMs boundaries between PfCRTcryo-EM and models were similar, except TM 3, TM 7 and TM 10 that were longer in PfCRTcryo-EM (Fig. 2A). In addition, the folds of the PfCRTIF model and PfCRTcryo-EM were highly similar as evidenced by a RMSD of 2.8 Å (2.4 Å without TMs-connecting loops; Fig. 2B), which provides a direct validation of our PfCRTIF model. This result is consistent with the Cɑ-based structural conservation profile produced between PfCRT models and PfCRTcryo-EM using the Dali server55, revealing a weaker structural conservation only on some TMs-connecting loops (Supplementary Fig. S9). Also, Cɑ-based contact maps generated with CMWeb56 indicated overall the same intra-protein contacts between our PfCRT models and PfCRTcryo-EM, with similar pairwise amino acid contact distributions (Supplementary Fig. S10). Finally, we used the orientation of side-chain amino acids related to CQR and PPQR mutations as another indicator of validity. After substituting mutant positions in PfCRTcryo-EM (S72, T76, S220, D326 and L356) to WT (C72, K76, A220, N326 and I356) with UCSF Chimera57, we noticed that most of drug resistance-associated amino acid sites were similarly oriented in the PfCRT models compared to PfCRTcryo-EM (Fig. 2C), except a major difference for the amino acid I218 in PfCRTIF. Altogether, the two models we produced were accurate and reliable for subsequent analyses.
Vacuolar and TM sites have evolved under strong purifying selection
We used our new PfCRT models to test whether some PfCRT regions, apart from the N- and C-terminal extremities, were differentially conserved. We used the fixed-site models and partitioning function implemented in the codeml program58. We tested two different partitioning of crt codon sites: 1) cytoplasmic-half versus vacuolar-half sides of the transporter and 2) non-TMs (i.e. loops) versus TMs. In the two sets of partitioning, model B provided a better fit to the data than model A which assumes the same substitution pattern with identical parameters among partitions (p = 1 × 10−4 and 2 × 10−14 for cytoplasmic-half/vacuolar-half and non-TMs/TMs, respectively, Table 1 and Supplementary Table S8). This indicates that the nucleotide substitution rates at vacuolar and TM sites were on average 0.80 and 0.62 times as low as at the cytoplasmic and non-TM sites, respectively. More complex models D and E also provided better fits to the data than comparison models B and C (Table 1), confirming a heterogeneity of evolutionary parameters in the two sets of partitioning (Supplementary Table S8). By looking at the substitution rates ω in the different partitions, vacuolar and TM sites were under stronger purifying selection than cytoplasmic and non-TM sites, respectively (cytoplasmic-half/vacuolar-half: ω1 = 0.121/ω2 = 0.070, best fitted D model; non-TMs/TMs: ω1 = 0.111/ω2 = 0.078, best fitted E model; Supplementary Table S8). This is consistent with TMs being the hydrophobic and conserved core of the protein and the vacuolar side being under functional constraint as the likely entry route for physiological substrates6,13,22,25,26,59.
The cavity of PfCRT contains highly conserved amino acid sites
We next searched for the most conserved amino acid positions in the PfCRTOC and PfCRTIF models as indicators of putative functional sites. We used the FuncPatch server which jointly analyses phylogenetic tree, protein tertiary structure and conservation information. FuncPatch infers site-specific substitution rates at the protein level by taking into account their spatial location in the tertiary structure, and is especially useful in the case of highly conserved proteins37,60. Site-specific substitution rates were very similar using either the PfCRTIF or PfCRTOC models (Spearman’s rank correlation: r = 0.99, p < 0.001; Fig. 3A and Supplementary Data S1), so only the data obtained with PfCRTIF will be presented. We detected a significant spatial correlation of site-specific substitution rates in the PfCRTIF model as evidenced by a log Bayes factor of 47.6, suggesting the presence of a functional patch. We then mapped onto the tertiary structure the 10% most conserved amino acid sites of the 297 PfCRT positions included in the FuncPatch analysis. Remarkably, almost the whole TM 9 that lines the PfCRTIF cavity was highlighted (Fig. 3B and Supplementary Table S9). Other highly conserved positions were located on TM 4, TM 5, in the middle of TM 8 and on TM 10 (Fig. 3B). It is noteworthy that TM 9 has been reported to play a major role in substrate binding in other DMTs consisting of 10 TMs24, and especially in Vrg4 and TPT proteins (Supplementary Fig. S11)32,36. Evolutionary rates at the protein level were also estimated using the Consurf web server61, and this analysis revealed again the high conservation of the PfCRTIF cavity and especially TM 9 (Supplementary Fig. S12).
PfCRT amino acids Y68, D329, Y345 and S349 might play a role in physiological substrate trafficking
Some of the most highly conserved PfCRT amino acid sites might be under strong functional – rather than structural – constraints. Being a member of the DMT superfamily, PfCRT may bind a substrate through amino acids located in its central cavity, and more particularly in the binding pocket. Consequently, we focused on amino acid positions strictly conserved across Plasmodium species and whose side-chains line the binding pocket. After these filtering steps, we listed 24 amino acid positions (Supplementary Table S10) that were further analyzed in three paired structural alignments, PfCRTIF-Vrg4, PfCRTIF-YddG and PfCRTOC-TPT. We searched for structurally aligned positions that have been shown experimentally to modulate substrate trafficking in DMT template proteins (Supplementary Fig. S13)32,36,41. Two tyrosines of PfCRTIF (Y68 in TM 1 and Y345 in TM 9) paired with two tyrosines of Vrg4 (Y28 in TM 1 and Y281 in TM 9; Fig. 4A) reported to be essential for the translocation of GDP–mannose and GMP molecules36. One aspartic acid of PfCRTOC (D329 in TM 8) paired with a tyrosine of the TPT protein (Y339 in TM 8, Fig. 4B) which is crucial for phosphate recognition and transport32. Finally, PfCRTIF S349 (TM 9) paired with both S244 from the bacterial YddG protein and G285 from Vrg4 (Fig. 4C), which play a role in the transport of threonine/methionine and both GDP-mannose and GMP molecules, respectively36,41. Of those four candidate PfCRT positions, three belonged to the conserved patch evidenced by the FuncPatch server (D329, Y345 and S349; Supplementary Table S9). In PfCRTcryo-EM (inward-facing state), the side-chains of these four positions were also oriented towards the cavity (Supplementary Fig. S14). Overall, this comparative evolutionary and structural analysis identified amino acid sites putatively involved in the physiological transport activity of PfCRT (Fig. 4D).
Most PfCRT mutations conferring drug resistance are located in the cavity or binding pocket
Mutations in PfCRT confer drug resistance, chiefly to CQ and PPQ, but also alter sensitivities to other ACT drugs. To better understand the underlying resistance mechanisms, we mutated the PfCRTOC and PfCRTIF models in silico. We introduced jointly several mutations to produce Dd2, GB4 and 7G8 mutant structures which confer CQR and correspond to Asian, African and South American CQR haplotypes, respectively (Table 3). From an evolutionary view, five of the nine CQR-related positions investigated here were strictly conserved during Plasmodium evolution including the position 76 related to the key CQR K76T mutation (Supplementary Fig. S1). We noted that some positions related to CQR mutations (M74I, N75E and K76T) were markedly more conserved when site-specific substitution rates were computed by taking into account their spatial location in the tertiary structure (FuncPatch versus PAML data in Fig. 5A; Supplementary Data S1), indicating that these three amino acid sites were spatially close to conserved ones. Remarkably, the CQR-related positions we analyzed were directly located in the cavity of the PfCRTIF model, except position 74 (mutation M74I) which lies in TM 1 but on the outer surface of the transporter and position 371 (mutation R371I) in the TM 9-TM 10 vacuolar loop next to the entry of the cavity (Figs. 2C and 5A). Based on the PfCRTOC model, six positions related to CQR mutations were predicted to be involved in the architecture of the putative binding pocket (C72S, N75E, K76T, A220S, N326S/D and I356T/L), including five of the eight CQR mutations carried by the canonical Dd2 haplotype (Table 3). As the key CQR K76T mutation removes a positive charge in the cavity, we investigated in silico the changes in the electrostatic surface potential of the PfCRTIF and PfCRTOC Dd2, GB4 and 7G8 models. While the surface potential of PfCRTIF WT cavity and PfCRTOC WT binding pocket were neutral (Fig. 5B), they turned highly electronegative in all CQR mutant structures (Fig. 5C and Supplementary Fig. S15A).
Table 3.
Origin regiona | Parasite lineb | Drug responsec | TM 1 | TM 1 | TM 1 | TM1 | TM 2 | TM 2 | TM 2 | TM 3 | TM 6 | TM 6 | L7-8 | TM 8 | TM 9 | TM 9 | TM 9 | TM 9 | L9-10 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
72 | 74 | 75 | 76 | 93 | 97 | 101 | 145 | 218 | 220 | 271 | 326 | 343 | 350 | 353 | 356 | 371 | |||
All regions | 3D7 | CQS, PPQS | C | M | N | K | T | H | C | F | I | A | Q | N | M | C | G | I | R |
SE Asia | Dd2 | CQR, PPQS | C | I | E | T | T | H | C | F | I | S | E | S | M | C | G | T | I |
SE Asia | Dd2 + T93S | CQS, PPQR | C | I | E | T | S | H | C | F | I | S | E | S | M | C | G | T | I |
SE Asia | Dd2 + H97Y | CQS, PPQR | C | I | E | T | T | Y | C | F | I | S | E | S | M | C | G | T | I |
SE Asia | Dd2 + C101F | CQS, PPQR | C | I | E | T | T | H | F | F | I | S | E | S | M | C | G | T | I |
SE Asia | Dd2 + F145I | CQS, PPQR | C | I | E | T | T | H | C | I | I | S | E | S | M | C | G | T | I |
SE Asia | Dd2 + I218F | CQS, PPQR | C | I | E | T | T | H | C | F | F | S | E | S | M | C | G | T | I |
SE Asia | Dd2 + M343L | CQS, PPQR | C | I | E | T | T | H | C | F | I | S | E | S | L | C | G | T | I |
SE Asia | Dd2 + G353V | CQS, PPQR | C | I | E | T | T | H | C | F | I | S | E | S | M | C | V | T | I |
S America | 7G8 | CQR, PPQS | S | M | N | T | T | H | C | F | I | S | Q | D | M | C | G | L | R |
S America | 7G8 + C350R | CQS, PPQR | S | M | N | T | T | H | C | F | I | S | Q | D | M | R | G | L | R |
Africa | GB4 | CQR, PPQS | C | I | E | T | T | H | C | F | I | S | E | N | M | C | G | I | I |
aGeographic origin of the haplotype. SE Asia, Southeast Asia; S America, South America. bFor Dd2 and 7G8 parasite lines, one additional mutation was found to reverse CQR but confer PPQR19,63. cCQS, chloroquine sensitivity; CQR, chloroquine resistance; PPQS, piperaquine sensitivity; PPQR, piperaquine resistance. In this study, the Q271E mutation was not introduced in the mutant structures and subsequent analyses since it is located in the long TM 7-TM 8 vacuolar loop that was discarded for tertiary structure prediction. The location in the structure (TM, transmembrane helix; L, loop) is indicated above each position.
Next, the recently identified, additive PfCRT mutations conferring PPQR on a CQR genetic background (namely T93S, H97Y, C101F, F145I, I218F, M343L and G353V, found in PfCRT Dd2; and C350R found in PfCRT 7G8)19,62,63 were individually introduced in the PfCRTIF and PfCRTOC Dd2 or 7G8 CQR mutant models. Similarly as CQR-related positions, PPQR-related positions were almost all located in the cavity of the PfCRTIF model, except I218F and M343L which were located at the outer surface and at the cytoplasmic face of the transporter respectively (Figs. 2C and 5A). In the PfCRTOC model, positions related to PPQR mutations H97Y, C101F, F145I, C350R and G353V were involved in the architecture of the binding pocket. We observed a large electronegative surface potential in the cavity and binding pocket, similarly to the PfCRTIF and PfCRTOC Dd2, GB4 or 7G8 mutant structures conferring CQR (Fig. 5D and Supplementary Fig. S15B). Of note, addition of C350R rendered neutral a large part of the cavity except in the proximity of K76T (note that this was found only in the PfCRTIF model; Supplementary Fig. S15B). Six of the eight PPQR-related positions were strictly conserved during Plasmodium evolution (T93, C101, I218, M343, C350, G353; Supplementary Fig. S1).
Altogether, we concluded that PfCRT positions associated with mutations conferring resistance to quinoline-containing antimalarial drugs were globally conserved over Plasmodium evolution, mostly located in the cavity/binding pocket where physiological substrate trafficking is supposed to take place, and made the surface potential of the cavity and binding pocket more electronegative, as reported using the high-resolution PfCRTcryo-EM structure35.
Discussion
Based on our evolutionary results, we found that PfCRT is on average more conserved on its vacuolar-half side and its membrane-spanning domain. Therefore, the vacuolar side of PfCRT may play an important role in substrate recognition and/or transport, which supports the hypothesis of a physiological substrate being expelled by PfCRT out of the parasite DV. This is in line with the observation that recombinant PfCRT, either reconstituted in proteoliposomes25 or expressed at the surface of Xenopus laevis oocytes13,26, directionally transports CQ, peptides, basic amino acids, and ferric/ferrous iron. In the parasite, these solutes are known to accumulate in the DV from which they could be transported out either solely by mutant PfCRT (CQ and PPQ in their protonated forms) or by WT and/or mutant PfCRT (peptides, basic amino acids, and ferric/ferrous iron).
A high-resolution 3D PfCRT mutant structure determined in an inward-facing state by single-particle cryo-electron microscopy (PfCRTcryo-EM) was recently published35. The authors also modeled an outward-facing conformation of this mutant PfCRT (PfCRTOF), considering the pseudo-symmetrical arrangement of the TMs. Here, we complete the putative cycle of the PfCRT transporter by producing a model of PfCRT in an occluded state (PfCRTOC), where any substrate is engulfed by the residues of the binding pocket. Altogether, the results are compatible with the hypothesis that PfCRT may operate by the alternate access model33,34. Such a transporter mechanistic model was already proposed for other DMT proteins32. In PfCRT, the vacuolar half of TM 3 and TM 4 undergo rocker-switch movements which may contribute to open (in PfCRTIF/PfCRTcryo-EM) and close (PfCRTOC and PfCRTOF) the vacuolar gate. Meanwhile, based on the PfCRTOF model from Kim et al. (2019)35, the cytoplasmic half of TM 8 and TM 9 act similarly and may contribute to open (in PfCRTOF) and close (PfCRTIF/PfCRTcryo-EM and PfCRTOC) the cytoplasmic gate.
Based on evolutionary conservation and structural alignment with other DMTs, we also propose four PfCRT candidate sites in the binding pocket that may be involved in the transport of physiological substrate(s): Y68 (TM 1), D329 (TM 8), Y345 (TM 9) and S349 (TM 9). None of these positions were found mutated in drug-resistant parasites. Mutations at those positions could be either too deleterious for the parasite physiology and survival (pfcrt is an essential gene11) or not involved in drug transport. Two other interesting sites in TM 8 (N326) and TM 9 (I356) from the highly conserved patch are located in the PfCRT binding pocket. N326 and I356 were initially mutated in the canonical Dd2 haplotype that initially invaded Southeast Asia and Africa. Their removal does not significantly alter CQR but enhances parasite fitness in pfcrt-modified isogenic parasite lines20. These N326/I356-revertant fitter haplotypes are now predominant in sub-Saharan Africa, suggesting some competitive advantage of carrying WT residues at those positions20.
The modeled PfCRTIF and PfCRTOC structures predicted here provide some insights into quinoline-containing drug resistance, which are fully congruent with those recently reported35. The most striking result is that most positions mutated in CQR, including the key K76T, are located in the cavity of the PfCRTIF model, and more particularly in its predicted substrate-binding pocket (PfCRTOC model). Several of the mutated positions (C72S, M74I, N75E, N326S/D, I356T/L) are in close proximity with K76T, suggesting that all these mutations work together. These amino acid changes jointly cause drastic alterations of the electrostatic surface potential in the vacuolar entry of the cavity and in the binding pocket that become highly electronegative in the CQR PfCRT mutant structures. This radical shift in electrostatic surface potential is very likely a major contributor to the dramatically increased uptake of di-protonated CQ in X. laevis oocytes expressing CQR versus WT PfCRT13 and the decreased accumulation of CQ in the DV of CQ-resistant parasites6,64.
Similarly to CQR mutations, most of the novel PPQR mutations that occurred in Dd2 or 7G8 CQR genetic backgrounds19,21,62,63 are located in the cavity and binding pocket in our PfCRTIF and PfCRTOC models. However, the addition of PPQR mutations on CQR mutant structures (Dd2 and 7G8 haplotypes) did not alter the electronegative surface potential. This was moderately surprising because some PPQR mutations (such as M343L or G353V) attenuate but do not fully reverse CQR19. Remarkably, substantially increased PPQ transport rates by PPQ-resistant PfCRT variants were found in proteoliposomes, which were then confirmed in pfcrt-edited parasites35. Hence, we hypothesize that PPQ-conferring mutations provide additional changes in the cavity/binding pocket which could affect interactions with PPQ.
In conclusion, we showed that our bioinformatics analysis of PfCRT led to results similar to those obtained with the recently obtained experimental structure35 regarding the overall fold, architecture of the open cavity and alteration of its electrostatic surface potential in drug-resistant PfCRT mutant structures. Such results strengthen the usefulness of tertiary structure prediction of transmembrane proteins through homology modeling, even when a protein to model shares less than 15% sequence identity than a similarly folded template structure. Furthermore, we complement the experimental work of Kim et al. (2019)35 by providing an occluded state model of PfCRT and identifying a putative binding pocket where a substrate is likely maintained and could trigger a structural transition between the inward- and outward-facing states. This pocket, of smaller volume than the open cavity, also provides with a shortened list of highly conserved putative functional sites to be experimentally tested.
Materials and Methods
Collection of PfCRT orthologous sequences
The PfCRT amino acid sequence (PlasmoDB database identifier: PF3D7_0709000) was queried against the specialized Plasmodium database (PlasmoDB, release 38) and the non-redundant protein database of NCBI using blastp and tblastn searches (e-value cutoff = 10−6, BLOSUM62 scoring matrix)65. Twenty four CRT protein and corresponding cDNA sequences from distinct Plasmodium species were retrieved (Supplementary Table S1). For P. ovale spp. and P. inui sequences, exon skipping was manually performed so as to reconstruct the full cDNA and amino acid sequences.
Multiple protein and codon sequence alignments
The CRT multiple sequence alignment was first generated with TM-aligner66 using the transmembrane-specific substitution matrix PHAT67 and with gap opening and extension penalties set to 10 and 1, respectively. The output alignment was visually inspected and manually edited using BioEdit v7.2.568. The positions of the CRT multiple alignment containing gaps in ≥30% of all sequences were removed. Then, a crt nucleotide sequence alignment was generated with PAL2NAL69 using the cleaned CRT protein sequence alignment as template.
Phylogenetic analysis
A phylogenetic tree was built from the crt nucleotide sequence alignment by using the maximum likelihood method implemented in PhyML v3.070, after determining the best-fitting nucleotide substitution model using the Smart Model Selection (SMS) package71. A general time-reversible model with optimized equilibrium frequencies, gamma distributed among-site rate variation and estimated proportion of invariant sites (GTR + G + I) was used, as selected by the Akaike Information Criterion. The nearest neighbor interchange approach implemented in PhyML was chosen for tree improving, and branch supports were estimated using the approximate likelihood ratio aLRT SH-like method72. The crt phylogenetic tree was rooted using the most distant bird-infecting Plasmodium species as outgroup and displayed using the iTOL server73.
Analysis of selective pressures acting on crt
To investigate the evolutionary regime that has shaped the CRT-coding DNA sequence during Plasmodium evolution, the nucleotide sequence alignment and the maximum likelihood phylogenetic tree were submitted to the codeml tool of PAML v.4.9h39,74. The level of coding sequence conservation was measured by calculating the non-synonymous (dN) to synonymous (dS) substitution rate ratio ω (=dN/dS). Theoretically, positive selection at codon sites is indicated by ω > 1, while codon sites associated with ω = 1 and ω < 1 suggest neutral evolution and purifying selection, respectively. We investigated the heterogeneity of ω among lineages and along the crt sequence; then we searched for codons evolving under positive selection using a set of random-site and branch models (free-ratio, M0, M1a, M2a, M3, M7 and M8)75,76. Model M0 supposes the same dN/dS ratio for all branches of the phylogeny, while the “free-ratio” branch model assumes independent dN/dS ratio for every branch. Model M1a allows two classes of sites under neutral evolution (ω = 1) and purifying selection (0 ≤ ω < 1), whereas model M2a adds a third class of sites to model M1a as positive selection (ω > 1). In the discrete model M3, several classes (k = [3–5] in this study) of independent ω were estimated, each of them being associated with a specific proportion. Model M7 assumes a β-distribution of ten ω ratios limited to the interval [0, 1] with two shape parameters p and q. In model M8, an additional site class is estimated, with ω possibly > 1 as M2a does. The heterogeneity of ω among codon sites was tested by comparing model M0 to model M3, while paired models M1a:M2a and M7:M8 allow to detect positively selected sites75. Candidate sites for positive selection were identified in M2a and M8 models using the Bayes empirical Bayes inference (BEB)77, which calculates the posterior probability that every codon site belongs to a site class affected by positive selection; and using the naïve empirical Bayes (NEB) inference in model M3 (in which no BEB approach is implemented yet), considered as less powered than the BEB inference. For models considering k classes of dN/dS ratio, we used the mean ω value at each codon site, calculated as the sum of the ω values of each k class weighted by their estimated probabilities (Supplementary Data S1)39.
Statistical analyses based on structural information of the PfCRTIF and PfCRTOC models were performed using the partitioning function implemented in codeml58 in order to test i) whether the vacuolar-half side is more or less conserved than the cytoplasmic-half side; and ii) whether TMs are more or less conserved than non-TMs (or loops). Partitionings are based on fixed-site models A to E. Model A – that corresponds to model M0 – assumes identical parameters for transition/transversion (κ) rate ratio, branch lengths and equilibrium nucleotide frequencies, and unique ω ratio for partitions. Model B allows homogeneity among partitions in codon frequencies, κ and ω ratios but different rates of evolution, whereas model C further accounts for potential variation in codon frequency. Model D uses different κ and ω ratios based on the same codon frequencies for partitions, and model E ultimately allows different κ, ω and codon frequencies between partitions. The A:B comparison tests for different nucleotide substitution rates among partitions, while both B:D and C:E comparisons evaluate the variation in κ, ω, and codon frequencies among partitions58.
The comparisons of models were performed using Likelihood Ratio Tests (LRTs)78. For every LRT, twice the log-likelihood difference between alternative and null models (2Δℓ) was compared to critical values from a chi-squared distribution with degrees of freedom equal to the difference in the number of estimated parameters between both models79.
Modeling the PfCRT tertiary structure
When the analyses were performed, no experimental structure was determined for PfCRT. Consequently, we aimed at modeling it in different conformational states using a template-based protein structure modeling approach. The PfCRT sequence was submitted to interactive Phyre242 and HHpred43 servers to find the best available, appropriate templates. The nucleotide sugar GDP-mannose transporter Vrg4 (inward-facing state, PDB ID: 5oge)36 and the triose-phosphate/phosphate translocator TPT (occluded state, PDB ID: 5y79)32 were identified as sharing putative similar structural folds with PfCRT with confidence >99%. The very low sequence identity of these proteins with PfCRT, reaching only 14% with TPT, makes homology modeling challenging but still feasible44. In homology modeling, the sequence alignment of the target (here PfCRT) and the template (here other DMT proteins) is a critical step. Because hydrophobicity reflects the transmembrane topology and is typically conserved in TMs during evolution, we first produced a superfamily-averaged hydropathy profile alignment with AlignMe50 using the CRT multiple sequence alignment and an alignment of different members of the DMT superfamily. The profile (CRT)-profile (DMT) alignment pinpointed some amino acids which are supposed to be important to align in the final target-template sequence alignment (Supplementary Figs. S6 and S7). Finally, the target-template alignment was produced using the AlignMe PST algorithm, designed for distantly related proteins (i.e. with a sequence identity <15%)50, then manually optimized it in order to align the putative important amino acid positions identified from the superfamily-averaged hydropathy profile alignment (Supplementary Figs. S6 and S7). The long TM 7-TM 8 vacuolar loop and the cytoplasmic N- and C-terminal extremities were not considered for PfCRT modeling because of too few numbers of amino acid positions available from templates to properly cover them. Initially, we built 1,000 3D models for each PfCRT state satisfying the spatial restrains of the template structures using MODELLER 9.1780. The best models among them were selected based on scores calculated from discrete optimized protein energy (DOPE) and GA341 functions. Then, the best PfCRTOC and PfCRTIF models were submitted to GalaxyRefine81 for atomic-level, high-resolution refinement, which helped to achieve significant improvement in physical quality of the local structure. Finally, the refined models were subjected to an energy minimization to get the most stable conformations using the YASARA server82.
The refined, energy-minimized PfCRT models were then validated using i) MolProbity83 for local stereochemistry; ii) fine packing quality control implemented in the What IF server54 by estimating directional atomic contacts; and iii) ERRAT51, ProSA II52 and QMEANBrane53 for tertiary fold and global 3D quality metrics. Cɑ-based intra-protein contacts and pairwise amino acid contact distributions were investigated with the CMWeb server (default parameters)56. Finally, the Dali server55 was used to study the Cɑ-based structural conservation between the PfCRT models and the high-resolution PfCRTcryo-EM structure. The PDB files of the refined, energy-minimized PfCRTOC and PfCRTIF models are provided in Supplementary Files S1 and S2, respectively.
Analyses of structural features
Protein electrostatic surface potential was calculated using Adaptive Poisson-Boltzmann Solver (APBS)84, after determining the per-atom charge and radius of the structure with PDB2PQR v.2.1.185. The Poisson-Boltzmann equation was solved at 298 K using a grid-based method, with solute and solvent dielectric constants fixed at 2 and 78.5, respectively. We used a scale of −8 kT/e to + 8 kT/e to map the electrostatic surface potential in a radius of 1.4 Å.
The cavity/pocket volume of the PfCRT models was analytically calculated using the Connolly’s surface (or molecular surface model) with CASTp 3.0 (default parameters)86.
Putative functional patches (i.e. highly conserved amino acid positions that are in close physical proximity in the tertiary structure) in the two PfCRT models were searched for using the FuncPatch server by estimating site-specific substitution rates, possibly spatially correlated37. We submitted to the server the PfCRTOC and PfCRTIF models along with the CRT multiple sequence alignment and the maximum-likelihood crt phylogenetic tree. The spatial correlation of the site-specific amino acid substitution rates in the PfCRT predicted structures was tested using a Bayesian model comparison, where a null model (model 0), in which no spatial correlation of site-specific substitution rates is present, is compared to the alternative model (model 1). As suggested by the FuncPatch’ authors, a spatial correlation was considered as significant if the estimated log Bayes factor (model 1 versus model 0) was larger than the conservative cutoff value of 837. Finally, evolutionary rates were also estimated at each amino acid site of the PfCRT sequence and then mapped onto the tertiary structure with Consurf (default parameters)61, a widely used web server implementing advanced probabilistic evolutionary models. Contrary to FuncPatch, Consurf does not include the spatial correlation of site-specific substitution rates attributed to tertiary structure37,61.
Identifying candidate functional sites of PfCRT
In order to identify candidate functional amino acid sites possibly involved in PfCRT physiological substrate(s) trafficking, a comparative evolutionary and structural analysis was performed to filter step-by-step PfCRT amino acid sites. Because functional amino acids are usually very conserved over evolutionary time87, only the strictly conserved amino acid positions (i.e. the absence of non-synonymous substitutions for a given position) of PfCRT during Plasmodium evolution were kept. From this subset of positions, we then focused on those involved in the putative substrate-binding pocket in the PfCRTOC model since it is widely accepted that DMT proteins transport substrates through binding sites located in their pocket24,32,36,41. Finally, the DMT template structures used to predict the conformational states of PfCRT tertiary structure (Vrg4 and TPT)32,36, in addition with another DMT protein (YddG)41 were aligned with the PfCRTOC and PfCRTIF models. By focusing on the amino acid sites from these three structures that have been shown to be important or essential for substrate binding or transport, we listed PfCRT candidate positions that might contribute to substrate trafficking (Supplementary Fig. S13).
In silico production of PfCRT mutant structures
Some in silico mutagenesis from the refined, energy-minimized PfCRT models were performed. We generated three PfCRT mutant models per PfCRT conformational state based on the Dd2, GB4 and 7G8 haplotypes which are associated with CQR (Table 3)10,88. The CQ-resistant PfCRTIF or PfCRTOC Dd2 and 7G8 structures were then subjected to individual additional mutations that attenuate CQR but confer PPQR (Table 3)19,21,62,63. Mutations were introduced using the swapaa function of UCSF Chimera by substituting the residue with the most probable rotameric conformation57.
Structure visualization
Molecular drawings were produced using UCSF Chimera57.
Supplementary information
Acknowledgements
We are extremely grateful to I. Callebaut for her expert guidance and advices for the homology modeling analyses and for her critical review of the manuscript. We thank B. Offmann for helpful suggestions regarding the critical evaluation of model validity. We thank Y-F. Huang for help with FuncPatch analysis, R. Coucke for early work on the project, and R. Duval for helpful discussions. R. Coppée is a recipient of a PhD fellowship from the doctoral school MTCI n°563. This work was partly supported by the Agence Nationale de la Recherche (ANR-17-CE15-0013-03 to J. Clain).
Author contributions
R.C., A.S. and J.C. designed the research. A.S. and J.C. supervised the whole work. R.C. performed the building of datasets, statistical and evolutionary analyses, homology modeling and structural visualization. All the authors participated in data interpretation and wrote the manuscript.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Romain Coppée, Email: romain.coppee@gmail.com.
Audrey Sabbagh, Email: audrey.sabbagh@parisdescartes.fr.
Jérôme Clain, Email: jerome.clain@parisdescartes.fr.
Supplementary information
is available for this paper at 10.1038/s41598-020-61181-1.
References
- 1.Wootton JC, et al. Genetic diversity and chloroquine selective sweeps in Plasmodium falciparum. Nature. 2002;418:320–323. doi: 10.1038/nature00813. [DOI] [PubMed] [Google Scholar]
- 2.WHO | World malaria report WHO, http://www.who.int/malaria/publications/world-malaria-report-2018/report/en/ (2018).
- 3.Milani KJ, Schneider TG, Taraschi TF. Defining the morphology and mechanism of the hemoglobin transport pathway in Plasmodium falciparum-infected erythrocytes. Eukaryotic Cell. 2015;14:415–426. doi: 10.1128/EC.00267-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kapishnikov S, et al. Mode of action of quinoline antimalarial drugs in red blood cells infected by Plasmodium falciparum revealed in vivo. Proc. Natl. Acad. Sci. USA. 2019;116:22946–22952. doi: 10.1073/pnas.1910123116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Yayon A, Cabantchik ZI, Ginsburg H. Identification of the acidic compartment of Plasmodium falciparum-infected human erythrocytes as the target of the antimalarial drug chloroquine. EMBO J. 1984;3:2695–2700. doi: 10.1002/j.1460-2075.1984.tb02195.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Krogstad DJ, Gluzman IY, Herwaldt BL, Schlesinger PH, Wellems TE. Energy dependence of chloroquine accumulation and chloroquine efflux in Plasmodium falciparum. Biochem. Pharmacol. 1992;43:57–62. doi: 10.1016/0006-2952(92)90661-2. [DOI] [PubMed] [Google Scholar]
- 7.Geng Y, Kohli L, Klocke BJ, Roth KA. Chloroquine-induced autophagic vacuole accumulation and cell death in glioma cells is p53 independent. Neuro-oncology. 2010;12:473–481. doi: 10.1093/neuonc/nop048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Goldberg DE. Hemoglobin degradation in Plasmodium-infected red blood cells. Semin. Cell Biol. 1993;4:355–361. doi: 10.1006/scel.1993.1042. [DOI] [PubMed] [Google Scholar]
- 9.Su X, Kirkman LA, Fujioka H, Wellems TE. Complex polymorphisms in an approximately 330 kDa protein are linked to chloroquine-resistant P. falciparum in Southeast Asia and Africa. Cell. 1997;91:593–603. doi: 10.1016/S0092-8674(00)80447-X. [DOI] [PubMed] [Google Scholar]
- 10.Fidock DA, et al. Mutations in the P. falciparum digestive vacuole transmembrane protein PfCRT and evidence for their role in chloroquine resistance. Mol. Cell. 2000;6:861–871. doi: 10.1016/S1097-2765(05)00077-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zhang, M. et al. Uncovering the essential genes of the human malaria parasite Plasmodium falciparum by saturation mutagenesis. Science360 (2018). [DOI] [PMC free article] [PubMed]
- 12.Wellems TE, Hayton K, Fairhurst RM. The impact of malaria parasitism: from corpuscles to communities. J. Clin. Invest. 2009;119:2496–2505. doi: 10.1172/JCI38307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Martin RE, et al. Chloroquine transport via the malaria parasite’s chloroquine resistance transporter. Science. 2009;325:1680–1682. doi: 10.1126/science.1175667. [DOI] [PubMed] [Google Scholar]
- 14.Gabryszewski SJ, et al. Evolution of Fitness Cost-Neutral Mutant PfCRT Conferring P. falciparum 4-Aminoquinoline Drug Resistance Is Accompanied by Altered Parasite Metabolism and Digestive Vacuole Physiology. PLOS Pathogens. 2016;12:e1005976. doi: 10.1371/journal.ppat.1005976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lewis IA, et al. Metabolic QTL analysis links chloroquine resistance in Plasmodium falciparum to impaired hemoglobin catabolism. PLoS Genet. 2014;10:e1004085. doi: 10.1371/journal.pgen.1004085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Sidhu ABS, Verdier-Pinard D, Fidock DA. Chloroquine resistance in Plasmodium falciparum malaria parasites conferred by pfcrt mutations. Science. 2002;298:210–213. doi: 10.1126/science.1074045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cooper RA, et al. Alternative mutations at position 76 of the vacuolar transmembrane protein PfCRT are associated with chloroquine resistance and unique stereospecific quinine and quinidine responses in Plasmodium falciparum. Mol. Pharmacol. 2002;61:35–42. doi: 10.1124/mol.61.1.35. [DOI] [PubMed] [Google Scholar]
- 18.Valderramos SG, et al. Identification of a mutant PfCRT-mediated chloroquine tolerance phenotype in Plasmodium falciparum. PLoS Pathog. 2010;6:e1000887. doi: 10.1371/journal.ppat.1000887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ross LS, et al. Emerging Southeast Asian PfCRT mutations confer Plasmodium falciparum resistance to the first-line antimalarial piperaquine. Nature Communications. 2018;9:3314. doi: 10.1038/s41467-018-05652-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dhingra SK, et al. Global Spread of Mutant PfCRT and Its Pleiotropic Impact on Plasmodium falciparum Multidrug Resistance and Fitness. mBio. 2019;10:e02731–18. doi: 10.1128/mBio.02731-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.van der Pluijm Rob W, Imwong Mallika, Chau Nguyen Hoang, Hoa Nhu Thi, Thuy-Nhien Nguyen Thanh, Thanh Ngo Viet, Jittamala Podjanee, Hanboonkunupakarn Borimas, Chutasmit Kitipumi, Saelow Chalermpon, Runjarern Ratchadaporn, Kaewmok Weerayuth, Tripura Rupam, Peto Thomas J, Yok Sovann, Suon Seila, Sreng Sokunthea, Mao Sivanna, Oun Savuth, Yen Sovannary, Amaratunga Chanaki, Lek Dysoley, Huy Rekol, Dhorda Mehul, Chotivanich Kesinee, Ashley Elizabeth A, Mukaka Mavuto, Waithira Naomi, Cheah Phaik Yeong, Maude Richard J, Amato Roberto, Pearson Richard D, Gonçalves Sónia, Jacob Christopher G, Hamilton William L, Fairhurst Rick M, Tarning Joel, Winterberg Markus, Kwiatkowski Dominic P, Pukrittayakamee Sasithon, Hien Tran Tinh, Day Nicholas PJ, Miotto Olivo, White Nicholas J, Dondorp Arjen M. Determinants of dihydroartemisinin-piperaquine treatment failure in Plasmodium falciparum malaria in Cambodia, Thailand, and Vietnam: a prospective clinical, pharmacological, and genetic study. The Lancet Infectious Diseases. 2019;19(9):952–961. doi: 10.1016/S1473-3099(19)30391-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bellanca S, et al. Multiple drugs compete for transport via the Plasmodium falciparum chloroquine resistance transporter at distinct but interdependent sites. J. Biol. Chem. 2014;289:36336–36351. doi: 10.1074/jbc.M114.614206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Richards SN, et al. Molecular Mechanisms for Drug Hypersensitivity Induced by the Malaria Parasite’s Chloroquine Resistance Transporter. PLoS Pathog. 2016;12:e1005725. doi: 10.1371/journal.ppat.1005725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Martin RE, Kirk K. The malaria parasite’s chloroquine resistance transporter is a member of the drug/metabolite transporter superfamily. Mol. Biol. Evol. 2004;21:1938–1949. doi: 10.1093/molbev/msh205. [DOI] [PubMed] [Google Scholar]
- 25.Juge N, et al. Plasmodium falciparum chloroquine resistance transporter is a H+-coupled polyspecific nutrient and drug exporter. Proc Natl Acad Sci USA. 2015;112:3356–3361. doi: 10.1073/pnas.1417102112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Bakouh N, et al. Iron is a substrate of the Plasmodium falciparum chloroquine resistance transporter PfCRT in Xenopus oocytes. J. Biol. Chem. 2017;292:16109–16121. doi: 10.1074/jbc.M117.805200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Maughan SC, et al. Plant homologs of the Plasmodium falciparum chloroquine-resistance transporter, PfCRT, are required for glutathione homeostasis and stress responses. Proc. Natl. Acad. Sci. USA. 2010;107:2331–2336. doi: 10.1073/pnas.0913689107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Patzewitz E-M, et al. Glutathione transport: a new role for PfCRT in chloroquine resistance. Antioxid. Redox Signal. 2013;19:683–695. doi: 10.1089/ars.2012.4625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Zhang H, Howard EM, Roepe PD. Analysis of the antimalarial drug resistance protein Pfcrt expressed in yeast. J. Biol. Chem. 2002;277:49767–49775. doi: 10.1074/jbc.M204005200. [DOI] [PubMed] [Google Scholar]
- 30.Zhang H, Paguio M, Roepe PD. The antimalarial drug resistance protein Plasmodium falciparum chloroquine resistance transporter binds chloroquine. Biochemistry. 2004;43:8290–8296. doi: 10.1021/bi049137i. [DOI] [PubMed] [Google Scholar]
- 31.Nessler S, et al. Evidence for activation of endogenous transporters in Xenopus laevis oocytes expressing the Plasmodium falciparum chloroquine resistance transporter, PfCRT. J. Biol. Chem. 2004;279:39438–39446. doi: 10.1074/jbc.M404671200. [DOI] [PubMed] [Google Scholar]
- 32.Lee Y, et al. Structure of the triose-phosphate/phosphate translocator reveals the basis of substrate specificity. Nat Plants. 2017;3:825–832. doi: 10.1038/s41477-017-0022-8. [DOI] [PubMed] [Google Scholar]
- 33.Mitchell P. A General Theory of Membrane Transport From Studies of Bacteria. Nature. 1957;180:134–136. doi: 10.1038/180134a0. [DOI] [PubMed] [Google Scholar]
- 34.Jardetzky O. Simple Allosteric Model for Membrane Pumps. Nature. 1966;211:969–970. doi: 10.1038/211969a0. [DOI] [PubMed] [Google Scholar]
- 35.Kim Jonathan, Tan Yong Zi, Wicht Kathryn J., Erramilli Satchal K., Dhingra Satish K., Okombo John, Vendome Jeremie, Hagenah Laura M., Giacometti Sabrina I., Warren Audrey L., Nosol Kamil, Roepe Paul D., Potter Clinton S., Carragher Bridget, Kossiakoff Anthony A., Quick Matthias, Fidock David A., Mancia Filippo. Structure and drug resistance of the Plasmodium falciparum transporter PfCRT. Nature. 2019;576(7786):315–320. doi: 10.1038/s41586-019-1795-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Parker JL, Newstead S. Structural basis of nucleotide sugar transport across the Golgi membrane. Nature. 2017;551:521–524. doi: 10.1038/nature24464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Huang Y-F, Golding GB. FuncPatch: a web server for the fast Bayesian inference of conserved functional patches in protein 3D structures. Bioinformatics. 2015;31:523–531. doi: 10.1093/bioinformatics/btu673. [DOI] [PubMed] [Google Scholar]
- 38.Carlton JM. Evolution of human malaria. Nat Microbiol. 2018;3:642–643. doi: 10.1038/s41564-018-0170-2. [DOI] [PubMed] [Google Scholar]
- 39.Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
- 40.Kuhn Y, et al. Trafficking of the phosphoprotein PfCRT to the digestive vacuolar membrane in Plasmodium falciparum. Traffic. 2010;11:236–249. doi: 10.1111/j.1600-0854.2009.01018.x. [DOI] [PubMed] [Google Scholar]
- 41.Tsuchiya H, et al. Structural basis for amino acid export by DMT superfamily transporter YddG. Nature. 2016;534:417–420. doi: 10.1038/nature17991. [DOI] [PubMed] [Google Scholar]
- 42.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc. 2015;10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 2005;33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Nikolaev DM, et al. A Comparative Study of Modern Homology Modeling Algorithms for Rhodopsin Structure Prediction. ACS Omega. 2018;3:7555–7566. doi: 10.1021/acsomega.8b00721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Yen MR, Chen JS, Marquez JL, Sun EI, Saier MH. Multidrug resistance: phylogenetic characterization of superfamilies of secondary carriers that include drug exporters. Methods Mol. Biol. 2010;637:47–64. doi: 10.1007/978-1-60761-700-6_3. [DOI] [PubMed] [Google Scholar]
- 46.Chaptal V, et al. Crystal structure of lactose permease in complex with an affinity inactivator yields unique insight into sugar recognition. Proc Natl Acad Sci USA. 2011;108:9361–9366. doi: 10.1073/pnas.1105687108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Huang Y, Lemieux MJ, Song J, Auer M, Wang D-N. Structure and mechanism of the glycerol-3-phosphate transporter from Escherichia coli. Science. 2003;301:616–620. doi: 10.1126/science.1087619. [DOI] [PubMed] [Google Scholar]
- 48.Yin Y, He X, Szewczyk P, Nguyen T, Chang G. Structure of the Multidrug Transporter EmrD from Escherichia coli. Science. 2006;312:741–744. doi: 10.1126/science.1125629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Lolkema JS, Slotboom D-J. Estimation of structural similarity of membrane proteins by hydropathy profile alignment. Molecular Membrane Biology. 1998;15:33–42. doi: 10.3109/09687689809027516. [DOI] [PubMed] [Google Scholar]
- 50.Stamm M, Staritzbichler R, Khafizov K, Forrest LR. AlignMe–a membrane protein sequence alignment web server. Nucleic Acids Res. 2014;42:W246–251. doi: 10.1093/nar/gku291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Colovos C, Yeates TO. Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci. 1993;2:1511–1519. doi: 10.1002/pro.5560020916. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 2007;35:W407–410. doi: 10.1093/nar/gkm290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Studer G, Biasini M, Schwede T. Assessing the local structural quality of transmembrane protein models using statistical potentials (QMEANBrane) Bioinformatics. 2014;30:i505–511. doi: 10.1093/bioinformatics/btu457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Vriend G. WHAT IF: A molecular modeling and drug design program. Journal of Molecular Graphics. 1990;8:52–56. doi: 10.1016/0263-7855(90)80070-V. [DOI] [PubMed] [Google Scholar]
- 55.Holm L, Rosenström P. Dali server: conservation mapping in 3D. Nucleic Acids Res. 2010;38:W545–549. doi: 10.1093/nar/gkq366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kozma D, Simon I, Tusnády GE. CMWeb: an interactive on-line tool for analysing residue-residue contacts and contact prediction methods. Nucleic Acids Res. 2012;40:W329–333. doi: 10.1093/nar/gks488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Pettersen EF, et al. UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 58.Yang Z, Swanson WJ. Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes. Mol. Biol. Evol. 2002;19:49–57. doi: 10.1093/oxfordjournals.molbev.a003981. [DOI] [PubMed] [Google Scholar]
- 59.Lee AH, et al. Evidence for Regulation of Hemoglobin Metabolism and Intracellular Ionic Flux by the Plasmodium falciparum Chloroquine Resistance Transporter. Sci Rep. 2018;8:13578. doi: 10.1038/s41598-018-31715-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Coppée R, Jeffares DC, Miteva MA, Sabbagh A, Clain J. Comparative structural and evolutionary analyses predict functional sites in the artemisinin resistance malaria protein K13. Sci Rep. 2019;9:1–17. doi: 10.1038/s41598-019-47034-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Ashkenazy H, et al. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44:W344–350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Dhingra, S. K. et al. A Variant PfCRT Isoform Can Contribute to Plasmodium falciparum Resistance to the First-Line Partner Drug Piperaquine. MBio8 (2017). [DOI] [PMC free article] [PubMed]
- 63.Pelleau S, et al. Adaptive evolution of malaria parasites in French Guiana: Reversal of chloroquine resistance by acquisition of a mutation in pfcrt. Proc. Natl. Acad. Sci. USA. 2015;112:11672–11677. doi: 10.1073/pnas.1507142112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Sanchez CP, Stein W, Lanzer M. Trans stimulation provides evidence for a drug efflux carrier as the mechanism of chloroquine resistance in Plasmodium falciparum. Biochemistry. 2003;42:9383–9394. doi: 10.1021/bi034269h. [DOI] [PubMed] [Google Scholar]
- 65.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 66.Bhat B, Ganai NA, Andrabi SM, Shah RA, Singh A. TM-Aligner: Multiple sequence alignment tool for transmembrane proteins with reduced time and improved accuracy. Sci Rep. 2017;7:12543. doi: 10.1038/s41598-017-13083-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Ng PC, Henikoff JG, Henikoff S. PHAT: a transmembrane-specific substitution matrix. Predicted hydrophobic and transmembrane. Bioinformatics. 2000;16:760–766. doi: 10.1093/bioinformatics/16.9.760. [DOI] [PubMed] [Google Scholar]
- 68.Hall T. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series. 1999;41:95–98. [Google Scholar]
- 69.Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609–W612. doi: 10.1093/nar/gkl315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Guindon S, et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 2010;59:307–321. doi: 10.1093/sysbio/syq010. [DOI] [PubMed] [Google Scholar]
- 71.Lefort V, Longueville J-E, Gascuel O. SMS: Smart Model Selection in PhyML. Mol Biol Evol. 2017;34:2422–2424. doi: 10.1093/molbev/msx149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Anisimova M, Bielawski JP, Yang Z. Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol. Biol. Evol. 2001;18:1585–1592. doi: 10.1093/oxfordjournals.molbev.a003945. [DOI] [PubMed] [Google Scholar]
- 73.Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44:W242–W245. doi: 10.1093/nar/gkw290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Goldman N, Yang Z. A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol. Biol. Evol. 1994;11:725–736. doi: 10.1093/oxfordjournals.molbev.a040153. [DOI] [PubMed] [Google Scholar]
- 75.Nielsen R, Yang Z. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics. 1998;148:929–936. doi: 10.1093/genetics/148.3.929. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Yang Z. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol. Biol. Evol. 1998;15:568–573. doi: 10.1093/oxfordjournals.molbev.a025957. [DOI] [PubMed] [Google Scholar]
- 77.Yang Z, Wong WSW, Nielsen R. Bayes empirical bayes inference of amino acid sites under positive selection. Mol. Biol. Evol. 2005;22:1107–1118. doi: 10.1093/molbev/msi097. [DOI] [PubMed] [Google Scholar]
- 78.Vuong QH. Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses. Econometrica. 1989;57:307–333. doi: 10.2307/1912557. [DOI] [Google Scholar]
- 79.Yang Z, Nielsen R, Goldman N, Pedersen AM. Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics. 2000;155:431–449. doi: 10.1093/genetics/155.1.431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Webb B, Sali A. Comparative Protein Structure Modeling Using MODELLER. Curr Protoc Bioinformatics. 2016;54:5.6.1–5.6.37. doi: 10.1002/cpbi.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Heo L, Park H, Seok C. GalaxyRefine: Protein structure refinement driven by side-chain repacking. Nucleic Acids Res. 2013;41:W384–388. doi: 10.1093/nar/gkt458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Krieger E, et al. Improving physical realism, stereochemistry, and side-chain accuracy in homology modeling: Four approaches that performed well in CASP8. Proteins. 2009;77(Suppl 9):114–122. doi: 10.1002/prot.22570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Chen VB, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr D Biol Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Baker NA, Sept D, Joseph S, Holst MJ, McCammon JA. Electrostatics of nanosystems: application to microtubules and the ribosome. Proc. Natl. Acad. Sci. USA. 2001;98:10037–10041. doi: 10.1073/pnas.181342398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Dolinsky TJ, Nielsen JE, McCammon JA, Baker NA. PDB2PQR: an automated pipeline for the setup of Poisson-Boltzmann electrostatics calculations. Nucleic Acids Res. 2004;32:W665–667. doi: 10.1093/nar/gkh381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Tian W, Chen C, Lei X, Zhao J, Liang J. CASTp 3.0: computed atlas of surface topography of proteins. Nucleic Acids Res. 2018;46:W363–W367. doi: 10.1093/nar/gky473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Echave J, Spielman SJ, Wilke CO. Causes of evolutionary rate variation among protein sites. Nat. Rev. Genet. 2016;17:109–121. doi: 10.1038/nrg.2015.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Durrand V, et al. Variations in the sequence and expression of the Plasmodium falciparum chloroquine resistance transporter (Pfcrt) and their relationship to chloroquine resistance in vitro. Mol. Biochem. Parasitol. 2004;136:273–285. doi: 10.1016/j.molbiopara.2004.03.016. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.