Skip to main content
F1000Research logoLink to F1000Research
. 2016 Aug 12;5:1964. [Version 1] doi: 10.12688/f1000research.9372.1

South-East Asian strains of Plasmodium falciparum display higher ratio of non-synonymous to synonymous polymorphisms compared to African strains

Gajinder Pal Singh 1,a, Amit Sharma 1
PMCID: PMC5089136  PMID: 27853513

Abstract

Resistance to frontline anti-malarial drugs, including artemisinin, has repeatedly arisen in South-East Asia, but the reasons for this are not understood. Here we test whether evolutionary constraints on Plasmodium falciparum strains from South-East Asia differ from African strains. We find a significantly higher ratio of non-synonymous to synonymous polymorphisms in P. falciparum from South-East Asia compared to Africa, suggesting differences in the selective constraints on P. falciparum genome in these geographical regions. Furthermore, South-East Asian strains showed a higher proportion of non-synonymous polymorphism at conserved positions, suggesting reduced negative selection. There was a lower rate of mixed infection by multiple genotypes in samples from South-East Asia compared to Africa. We propose that a lower mixed infection rate in South-East Asia reduces intra-host competition between the parasite clones, reducing the efficiency of natural selection. This might increase the probability of fixation of fitness-reducing mutations including drug resistant ones.

Keywords: artemisinin resistance, resistance evolution, Plasmodium falciparum, non-synonymous polymorphism

Introduction

Artemisinin combination therapy (ACT) is the frontline treatment for malaria caused by Plasmodium falciparum and has played a major role in reducing malaria mortality from an estimated 840,000 deaths in the year 2000 to 440,000 deaths in the year 2015 1. The emergence and spread of artemisinin resistance in South-East Asia, however, poses a serious threat to malaria control, and the containment of artemisinin resistance is thus a global public heath priority 28.

One of the most important unanswered questions in anti-malarial drug resistance is why it has repeatedly emerged in South-East Asia 3, 4, 6, 9. The resistance to chloroquine was first reported in South-East Asia in 1957 before spreading to India and Africa where it resulted in the significant increase in malaria child mortality possibly killing millions of children 1012. The resistance to sulphadoxine-pyrimethamine also emerged in South-East Asia in the late 1960s following a similar route to India and Africa 9. Worryingly, the resistance to artemisinin has emerged independently at multiple places in South-East Asia 1317 and is now present 25 km from the Indian border 17 threatening to follow the same trajectory as resistance to previous anti-malarial drugs. Improved understanding of the process of how and why anti-malarial drug resistance emerges in South-East Asia could provide critical information in developing strategies to prevent the spread of the current wave of artemisinin resistance.

Here we ask whether there are evolutionary constraints on P. falciparum strains from South-East Asia that differ from African strains and thus might explain the higher predisposition of South-East Asia strains to evolve drug resistance. To answer this question we utilized a recent large global genome sequencing data from ~3400 clinical samples which identified nearly million high-quality single nucleotide polymorphisms (SNPs) in the exonic regions of P. falciparum 18.

Results

Higher ratio of non-synonymous to synonymous polymorphism in P. falciparum from South-East Asia

Resistance to anti-malarial drugs often involves changes in the amino-acid sequence within specific proteins. Thus, we tested whether the ratio of non-synonymous (amino acid changing) to synonymous polymorphism is higher in South-East Asia (SEA). Figure 1 shows a significantly higher ratio of non-synonymous to synonymous polymorphism (N/S) in SEA samples compared to African samples with almost no overlap in their distributions. The mean and median N/S for samples from SEA were 2.33, compared to 2.06 for Africa (Wilcox test p-value 0, number SEA samples 1600, and number Africa samples 1647). The higher N/S in SEA compared to Africa was also evident at the gene level with a larger number of genes showing higher N/S in SEA than in Africa ( Figure 2). Mean and median N/S for genes in SEA samples were 2.1 and 1.9 respectively, while for African samples the mean and median N/S were 1.9 and 1.8 respectively (paired t-test p-value 1E-43, paired Wilcox-test p-value 4E-27, n = 4792). There were 75 genes with more than 3-fold higher N/S in SEA samples relative to African samples and N/S of more than four in SEA. Interestingly, most of these genes were not related to antigenic variation ( Supplementary Table 1), but perform basic housekeeping functions, suggesting that higher N/S of these genes in SEA might not be primarily driven by differential host immune selection. In addition to Kelch13, -the only gene known to be causally associated with artemisinin resistance- the list includes CRT (chloroquine-resistance transporter) which shows an 8-fold higher N/S in SEA samples compared to African samples and has previously been shown to be associated with artemisinin resistance in a genome-wide association studies (GWAS) study 15. In summary, P. falciparum strains from SEA show a higher ratio of non-synonymous to synonymous polymorphisms than African strains.

Figure 1. Higher ratio of non-synonymous to synonymous polymorphism in P. falciparum samples from SEA.

Figure 1.

The ratios of non-synonymous to synonymous polymorphism (N/S) for 3394 samples from 22 countries are shown. The y-axis is truncated at the top with 13 samples not shown.

Figure 2. P. falciparum genes show a higher rate N/S in SEA compared to Africa.

Figure 2.

The scatter-plot shows N/S in SEA and Africa for 4792 genes. Genes previously associated with artemisinin resistance in a GWAS study 15 are shown in red, with Kelch13 and chloroquine-resistance transporter (CRT) labelled. The diagonal line is shown and the numbers of genes on both sides of the diagonal are indicated. The x and y-axes are truncated with 28 genes not shown.

Higher non-synonymous changes at the conserved positions in South-East Asia

Highly conserved proteins in P. falciparum show a much lower N/S, indicating lower tolerance for non-synonymous polymorphism 18. We tested whether the correlation between N/S and protein conservation might be different in SEA and Africa. The correlation between N/S and conservation was much weaker in SEA ( Figure 3) with Pearson correlation of -0.43 (95% CI: -0.41 to -0.46) compared to -0.69 (95% CI: -0.68 to -0.71) in Africa. The lower correlation in SEA suggests a higher ratio of non-synonymous to synonymous changes at conserved positions. Indeed, non-synonymous polymorphisms specifically observed in SEA are more likely to occur at conserved positions compared to those specific to Africa ( Figure 4). Samples from SEA show higher N/S compared to Africa when considering only conserved positions ( Figure 5). These results suggest a lower efficiency of negative selection in SEA in removing potentially deleterious mutations. This may be important for the acquisition of antimalarial drug resistance since these mutations preferentially occur at the conserved sites 19, e.g. artemisinin resistance mutations in Klech13 occur in the conserved region of the protein 18, resistance mutations also occur in the conserved regions in DHFR (dihydrofolate reductase), DHPS (dihydropteroate synthase), and CRT (chloroquine-resistance transporter) 19. In summary, P. falciparum strains from SEA show a higher ratio of non-synonymous to synonymous polymorphisms at conserved sites in the protein sequences than African strains.

Figure 3. Lower correlation between N/S and protein conservation in SEA.

Figure 3.

A) Scatter-plot of N/S in Africa and percent protein conservation and B) Scatter-plot of N/S in SEA and percent protein conservation. Percent conservation for each protein is the percent of residues identical across orthologs in seven Plasmodium species ( P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii). Only proteins with orthologs in all Plasmodium species are shown (4075 proteins). Y-axis is truncated with 7 points not shown in Figure 3a and 112 points not shown in Figure 3b.

Figure 4. Non-synonymous polymorphisms specifically observed in SEA are more likely to occur at conserved positions.

Figure 4.

A) Conserved sites were defined as sites identical across orthologs in Plasmodium species ( P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii) in multiple sequence alignment. B) Conserved sites were defined as sites identical across orthologs in diverse eukaryotes ( S. cerevisiae, D. melanogaster, C. elegans, H. sapiens) in multiple sequence alignment. Error bars indicate 95% confidence intervals of the mean from 1,000 bootstrap samples.

Figure 5. Higher ratio of non-synonymous to synonymous polymorphism at conserved sites in P. falciparum from SEA.

Figure 5.

A) Box-plot showing N/S for 3394 samples from 22 countries at sites identical across orthologs in seven Plasmodium species in multiple sequence alignment. B) Box-plot showing the N/S ratio at sites identical across orthologs in diverse eukaryotes ( S. cerevisiae, D. melanogaster, C. elegans and H. sapiens) in multiple sequence alignment. The y-axis is truncated at the top with 10 samples not shown in both panels.

Lower mixed infection rate in South-East Asia

Blood samples may contain more than one haploid parasite clone due to mixed infections by multiple genotypes. The rate of mixed infection is generally lower in areas of low-transmission such as SEA 20. The lower efficiency of negative selection in removing potentially deleterious mutations at conserved positions in SEA could result from lower competition between parasite clones in the hosts. Indeed, the estimated rate of mixed infections, detected by a high proportion of heterozygous calls in the sequencing data, was much lower in South-East Asia compared to Africa ( Figure 6).

Figure 6. Lower estimated rate of mixed infections in samples from SEA.

Figure 6.

Mixed infection defined as samples with >10% SNP calls as heterozygous. This cut-off was determined from the distribution of heterozygous SNPs across the samples ( Supplementary Figure 2). The numbers of samples from each country are shown at the top of bar plots.

Discussion

Based on our observations that P. falciparum from SEA shows 1) a higher ratio of non-synonymous to synonymous polymorphism 2) a higher proportion of non-synonymous polymorphism at conserved sites and 3) a lower rate of mixed infections, we propose a model for the higher propensity of SEA populations to acquire drug resistance ( Supplementary Figure 1). Lower mixed infections in SEA may allow even less-fit parasites to be transmitted to the next set of hosts due to reduced level of intra-host competition between multiple genotypes. Thus, fitness-reducing mutations including drug-resistance mutations might have a higher chance of spreading in SEA. In contrast, the higher mixed infection rate in Africa may drive more intense competition between genotypes within the host, and may therefore reduce the probability of transmission of less-fit parasites.

This model can explain some surprising observations and predicts conditions under which resistance might spread to Africa. The much larger population size of P. falciparum in Africa, as also evidenced by high rate of mixed infection ( Figure 6) should make it easier for resistance mutation to appear. Indeed, artemisinin resistance mutations in Kelch13 were observed in 10 samples from Africa, including two samples with the most common artemisinin resistance mutation C580Y 18. The C580Y mutation is capable of generating artemisinin resistance in vitro in the NF54 parasite strain considered to be of African origin 21. This raises an important question as to why artemisinin resistance is not spreading in Africa. Since artemisinin resistance is likely to incur a fitness cost in the drug-free environment 2224, we propose that strains with these mutations are continuously arising in Africa but get competitively removed by the more fit drug-sensitive strains 23. However once a strain acquires compensatory mutations that may reduce the fitness cost of the original mutation, it may be able to spread in a more competitive environment in Africa. While compensatory mutations can occur anywhere in the genome and may even spread in South-East Asia, these could be unlinked by recombination in areas with high transmission rate such as Africa 25. Thus, compensatory mutations in the same gene might be more likely to spread in high-transmission areas. Indeed, drug-resistance genes often acquire multiple mutations before spreading to Africa, e.g. pyrimethamine resistance gene dhfr acquired at least three different mutations in South-East Asia before it spread to Africa 9. All chloroquine resistance strains have the K76T mutation in CRT (chloroquine-resistance transporter) but are accompanied by a number of mutations in the same gene 26. While at present Klech13 does not appear to have multiple mutations 27, it would be critical to monitor the acquisition of additional mutations in the Kelch13 protein which might compensate the fitness cost of Kelch13 resistant mutations in the drug-free environment.

It is important to note that higher N/S in SEA populations does not necessarily imply higher mutation rate. Brown et al. previously found similar substitution rates in samples from Africa and SEA 28. Thus substitution rate in SEA population appears to be similar to that of African populations, but a higher fraction of those substitutions occur at conserved non-synonymous positions in SEA populations. The MalariGEN study from where we obtained the dataset reported much higher density (per sample) of both synonymous and non-synonymous polymorphisms in Africa compared to SEA 18. It is also important to note that higher density of SNP/sample does not imply higher substitution rate in Africa, rather it reflects the higher rate of mixed infection in Africa, i.e. more SNPs are identified in samples from Africa because of the higher number of different parasite clones per samples ( Figure 6). While our data suggest that differences in the selective constraints on P. falciparum strains from SEA and Africa may contribute towards the higher propensity for resistance emergence and spread in SEA, other factors such as differences in the drug pressure, usage of artemisinin mono-therapy, social factors and host immune factors may also be important 3, 4.

Resistance to chloroquine and sulphadoxine-pyrimethamine spread from SEA to India to Africa 4. Interestingly we observed higher mixed infection rates in Bangladesh than in neighboring SEA. The Indian subcontinent has areas with widely variable transmission rates 1. This might allow drug-resistant P. falciparum evolved in low transmission areas in SEA to gradually adapt to higher transmission areas in the Indian subcontinent, which could then spread to the high transmission areas in Africa. Therefore, it would be critical to track the spread of artemisinin resistance in the Indian subcontinent.

Methods

The SNP data of P. falciparum was obtained from the MalariaGen community webpage ( https://www.malariagen.net/data/p-falciparum-community-project-jan-2016-data-release) 18. The SNP data consist of filtered and high quality 939,687 exonic SNPs with 631,715 non-synonymous and 307,972 synonymous SNPs. The data comprised 3,394 samples from 22 countries, with roughly equal number of samples from South-East Asia (1,600 samples) and Africa (1,647 samples). The N/S ratio for each sample was obtained by dividing the number of non-synonymous SNPs by number of synonymous SNPs in that sample. Proteome sequences of P. falciparum, P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii were downloaded from PlasmoDB database and proteome sequences of S. cerevisiae, D. melanogaster, C. elegans and H. sapiens were downloaded from European Bioinformatics Institute (EBI) database. Orthologous sequences were identified using best bidirectional hit algorithm 29 and aligned using ClustalO 30. The conservation score for P. falciparum proteins was calculated as the percentage of positions identical across all orthologous proteins from Plasmodium species. The N/S ratio for each gene in South-East Asia and Africa was calculated by dividing the number of unique non-synonymous SNPs by the number of unique synonymous SNPs across samples from the two geographical areas. The Pearson correlation between N/S for each gene and the conservation score was calculated in R. All figures were created in R version 3.2.3. Mixed infection samples were defined as samples with >10% SNP calls as heterozygous. This cut-off was determined from the distribution of heterozygous SNPs across the samples ( Supplementary Figure 2). It is important to note that this method is not likely to accurately classify each sample into a polyclonal (mixed infection) or a monoclonal sample, but the overall trend of higher rate of mixed infection in African samples compared to SEA samples is likely to be robust.

Data availability

The data referenced by this article are under copyright with the following copyright statement: Copyright: © 2016 Singh GP and Sharma A

This publication uses data from the MalariaGEN Plasmodium falciparum Community Project as described in Genomic epidemiology of artemisinin resistant malaria, eLife, 2016 (DOI: http://dx.doi.org/10.7554/eLife.08714). This data is also available from the MalariaGEN website ( https://www.malariagen.net/data/p-falciparum-community-project-jan-2016-data-release).

Acknowledgements

Genome sequencing was performed by the Wellcome Trust Sanger Institute and the Community Project is coordinated by the MalariaGEN Resource Centre with funding from the Wellcome Trust (098051, 090770).

Funding Statement

The work is supported by J. C. Bose Fellowship to A.S. by Department of Science and Technology, Govt. of India and by an Early Career Fellowship to G.P.S. by the Wellcome Trust/DBT India Alliance (IA/E/15/1/502297).

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 1; referees: 3 approved with reservations

Supplementary material

Supplementary Figure 1. A model for the higher propensity of SEA populations to acquire drug resistance.

Supplementary Figure 1.

Lower mixed infections in SEA may allow even less-fit parasites to be transmitted to next set of hosts due to reduced level of intra-host inter-parasite competition. Thus, fitness-reducing mutations including drug-resistance mutations might have a higher chance of spreading in SEA. In contrast, the higher mixed infection rate in Africa may drive more intense inter-clone competition within the host, thereby reducing the probability of transmission of the less-fit clones, including potentially drug-resistant clones. The public domain image of the outline of the human body was obtained from https://en.wikipedia.org/wiki/File:Outline-body.png.

Supplementary Figure 2. The frequency distribution of percentage heterozygous calls in samples.

Supplementary Figure 2.

About half of the 3,394 samples showed percentage heterozygous calls less than ten percent and were defined as monoclonal. Rest of the samples showed roughly uniform distribution of percentage heterozygous calls and were defined as polyclonal (mixed infection).

Supplementary Table 1. List of functionally annotated genes with more than 3-fold higher N/S in SEA than Africa and N/S in SEA more than 4.

Geneld Description N/S SEA N/S Africa Fold Higher in SEA
PF3D7_0709000 chloroquine resistance transporter (CRT) 10.3 1.3 8.1
PF3D7_0302700 CDGSH iron-sulfur domain-containing protein 10.0 1.4 7.2
PF3D7_0816200 vacuolar protein-sorting protein VPS2 6.0 0.9 6.4
PF3D7_0410400 exosome complex component RRP4 (RRP4) 7.0 1.2 5.9
PF3D7_0922900 beta-ketoacyl-acyl carrier protein reductase (FabG) 11.0 2.0 5.5
PF3D7_0314100.1 vesicle transport v-SNARE protein 6.0 1.1 5.5
PF3D7_1124800 nuclear preribosomal assembly protein 8.0 1.5 5.4
PF3D7_0936000 ring-exported protein 2+(REX2) 13.0 2.6 5.1
PF3D7_0720000 3'-5'+exoribonuclease Csl4 homolog 6.0 1.2 5.1
PF3D7_0110900 adenylate kinase 8.0 1.7 4.8
PF3D7_1313700 septum formation protein MAF homologue 10.0 2.1 4.7
PF3D7_0304600 circumsporozoite (CS) protein (CSP) 12.0 2.7 4.5
PF3D7_0515100 rhomboid protease ROM9 (ROM9) 9.0 2.0 4.4
PF3D7_0109400 tubulin-specific chaperone a 7.0 1.6 4.3
PF3D7_0522700 iron-sulfur assembly protein (SufA) 8.0 1.9 4.1
PF3D7_0702300 sporozoite threonine and asparagine-rich protein (STARP) 15.5 3.9 4.0
PF3D7_1435000 cytochrome c+oxidase assembly protein 9.0 2.3 3.9
PF3D7_0109300 fatty acid elongation protein, GNS1/SUR4 family 4.5 1.2 3.9
PF3D7_1410100 alpha/beta hydrolase 5.3 1.4 3.9
PF3D7_0319600 elongation factor 1+(EF-1) 7.0 1.8 3.9
PF3D7_1143200 DnaJ protein 14.0 3.6 3.8
PF3D7_1028300 nucleolar preribosomal assembly protein 13.0 3.5 3.8
PF3D7_1358300 rhomboid protease ROM7 (ROM7) 5.5 1.5 3.7
PF3D7_1343700 kelch protein 4.2 1.1 3.7
PF3D7_0416100 glutamyl-tRNA(Gln) amidotransferase subunit A 8.0 2.2 3.6
PF3D7_0423400 asparagine-rich protein (AARP) 9.0 2.5 3.5
PF3D7_1210900 GPI mannosyltransferase I+(PIG-M) 5.0 1.4 3.5
PF3D7_1015800 ribonucleotide reductase small subunit 5.0 1.5 3.3
PF3D7_1217600 anaphase promoting complex subunit 10 6.0 1.9 3.2
PF3D7_1143300 DNA-directed RNA polymerase I 4.3 1.4 3.1
PF3D7_1405200 sybindin-like protein 5.0 1.6 3.1
PF3D7_1023400 HORMA domain protein 8.0 2.6 3.1
PF3D7_1434000 CCR4-associated factor 16 (CAF16) 6.5 2.1 3.1

References

  • 1. World Health Organization: World Malaria Report 2015.2015. Reference Source [Google Scholar]
  • 2. Ashley EA, Dhorda M, Fairhurst RM, et al. : Spread of artemisinin resistance in Plasmodium falciparum malaria. N Engl J Med. 2014;371(5):411–423. 10.1056/NEJMoa1314981 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Roberts L: Malaria wars. Science. 2016;352(6284):398–402, 404-5. 10.1126/science.352.6284.398 [DOI] [PubMed] [Google Scholar]
  • 4. Dondorp AM, Yeung S, White L, et al. : Artemisinin resistance: current status and scenarios for containment. Nat Rev Microbiol. 2010;8(4):272–280. 10.1038/nrmicro2331 [DOI] [PubMed] [Google Scholar]
  • 5. Dondorp AM, Ringwald P: Artemisinin resistance is a clear and present danger. Trends Parasitol. 2013;29(8):359–360. 10.1016/j.pt.2013.05.005 [DOI] [PubMed] [Google Scholar]
  • 6. White NJ: Artemisinin resistance--the clock is ticking. Lancet. 2010;376(9758):2051–2052. 10.1016/S0140-6736(10)61963-0 [DOI] [PubMed] [Google Scholar]
  • 7. World Health Organization: Emergency response to artemisinin resistance in the Greater Mekong subregion.2013. Reference Source [Google Scholar]
  • 8. World Health Organization: Update on artemisinin and ACT resistance.2015. Reference Source [Google Scholar]
  • 9. Roper C, Pearce R, Nair S, et al. : Intercontinental spread of pyrimethamine-resistant malaria. Science. 2004;305(5687):1124. 10.1126/science.1098876 [DOI] [PubMed] [Google Scholar]
  • 10. Korenromp EL, Williams BG, Gouws E, et al. : Measurement of trends in childhood malaria mortality in Africa: an assessment of progress toward targets based on verbal autopsy. Lancet Infect Dis. 2003;3(6):349–358. 10.1016/S1473-3099(03)00657-1 [DOI] [PubMed] [Google Scholar]
  • 11. Trape JF: The public health impact of chloroquine resistance in Africa. Am J Trop Med Hyg. 2001;64(1–2 Suppl):12–17. [DOI] [PubMed] [Google Scholar]
  • 12. Trape JF, Pison G, Preziosi MP, et al. : Impact of chloroquine resistance on malaria mortality. C R Acad Sci III. 1998;321(8):689–697. 10.1016/S0764-4469(98)80009-7 [DOI] [PubMed] [Google Scholar]
  • 13. Takala-Harrison S, Jacob CG, Arze C, et al. : Independent emergence of artemisinin resistance mutations among Plasmodium falciparum in Southeast Asia. J Infect Dis. 2015;211(5):670–679. 10.1093/infdis/jiu491 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Ye R, Hu D, Zhang Y, et al. : Distinctive origin of artemisinin-resistant Plasmodium falciparum on the China-Myanmar border. Sci Rep. 2016;6: 20100. 10.1038/srep20100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Miotto O, Amato R, Ashley EA, et al. : Genetic architecture of artemisinin-resistant Plasmodium falciparum. Nat Genet. 2015;47(3):226–234. 10.1038/ng.3189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Miotto O, Almagro-Garcia J, Manske M, et al. : Multiple populations of artemisinin-resistant Plasmodium falciparum in Cambodia. Nat Genet. 2013;45(6):648–655. 10.1038/ng.2624 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Tun KM, Imwong M, Lwin KM, et al. : Spread of artemisinin-resistant Plasmodium falciparum in Myanmar: a cross-sectional survey of the K13 molecular marker. Lancet Infect Dis. 2015;15(4):415–421. 10.1016/S1473-3099(15)70032-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. MalariaGEN Plasmodium falciparum Community Project: Genomic epidemiology of artemisinin resistant malaria. eLife. 2016;5:pii: e08714. 10.7554/eLife.08714 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Gardner KB, Sinha I, Bustamante LY, et al. : Protein-based signatures of functional evolution in Plasmodium falciparum. BMC Evol Biol. 2011;11:257. 10.1186/1471-2148-11-257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Anderson TJ, Haubold B, Williams JT, et al. : Microsatellite markers reveal a spectrum of population structures in the malaria parasite Plasmodium falciparum. Mol Biol Evol. 2000;17(10):1467–1482. 10.1093/oxfordjournals.molbev.a026247 [DOI] [PubMed] [Google Scholar]
  • 21. Ghorbal M, Gorman M, Macpherson CR, et al. : Genome editing in the human malaria parasite Plasmodium falciparum using the CRISPR-Cas9 system. Nat Biotechnol. 2014;32(8):819–821. 10.1038/nbt.2925 [DOI] [PubMed] [Google Scholar]
  • 22. Hott A, Tucker MS, Casandra D, et al. : Fitness of artemisinin-resistant Plasmodium falciparum in vitro. J Antimicrob Chemother. 2015;70(10):2787–2796. 10.1093/jac/dkv199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Wilairat P, Kumpornsin K, Chookajorn T: Plasmodium falciparum malaria: Convergent evolutionary trajectories towards delayed clearance following artemisinin treatment. Med Hypotheses. 2016;90:19–22. 10.1016/j.mehy.2016.02.022 [DOI] [PubMed] [Google Scholar]
  • 24. Chotivanich K, Tripura R, Das D, et al. : Laboratory detection of artemisinin-resistant Plasmodium falciparum. Antimicrob Agents Chemother. 2014;58(6):3157–3161. 10.1128/AAC.01924-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Volkman SK, Sabeti PC, DeCaprio D, et al. : A genome-wide map of diversity in Plasmodium falciparum. Nat Genet. 2007;39(1):113–119. 10.1038/ng1930 [DOI] [PubMed] [Google Scholar]
  • 26. Wellems TE, Plowe CV: Chloroquine-resistant malaria. J Infect Dis. 2001;184(6):770–776. 10.1086/322858 [DOI] [PubMed] [Google Scholar]
  • 27. Singh GP, Goel P, Sharma A: Structural mapping of Kelch13 mutations associated with artemisinin resistance in malaria. J Struct Funct Genomics. 2016;1–6. 10.1007/s10969-016-9205-1 [DOI] [PubMed] [Google Scholar]
  • 28. Brown TS, Jacob CG, Silva JC, et al. : Plasmodium falciparum field isolates from areas of repeated emergence of drug resistant malaria show no evidence of hypermutator phenotype. Infect Genet Evol. 2015;30:318–22. 10.1016/j.meegid.2014.12.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Wolf YI, Koonin EV: A tight link between orthologs and bidirectional best hits in bacterial and archaeal genomes. Genome Biol Evol. 2012;4(12):1286–1294. 10.1093/gbe/evs100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Sievers F, Wilm A, Dineen D, et al. : Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011;7:539. 10.1038/msb.2011.75 [DOI] [PMC free article] [PubMed] [Google Scholar]
F1000Res. 2016 Sep 21. doi: 10.5256/f1000research.10094.r16468

Referee response for version 1

Tim J Anderson 1

  • Title and Abstract

The title is appropriate.

The abstract makes some jumps in logic:  e.g Furthermore, South-East Asian strains showed a higher proportion of non-synonymous polymorphism at conserved positions, suggesting reduced negative selection.” This pattern can equally well be explained by stronger positive selection in SEA than in Africa.

 

  • Article content

 

Major points

Results differ from the prior analysis of this dataset. The article comes to different conclusions from those reported by the MalariaGEN consortium in eLife (ref 18) who write:  "Accordingly, we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions ( Figure 3c)” page 6 bottom: MalariaGEN Plasmodium falciparum Community Project. eLife 2016;5:e08714. DOI: 10.7554/eLife.08714). It is unclear to me why this F1000 submission and the published eLife paper reach very different conclusions using the same dataset? I suggest that the authors should cite the conclusions of the eLife paper and explain why their analysis comes to a very different conclusion. This point needs urgently needs satisfactory resolution as this is the central conclusion of the paper.

Methodology/analysis biases. Reduced ability to score non-synonymous variants in African populations could generate the N/S ratios observed. Such a bias could be generated because infections are more complex in Africa. Non-synonymous mutations tend to be at lower frequency than synonymous mutations. In complex mixed African infections low frequency non-synonymous mutations may be filtered out of the dataset (because very few reads show the mutation) but scored in SE Asian samples, where infections are simpler (so multiple reads show non-synonymous mutations). To eliminate this potential methodological bias, the authors should filter their data to minimize mixed infections prior to analysis. This could be done by setting a threshold for numbers of mixed base calls. Again, it is important to understand why the conclusions reached by this analysis differ from the published eLife paper.

As pointed out in the eLife paper there is a dramatic difference in the allele frequency spectrum in the two locations. The excess of rare variants in Africa could also potentially alter the N/S ratios.

Population genetics explanation. Assuming the central result is correct, a simpler model explains these results: The effective population size (Ne) of parasite populations is lower in SEA than in Africa. Therefore purifying selection is weaker in SEA than in Africa resulting in less efficient removal of deleterious mutations. Discussion of this simple alternative explanation would improve the paper. Also Cheeseman et al, MBE 2016 (Mol Biol Evol. 2016 Mar;33(3):603-20. doi: 10.1093/molbev/msv282.), who make this argument to explain patterns of global copy number polymorphism in Plasmodium falciparum.

Statistics used: The paper counts numbers of non-synonymous and synonymous mutations and estimates the ratio (N/S). This is an unusual way to present such data – it would be more informative to show dN/dS – (Nonsynonymous changes per non-synonymous site/synonymous variants per synonymous site). This statistic is more useful because the expected ratio under neutrality is 1. This should not change the results but will be more easily interpretable with reference to a neutral model.

Related questions: How is N/S ratio determined for genes in which there are NO synonymous mutations? Details of the methods for calculating this ratio, and how they differ from those used in the eLife paper should be provided.

More support needed for intrahost competition explanation: The authors argue that the observed pattern may result from more within infection competition in Africa relative to Asia. If competition is invoked to explain the results, then it would be useful to examine the categories of genes contributing to the excess of NS variations. This argument would imply that this effect should be seen in subsets of genes involved in within host competition. On the other hand, if there is no particular enrichment of particular gene classes, then the simple population genetics explanation seems more likely.

 

  • Conclusions

 

Lack of balance. The authors argue that drug resistance may arise in SEA rather than Africa because intrahost competition prevents emergence of resistance mutations associated with fitness costs. I agree in part – this may certainly contribute. However, greater selection for resistance to drugs in SE Asia is likely to be a critical factor. Most infections in SEA are symptomatic and infected individuals seek treatment. However, most infections in Africa are asymptomatic, so people to no seek treatment. Discussion of the difference in selection strength between continents would add balance to this paper.

Other issues. I was very surprised to see this analysis of MalariaGEN data without involvement from malariaGEN authors. My understanding is that access to these data requires a "Fort Lauderdate" type agreement with MalariaGEN (https://www.malariagen.net/data/terms-use/pf3k-terms-use). The authors may have already discussed this with relevant people at MalariaGEN - I would suggest that the authors directly contact Dominic Kwiatkowski, if they have not already done so.

I have read this submission. I believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.

F1000Res. 2016 Oct 17.
Gajinder Singh 1

We are grateful for your valuable comments, which we believe have significantly improved our manuscript.

The abstract makes some jumps in logic:  e.g “Furthermore, South-East Asian strains showed a higher proportion of non-synonymous polymorphism at conserved positions, suggesting reduced negative selection.” This pattern can equally well be explained by stronger  positive selection in SEA than in Africa.

Author response: Positive selection can indeed increase (though not in all cases) N/S for the gene under selection, as has been observed for the kelch13 gene in SEA. However, the pattern of higher N/S in strains in SEA which can be observed at genome-wide level is much more consistent with relaxed negative selection in SEA, rather than higher positive selection in SEA, which typically acts at a few sites. Furthermore, reduced constraints at conserved sites at the genome-wide level supports relaxed negative selection rather than positive selection. We thus believe that our use of word “suggesting” is appropriate in the statement.

  • Article content

 

Major points

Results differ from the prior analysis of this dataset. The article comes to  different conclusions from those reported by the MalariaGEN consortium in eLife (ref 18) who write: "Accordingly, we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions ( Figure 3c )” page 6 bottom: MalariaGEN Plasmodium falciparum Community Project. eLife 2016;5:e08714. DOI: 10.7554/eLife.08714). It is unclear to me why this F1000 submission and the published eLife paper reach very different conclusions  using the same dataset? I suggest that the authors should cite the conclusions of the eLife paper and explain why their analysis comes to a very different conclusion.  This point needs urgently needs satisfactory resolution as this is the central conclusion of the paper.

Author response: The authors in the MalariaGEN study did not perform any statistical test to support their statement that N/S in SEA and Africa is similar at the gene level. While the difference is subtle, it is highly statistically significant, as we report in our manuscript. Furthermore, the authors in MalariaGEN study did not analyse differences in N/S at the sample level, where we clearly see the differences between SEA and African strains (Figure 1). We have now added these statements to the Discussion section of the revised manuscript.

Methodology/analysis biases. Reduced ability to score non-synonymous variants in African populations could generate the N/S ratios observed. Such a bias could be generated because infections are more complex in Africa. Non-synonymous mutations tend to be at lower frequency than synonymous mutations. In complex mixed African infections low frequency non-synonymous mutations may be filtered out of the dataset (because very few reads show the mutation) but scored in SE Asian samples, where infections are simpler (so multiple reads show non-synonymous mutations). To eliminate this potential methodological bias, the authors should filter their data to minimize mixed infections prior to analysis. This could be done by setting a threshold for numbers of mixed base calls.  Again, it is important to understand why the conclusions reached by this analysis differ from the published eLife paper.

 

Author response: Separately analysing predicted monoclonal and polyclonal samples did not change conclusions (Supplementary Figure 1 in the revised manuscript). This is not surprising given that we observed almost no overlap in the distributions of N/S in SEA and African samples (Figure 1), as we wrote in the manuscript. We would like to emphasize that authors in MalariaGEN study did not perform N/S analyses at the sample level.

As pointed out in the eLife paper there is a dramatic difference in the allele frequency spectrum in the two locations. The excess of rare variants in Africa could also potentially alter the N/S ratios.

Author response: As proposed by the reviewer in the previous comment, the excess of rare variants in Africa might increase N/S in Africa, thus the higher N/S in SEA could not be explained by excess rare variants in Africa. 

Population genetics explanation. Assuming the central result is correct, a simpler model explains these results: The effective population size (Ne) of parasite populations is lower in SEA than in Africa. Therefore purifying selection is weaker in SEA than in Africa resulting in less efficient removal of deleterious mutations. Discussion of this simple alternative explanation would improve the paper. Also Cheeseman et al, MBE 2016 (MolBiolEvol. 2016 Mar;33(3):603-20. doi: 10.1093/molbev/msv282.), who make this argument to explain patterns of global copy number polymorphism in Plasmodium falciparum.

 

Author response: Indeed relaxed negative selection in SEA due to lower effective population (reflected in lower polyclonal infections) is exactly what we propose in our manuscript for our observation of higher N/S in SEA. Lower Ne (effective population size) is bound to be associated with lower polyclonal infection rate, which has been shown to lead to within-host competition in P. falciparum 1. Thus we do not believe that the two explanations are alternative independent explanations, one of which needs to be discounted. Rather the observation made by Cheeseman  et al, and in our manuscript are convergent which further supports reduced negative selection in P. falciparum in SEA and we have added these statements to the Discussion section of the revised manuscript.

Statistics used: The paper counts numbers of non-synonymous and synonymous mutations and estimates the ratio (N/S). This is an unusual way to present such data – it would be more informative to show dN/dS – (Nonsynonymous changes per non-synonymous site/synonymous variants per synonymous site). This statistic is more useful because the expected ratio under neutrality is 1. This should not change the results but will be more easily interpretable with reference to a neutral model.

 

Author response: As pointed out by the reviewer dN/dS comparison would not change the results. P. falciparum is unusual in having very high N/S compared to other organisms 2, thus the usual expectation of dN/dS of 1 for neutral evolution is not applicable for P. falciparum.

Related questions: How is N/S ratio determined for genes in which there are NO synonymous mutations? Details of the methods for calculating this ratio, and how they differ from those used in the eLife paper should be provided.

Author response: There were 136 genes with zero synonymous SNPs in SEA (none in Africa) and thus were excluded from the analyses. We have added this statement in the Method section of the revised manuscript.

More support needed for intrahost competition explanation: The authors argue that the observed pattern may result from more within infection competition in Africa relative to Asia. If competition is invoked to explain the results, then it would be useful to examine the categories of genes contributing to the excess of NS variations. This argument would imply that this effect should be seen in subsets of genes involved in within host competition. On the other hand, if there is no particular enrichment of particular gene classes, then the simple population genetics explanation seems more likely.

Author response: There is no dataset of genes known to be involved in within-host competition in P. falciparum. We do note however that many housekeeping genes show higher N/S in SEA than Africa (Supplementary Table 1). As discussed in response to previous comments by the reviewer, the lower Ne, lower rate of polyclonal infections and within-host completion are not mutually exclusive explanations, one of which needs to be discounted.

 

  • Conclusions

 

Lack of balance. The authors argue that drug resistance may arise in SEA rather than Africa because intrahost competition prevents emergence of resistance mutations associated with fitness costs. I agree in part – this may certainly contribute. However, greater selection for resistance to drugs in SE Asia is likely to be a critical factor. Most infections in SEA are symptomatic and infected individuals seek treatment. However, most infections in Africa are asymptomatic, so people to no seek treatment. Discussion of the difference in selection strength between continents would add balance to this paper.

 

Author response: The higher rate asymptomatic infections as well as untreated patients in Africa would mean higher competition between drug resistant and drug sensitive clones in the absence of drug, further decreasing the spread of drug resistance mutations with a fitness cost. Thus higher immunity, lower treatment rates, and higher polyclonal infections are likely to work synergistically. We have added these statements to the Discussion section in the revised manuscript.   

 

Other issues. I was very surprised to see this analysis of MalariaGEN data without involvement from malariaGEN authors. My understanding is that access to these data requires a "Fort Lauderdate" type agreement with MalariaGEN (https://www.malariagen.net/data/terms-use/pf3k-terms-use). The authors may have already discussed this with relevant people at MalariaGEN - I would suggest that the authors directly contact Dominic Kwiatkowski, if they have not already done so.

Author response: We have taken permission from the authors of the MalariaGEN study, and their contribution has been appropriately acknowledged in our publication.

We would again like to thank you for your critical comments on our manuscript, and would be happy to address any further concerns that you may have in the revised version of the manuscript.

Reference List

    1.    Bushman, M. et al. Within-host competition and drug resistance in the human malaria parasite Plasmodium falciparum. Proc. Biol. Sci. 283, 20153038 (2016).

    2.    Chang, H. H. et al. Malaria life cycle intensifies both natural selection and random genetic drift. Proc. Natl. Acad. Sci. U. S. A. 110, 20129-20134 (2013).

F1000Res. 2016 Sep 20. doi: 10.5256/f1000research.10094.r15909

Referee response for version 1

Georges Snounou 1

In this article the authors conduct a detailed analysis that focused on the frequency of non-synonymous mutations across an extensive genome set derived by the MalariaGEN consortium from an extensive set of P. falciparum isolates collected mainly in Africa and SE Asia. The analyses extend the work presented recently (DOI: 10.7554/eLife.08714) where a trend was noted for a higher level of non-synonymous mutations in the SE Asian parasites a compared to that in the African parasites. Specifically, the authors have presented the analysis for the parasites from individual countries and focused on SNPs that occur at residues that are highly conserved in orthologues within a variety of Plasmodium species and across eukaryotes in general. The authors have thus provided convincing evidence for statistically significant albeit relatively subtle differences between the parasites from SE Asia and Africa (it is unclear to me why parasites from Peru and Colombia were included in the analysis, all the more as their numbers were relatively low).

Whereas, the quality of the data, the analyses and the results are not into question, I have some misgivings and indeed frustrations as to the conclusions put forward in an attempt to account for the differences observed. The scenario inspired by the observations that the authors propose as an explanation for the propensity of antimalarial drug resistance to emerge and spread in P. falciparum from SE Asia first before doing so in Africa is plausible and could be supported by the data presented. However, this scenario is based on a set of assumptions whose limitations are not discussed, without any justification as to why this particular scenario is the most likely.

 

  1. What is the nature of the intra-host competition between the genotypes of P. falciparum that circulate in humans, and is there any evidence that this actually alters transmissibility? Immunity is the most obvious selective intra-host factor. To which extend differences in the levels of immunity among the patients that contributed the P. falciparum isolates analysed here alters the multiplicity of the infection or indeed the likelihood of genetic crosses?

  2. Is there any evidence that parasite with the higher levels of non-synonymous mutations across the genome are biologically less fit? There is evidence that this is the case for drug resistant parasites in vitro, that for parasites growing in vivo is less straightforward and varies for different drugs.

  3. To what extend are the isolates collected from a limited number of febrile patients reflect the overall parasite population across a whole country. Could a bias be introduced simply because of differences in the degree of acquired immunity on admission, the time between the onset of the infection and the time treatment was sought, or the level of admission parasitaemia?

  4. Exposure to drug is clearly a strong selective constraint for those genes implicated in overcoming the effects of the drug. What can be the nature of the selective constrains that maintain the higher levels of non-synonymous mutations across numerous genes spread throughout the genome? Clearly not all can be considered to provide some speculative compensatory effect to a potential reduction in fitness following drug selection.

  5. It should be pointed out that in the 1950’s resistance to chloroquine first appeared not only in SE Asia but also in Colombia. Similarly resistance to pyrimethamine did not first appear in SE Asia, but in all areas where the use of this drug became widespread and systematic (this was the case in both SE Asia and African countries in the 1950’s). The “unanswered” question as to the reason why drug resistance has repeatedly emerged in SE Asia, might have more to do with drug usage than with some special property of the parasites.

 

Ultimately, I would suggest that the authors should present the limitations of their conclusions as well as alternative scenarios to account for their observations (and rank or discount them if they can). This will surely be welcome by readers. Advances in knowledge are seeded by diversity in speculation (conservative or less so) and genome wide analyses are a rich source for this.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

References

  • 1. MalariaGEN Plasmodium falciparum Community Project: Genomic epidemiology of artemisinin resistant malaria. Elife.2016;5: 10.7554/eLife.08714 10.7554/eLife.08714 [DOI] [PMC free article] [PubMed] [Google Scholar]
F1000Res. 2016 Oct 17.
Gajinder Singh 1

We are grateful for your valuable comments, which we believe have significantly improved our manuscript.

In this article the authors conduct a detailed analysis that focused on the frequency of non-synonymous mutations across an extensive genome set derived by the MalariaGEN consortium from an extensive set of P. falciparum isolates collected mainly in Africa and SE Asia. The analyses extend the work presented recently (DOI: 10.7554/eLife.08714) where a trend was noted for a higher level of non-synonymous mutations in the SE Asian parasites a compared to that in the African parasites. Specifically, the authors have presented the analysis for the parasites from individual countries and focused on SNPs that occur at residues that are highly conserved in orthologues within a variety of Plasmodium species and across eukaryotes in general. The authors have thus provided convincing evidence for statistically significant albeit relatively subtle differences between the parasites from SE Asia and Africa (it is unclear to me why parasites from Peru and Colombia were included in the analysis, all the more as their numbers were relatively low).

Author response: We would like to clarify that MalariaGEN authors did not write or suggest that N/S ratio of P. falciparum strains from SEA and Africa are different. We have discussed the difference between the work reported in MalariaGEN study and our manuscript in the Discussion section of the revised manuscript. We added data from Peru and Columbia in Figures 1, Figure 5 and Figure 6 only for completeness, but we have refrained from making any firm conclusions based on these samples. We have nevertheless commented upon the observations from South-American samples in the Discussion section in response to your comment.

Whereas, the quality of the data, the analyses and the results are not into question, I have some misgivings and indeed frustrations as to the conclusions put forward in an attempt to account for the differences observed. The scenario inspired by the observations that the authors propose as an explanation for the propensity of antimalarial drug resistance to emerge and spread in P. falciparum from SE Asia first before doing so in Africa is plausible and could be supported by the data presented. However, this scenario is based on a set of assumptions whose limitations are not discussed, without any justification as to why this particular scenario is the most likely.

  • What is the nature of the intra-host competition between the genotypes of P. falciparum that circulate in humans, and is there any evidence that this actually alters transmissibility? Immunity is the most obvious selective intra-host factor. To which extend differences in the levels of immunity among the patients that contributed the P. falciparum isolates analysed here alters the multiplicity of the infection or indeed the likelihood of genetic crosses?

Author response: We thank the reviewer for pointing this out. There is indeed evidence of intra-host competition between genotypes of P. falciparum in humans that reduces parasite density 1. The association between parasite density and gametocyte density in humans has also been previously shown 2. The reasons for intra-host competition however remain unknown. Strain-transcending immunity, resource competition (e.g. RBCs), and direct interference between strains have been proposed as possible mechanisms responsible for within-host competition 3-6. It is also possible that lower recombination rates due to lower rate of mixed infections in SEA may reduce the removal of deleterious mutations. We have added these sentences to the Discussion section of the revised manuscript.

  • Is there any evidence that parasite with the higher levels of non-synonymous mutations across the genome are biologically less fit? There is evidence that this is the case for drug resistant parasites in vitro, that for parasites growing in vivo is less straightforward and varies for different drugs.

Author response: There is currently no evidence that P. falciparum strains from SEA are biologically less fit that strains from Africa. Indeed that is the prediction of our model. It would be fascinating to test this hypothesis. We have added these sentences to the Discussion section of the revised manuscript.

  • To what extend are the isolates collected from a limited number of febrile patients reflect the overall parasite population across a whole country. Could a bias be introduced simply because of differences in the degree of acquired immunity on admission, the time between the onset of the infection and the time treatment was sought, or the level of admission parasitaemia?

Author response: It is possible that samples collected may not be representative of the whole country; however the consistent results we obtain for across countries in Africa and SEA suggest that our observations of higher N/S is SEA compared to Africa is robust to random variations introduced due to sampling.

It is possible that some biases in sampling may be introduced that could lead to differences in the observed rate of polyclonal infections between Africa and SEA. We separately analysed predicted monoclonal and polyclonal samples and find results similar to those obtained using all samples (Supplementary Figure 1 in the revised manuscript). We thus currently have no hypothesis of how biased sampling could actually lead to higher genome-wide N/S in SEA.

  • Exposure to drug is clearly a strong selective constraint for those genes implicated in overcoming the effects of the drug. What can be the nature of the selective constrains that maintain the higher levels of non-synonymous mutations across numerous genes spread throughout the genome? Clearly not all can be considered to provide some speculative compensatory effect to a potential reduction in fitness following drug selection.

Author response: As further explained in the Discussion section, the higher genome scale N/S in SEA is consistent with relaxed negative selection. We hypothesise that relaxed negative selection could be due to low level of polyclonal infection and thus lower within-host competition in SEA. 

  • It should be pointed out that in the 1950’s resistance to chloroquine first appeared not only in SE Asia but also in Colombia. Similarly resistance to pyrimethamine did not first appear in SE Asia, but in all areas where the use of this drug became widespread and systematic (this was the case in both SE Asia and African countries in the 1950’s). The “unanswered” question as to the reason why drug resistance has repeatedly emerged in SE Asia, might have more to do with drug usage than with some special property of the parasites.

Author response: Indeed resistance to chloroquine and sulphadoxine-pyrimethamine also appeared independently in South-America (but not in Africa 7). We have very few samples from South America (27 samples from 2 countries) to make robust conclusions. However it is interesting that samples from South America do show low rate of polyclonal infections and higher N/S compared to Africa (Figure 1 and Figure 6). We have added these sentences to the Discussion section of the revised manuscript.

Ultimately, I would suggest that the authors should present the limitations of their conclusions as well as alternative scenarios to account for their observations (and rank or discount them if they can). This will surely be welcome by readers. Advances in knowledge are seeded by diversity in speculation (conservative or less so) and genome wide analyses are a rich source for this. 

Author response: We have significantly expanded the Discussion section which we hope addresses the reviewer’s concerns. We would be happy to address any further concerns that you may have in the revised version of the manuscript.

Reference List

   1.   Bushman, M. et al. Within-host competition and drug resistance in the human malaria parasite Plasmodium falciparum. Proc. Biol. Sci. 283, 20153038 (2016).

   2.   Ross, A., Killeen, G. & Smith, T. Relationships between host infectivity to mosquitoes and asexual parasite density in Plasmodium falciparum. Am. J. Trop. Med. Hyg. 75, 32-37 (2006).

   3.   Metcalf, C. J. et al. Partitioning regulatory mechanisms of within-host malaria dynamics using the effective propagation number. Science. %19;333, 984-988 (2011).

   4.   Raberg, L. et al. The role of immune-mediated apparent competition in genetically diverse malaria infections. Am. Nat. 168, 41-53 (2006).

   5.   Bruce, M. C. & Day, K. P. Cross-species regulation of malaria parasitaemia in the human host. Curr. Opin. Microbiol. 5, 431-437 (2002).

   6.   Yap, G. S. & Stevenson, M. M. Blood transfusion alters the course and outcome of Plasmodium chabaudi AS infection in mice. Infect. Immun. 62, 3761-3765 (1994).

   7.   Roper, C. et al. Intercontinental spread of pyrimethamine-resistant malaria. Science. %20;305, 1124 (2004).

F1000Res. 2016 Sep 8. doi: 10.5256/f1000research.10094.r15966

Referee response for version 1

Daniel N Hupalo 1

This is a compelling manuscript from Gajinder Pal Singh and Amit Sharma, which conducts an analysis of the accrual of non-synonymous changes within Plasmodium falciparum ( Pf) genes across the breadth of the MalariaGen population genomic dataset. This work builds upon a trend observed in the initial MalariaGen publication(s) that South East Asian (SEA) populations exhibited a general trend of higher Non-synonymous to synonymous mutation ratio (N/S) compared to African populations. The authors replicate and expand this analysis and then compare this trend to a protein dataset comprised of Eukaryotic sequences to show that even among the most conserved sites SEA continues to accrue more NS polymorphisms than African isolates. Lastly the authors set out an argument for the mechanism behind this apparent relaxed negative selection within SEA Pf, attributing it to the lower rate of mixed genotype infections across SEA and the subsequent lack of competition within host. This would create the circumstances that would allow for multiple independent resistance mutations arising and fixing within the SEA population. Exploring general patterns of genome architecture within P. falciparum is vital to understanding the spread and prevention of resistance to anti-malarial drugs, and as a result this work is of interest to the malaria community. There are some clarifications required, and two significant areas of improvement for the paper:

Major clarifications:

 

  1. More detailed results/discussion sections, and deeper discussion of previous literature.While the authors cite the MalariaGen publication (citation 18) they should more explicitly describe the relationship between that publication and this analysis. Specifically, highlighting Figure 5 of citation 18 and its supplemental material, which shows a similar genome-wide analysis of N/S ratio between Africa/SEA. It would be good to directly acknowledge this article/figure as the first description of this African/SEA pattern. Furthermore, it would be constructive for the reader if the authors added a deeper justification for their extension of this work. 

  2. Language used to describe the accrual of Nonsynonymous sites overstates the general trend of NS SNPs at conserved sites. Figure 4 shows that NS SNPs accrue at a higher rate in SEA across the set of conserved sites within Plasmodium and at the set of conserved sites across Eukaryotes. However, the language used to describe this figure in the legend states that “NS polymorphisms… in SEA are more likely to occur at conserved positions.” This statement implies that this is a general genome-wide trend of higher rate of fixing NS SNPs at sites with high conservation. Perhaps extending this legend to mention that NS SNPs are “are more likely to occur at conserved positions when compared to Africa” would avoid this implied generalization. Additionally, on page 3 the authors state “these mutations preferentially occur at the conserved sites”. This statement repeats the highlighted issue above, and adds the word “preferentially” which may create an unintentional teleological statement about the occurrences of NS mutations. NS SNPs occur at a higher rate within the pre-selected subset of conserved sites when compared to Africa, but the figures do not show a general genome-wide preference for NS mutations at conserved sites. 

Minor clarifications are as follows:

  • “Kelch” is misspelled multiple times throughout the article.

  • “Cambodia” is misspelled in multiple figures.

  • “PNG” Abbreviation should be written out fully.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

F1000Res. 2016 Oct 17.
Gajinder Singh 1

This is a compelling manuscript from Gajinder Pal Singh and Amit Sharma, which conducts an analysis of the accrual of non-synonymous changes within Plasmodium falciparum (Pf) genes across the breadth of the MalariaGen population genomic dataset. This work builds upon a trend observed in the initial MalariaGen publication(s) that South East Asian (SEA) populations exhibited a general trend of higher Non-synonymous to synonymous mutation ratio (N/S) compared to African populations. The authors replicate and expand this analysis and then compare this trend to a protein dataset comprised of Eukaryotic sequences to show that even among the most conserved sites SEA continues to accrue more NS polymorphisms than African isolates. Lastly the authors set out an argument for the mechanism behind this apparent relaxed negative selection within SEA Pf, attributing it to the lower rate of mixed genotype infections across SEA and the subsequent lack of competition within host. This would create the circumstances that would allow for multiple independent resistance mutations arising and fixing within the SEA population. Exploring general patterns of genome architecture within P. falciparum is vital to understanding the spread and prevention of resistance to anti-malarial drugs, and as a result this work is of interest to the malaria community. There are some clarifications required, and two significant areas of improvement for the paper:

Author response: We thank you for your review and comments. We would like to clarify that higher N/S in P. falciparum strains from SEA compared to African strains has not been stated or suggested in the MalariaGen manuscript (Elife. 2016). In fact the authors of the MalariaGEN wrote that at the gene level “we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions”, however, no statistical test was performed by the authors. Furthermore, no comparison of N/S at the sample level was performed in the MalariaGEN study. We have added these statements to the Discussion section of the revised manuscript.

 Major clarifications:

 

  • More detailed results/discussion sections, and deeper discussion of previous literature. While the authors cite the MalariaGen publication (citation 18) they should more explicitly describe the relationship between that publication and this analysis. Specifically, highlighting Figure 5 of citation 18 and its supplemental material, which shows a similar genome-wide analysis of N/S ratio between Africa/SEA. It would be good to directly acknowledge this article/figure as the first description of this African/SEA pattern. Furthermore, it would be constructive for the reader if the authors added a deeper justification for their extension of this work. 

Author response: The aim of Figure 5 of the MalariaGen manuscript (Elife. 2016) was to test whether neutral evolution could account for the pattern of mutations in kelch13. Figure 5a plots genic N/S vs. protein conservation in Africa and SEA. The Figure shows that kelch13 gene is an outlier in the scatter-plot for SEA, but follows a general trend in Africa. Figure 5b plots the ratio of N/S ratios in SEA and Africa, for all genes (with ≥5 synonymous and ≥5 non-synonymous SNPs) and shows again that kelch13 is an outlier. Figure 5—Supplementary Figure 1 is same as Figure 5a, but also highlights other drug resistance genes. Figure 5—Supplementary Figure 2a plots number of non-synonymous SNPs/kbp vs. protein conservation in Africa and SEA and show that kelch13 follows the trend among genes in Africa, but has excess non-synonymous polymorphisms in SEA, while Figure 5—Supplementary Figure 2b plots number of synonymous SNPs/kbp vs. protein conservation in Africa and SEA and shows that k elch13 follows the normal trend among genes in Africa, but has far fewer synonymous SNPs than expected in SEA. 

Thus the authors conclude that high prevalence of non-synonymous SNPs in kelch13 in SEA is not explainable by neutral evolution, but is consistent with neutral evolution in Africa. We would like to reemphasize that nowhere in the manuscript do the authors state or imply that strains from SEA have a higher N/S than strains from Africa.

  • Language used to describe the accrual of Nonsynonymous sites overstates the general trend of NS SNPs at conserved sites. Figure 4 shows that NS SNPs accrue at a higher rate in SEA across the set of conserved sites within Plasmodium and at the set of conserved sites across Eukaryotes. However, the language used to describe this figure in the legend states that “NS polymorphisms… in SEA are more likely to occur at conserved positions.” This statement implies that this is a general genome-wide trend of higher rate of fixing NS SNPs at sites with high conservation. Perhaps extending this legend to mention that NS SNPs are “are more likely to occur at conserved positions when compared to Africa” would avoid this implied generalization. Additionally, on page 3 the authors state “these mutations preferentially occur at the conserved sites”. This statement repeats the highlighted issue above, and adds the word “preferentially” which may create an unintentional teleological statement about the occurrences of NS mutations. NS SNPs occur at a higher rate within the pre-selected subset of conserved sites when compared to Africa, but the figures do not show a general genome-wide preference for NS mutations at conserved sites. 

Author response: We would like to thank the reviewer for pointing out this mistake. We have modified the Figure legend in the revised manuscript to “Non-synonymous polymorphisms specifically observed in SEA are more likely to occur at conserved positions when compared to non-synonymous polymorphisms specifically observed in Africa” as suggested. We have also modified the statement “This may be important for the acquisition of antimalarial drug resistance since these mutations preferentially occur at the conserved sites” to “This may be important for the acquisition of antimalarial drug resistance since drug-resistance mutations preferentially occur at the conserved sites”.

Minor clarifications are as follows:

  • “Kelch” is misspelled multiple times throughout the article.

     

  • “Cambodia” is misspelled in multiple figures.

     

  • “PNG” Abbreviation should be written out fully.

 

Author response: Thanks for pointing out these mistakes; we have corrected these mistakes in the revised manuscript. We would be happy to address any further concerns that you may have about the revised version of the manuscript.

F1000Res. 2016 Aug 30. doi: 10.5256/f1000research.10094.r15669

Referee response for version 1

François H Nosten 1

In this paper the authors analyze the data produced by the MalariaGen community that provides high quality data on over 900,000 SNP in more than 3000 samples from 22 countries. The authors report a significantly higher ratio of non-synonymous to synonymous polymorphisms in  P. falciparum from South-East Asia compared to Africa and propose that this could explain the emergence of resistance in Asia. However this observation (and also the higher multiplicity of infections in Africa) is not novel.

Furthermore the large number of K13 alleles, their dynamics and relative contributions to the clinical phenotype are likely to also be important. I think they could improve by showing that their analysis helps to understand the spread of a specific allele of K13 such as the C580Y.

It would be useful if the authors could provide more explanations as to why they believe their findings are innovative and how they help us to understand the evolution of artemisinin resistance, since the mechanism is clearly different to that of other antimalarials.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

F1000Res. 2016 Oct 17.
Gajinder Singh 1

Thank you for your review and comments.

In this paper the authors analyze the data produced by the MalariaGen community that provides high quality data on over 900,000 SNP in more than 3000 samples from 22 countries. The authors report a significantly higher ratio of non-synonymous to synonymous polymorphisms in P. falciparum from South-East Asia compared to Africa and propose that this could explain the emergence of resistance in Asia. However this observation (and also the higher multiplicity of infections in Africa) is not novel.

Author response: We are not aware of any manuscript that reported higher N/S ratio of P. falciparum of SEA strains of P. falciparum compared to African strains. If the reviewer is referring to the MalariaGen study (Elife. 2016), the authors in that manuscript did not report or suggest these results. In fact the authors of the MalariaGEN wrote that at the gene level “we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions”, however, no statistical test was performed by the authors. Furthermore, no comparison of N/S at the sample level was performed in the MalariaGEN study. We have added these statements to the Discussion section of the revised manuscript.

We agree with the reviewer that the general conclusion of a higher multiplicity of infection in Africa is not novel, indeed we wrote “The rate of mixed infection is generally lower in areas of low-transmission such as SEA 20” before we reported our results. However we have confirmed the higher mixed infection rate in Africa compared to SEA in the MalariaGEN dataset, which is the largest whole-genome dataset on P. falciparum till date. We have added these statements to the Discussion section of the revised manuscript.

Furthermore the large number of K13 alleles, their dynamics and relative contributions to the clinical phenotype are likely to also be important. I think they could improve by showing that their analysis helps to understand the spread of a specific allele of K13 such as the C580Y.

Author response: Our manuscript is an attempt to understand why in general anti-malarial resistance often arises and spreads in SEA and it is beyond the scope of this manuscript to understand why a specific allele of K13 is dominating in certain parts of SEA.

It would be useful if the authors could provide more explanations as to why they believe their findings are innovative and how they help us to understand the evolution of artemisinin resistance, since the mechanism is clearly different to that of other antimalarials.

Author response: We have modified the Discussion section of the manuscript to clarify how our results differ from the MalariaGEN study. We have also discussed the implications of our results for the current wave of artemisinin resistance. These findings have not been previously reported. As stated previously our manuscript is an attempt to understand why in general anti-malarial resistance often arises and spreads in SEA, but whether and how the evolutionary mechanism of artemisinin resistance is different from other antimalarial is beyond the scope of this manuscript.

We would again like to thank you for comments on our manuscript, and would be happy to address any further concerns that you may have about the revised version of the manuscript.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Availability Statement

    The data referenced by this article are under copyright with the following copyright statement: Copyright: © 2016 Singh GP and Sharma A

    This publication uses data from the MalariaGEN Plasmodium falciparum Community Project as described in Genomic epidemiology of artemisinin resistant malaria, eLife, 2016 (DOI: http://dx.doi.org/10.7554/eLife.08714). This data is also available from the MalariaGEN website ( https://www.malariagen.net/data/p-falciparum-community-project-jan-2016-data-release).


    Articles from F1000Research are provided here courtesy of F1000 Research Ltd

    RESOURCES