Skip to main content
PeerJ logoLink to PeerJ
. 2018 Dec 14;6:e6141. doi: 10.7717/peerj.6141

The Plasmodium knowlesi Pk41 surface protein diversity, natural selection, sub population and geographical clustering: a 6-cysteine protein family member

Md Atique Ahmed 1, Ki-Back Chu 2, Fu-Shi Quan 1,3,
Editor: Erika Braga
PMCID: PMC6296336  PMID: 30581686

Abstract

Introduction

The zoonotic malaria parasite Plasmodium knowlesi has currently become the most dominant form of infection in humans in Malaysia and is an emerging infectious disease in most Southeast Asian countries. The P41 is a merozoite surface protein belonging to the 6-cysteine family and is a well-characterized vaccine candidate in P. vivax and P. falciparum; however, no study has been done in the orthologous gene of P. knowlesi. This study investigates the level of polymorphism, haplotypes and natural selection of pk41 genes in clinical isolates from Malaysia.

Method

Thirty-five full-length pk41 sequences from clinical isolates of Malaysia along with four laboratory lines (along with H-strain) were downloaded from public databases. For comparative analysis between species, orthologous P41 genes from P. falciparum, P. vivax, P. coatneyi and P. cynomolgi were also downloaded. Genetic diversity, polymorphism, haplotype and natural selection were determined using DnaSP 5.10 software. Phylogenetic relationships between Pk41 genes were determined using MEGA 5.0 software.

Results

Analysis of 39 full-length pk41 sequences along with the H-strain identified 36 SNPs (20 non-synonymous and 16 synonymous substitutions) resulting in 31 haplotypes. Nucleotide diversity across the full-length gene was low and was similar to its ortholog in P. vivax; pv41. Domain-wise amino acid analysis of the two s48/45 domains indicated low level of polymorphisms for both the domains, and the glutamic acid rich region had extensive size variations. In the central domain, upstream to the glutamate rich region, a unique two to six (K-E)n repeat region was identified within the clinical isolates. Overall, the pk41 genes were indicative of negative/purifying selection due to functional constraints. Domain-wise analysis of the s48/45 domains also indicated purifying selection. However, analysis of Tajima’s D across the genes identified non-synonymous SNPs in the s48/45 domain II with high positive values indicating possible epitope binding regions. All the 6-cysteine residues within the s48/45 domains were conserved within the clinical isolates indicating functional conservation of these regions. Phylogenetic analysis of full-length pk41 genes indicated geographical clustering and identified three subpopulations of P. knowlesi; one originating in the laboratory lines and two originating from Sarawak, Malaysian Borneo.

Conclusion

This is the first study to report on the polymorphism and natural selection of pk41 genes from clinical isolates of Malaysia. The results reveal that there is low level of polymorphism in both s48/45 domains, indicating that this antigen could be a potential vaccine target. However, genetic and molecular immunology studies involving higher number of samples from various parts of Malaysia would be necessary to validate this antigen’s candidacy as a vaccine target for P. knowlesi.

Keywords: Plasmodium knowlesi, Pk41, Natural selection, 6-Cysteine family, s48/45 domain, Polymorphism, Malaysia, Clinical samples, Low diversity, Vaccine

Introduction

Malaria is a major public health concern as it causes the death of a half a million people around the globe annually (WHO, 2017). The zoonotic malaria parasite Plasmodium knowlesi is now considered as the fifth Plasmodium species capable of infecting humans, and cases are rapidly emerging in many countries of Southeast Asia (Ahmed & Cox-Singh, 2015; Cox-Singh et al., 2008; Garnham, 1966). According to latest reports, P. knowlesi accounts for 70–80% of all malaria cases in Malaysian Borneo (Barber et al., 2017; Barber et al., 2011; Daneshvar et al., 2009; William et al., 2013) and as per the world malaria report 2017, there is a rapid increase of human cases in Malaysia (WHO, 2017). P. knowlesi is often misdiagnosed as P. malariae under the microscope, therefore molecular testing using PCR for confirmation is essential (Cox-Singh et al., 2008). The erythrocytic cycle of this parasite is 24 h, which is the fastest among all human malarias, and increase in parasite count has been associated with severe malaria in humans with fatal outcome (Cox-Singh et al., 2008; Daneshvar et al., 2009; Willmann et al., 2012). Genetic studies on invasion genes as well as whole genome studies have identified at least three subpopulations of the parasite in clinical samples from Malaysia and two of the populations were associated with the monkey hosts; Macaca fascicularis and Macaca nemestrina in Malaysian Borneo (Ahmed et al., 2014, 2016; Assefa et al., 2015; Pinheiro et al., 2015). A recent study on two genes that have been extensively used for phylogenetic studies that is the mitochondrial cytochrome oxidase I (cox 1) and smaller subunit ribosomal rRNA of P. knowlesi from clinical samples and wild macaques identified two distinct subpopulation which clustered geographically to Peninsular Malaysia and Malaysian Borneo (Yusof et al., 2016). These studies indicate that P. knowlesi infections in humans is complex and involves multiple subpopulations of the parasite and some of the infections may cause severe disease.

Antimalarial vaccine is an important tool for malaria control and elimination, but high polymorphism displayed by field isolates within candidate antigens remains as one of the major factor hindering vaccine development. Leading vaccine antigens that are found to be under positive (diversifying) selection (like ama1, msp1 and msp2) are parasite surface proteins and have evolved extensive genetic diversity in order to evade host immune response in natural parasite populations (Barry & Arnott, 2014; Patel et al., 2017; Takala et al., 2009). These candidates when tested in the field show allele-specific immune response thereby reducing the efficacy of the candidate antigen in endemic regions (Osier et al., 2010). For example, currently RTS,S, is the only candidate for vaccine development which is designed based on the P. falciparum circumsporozoite protein. It has reached phase IV clinical trials (Coelho et al., 2017); however, has low efficacy in the field and one of the reasons was parasite diversity and allele-specific immune response observed within field isolates of P. falciparum (Neafsey et al., 2015). The pkcsp diversity in clinical isolates of Malaysia is also high and the epitope binding regions were under the influence of positive natural selection (Fong et al., 2015). Many such antigens in P. falciparum (Barry et al., 2013; Genton et al., 2002; Patel et al., 2017; Soe et al., 2017) and P. vivax (Lo et al., 2017) have high nucleotide diversity, leading to allele-specific immune responses or low efficacy during vaccine trials. Thus, it is important to assess the level of diversity, type of natural selection and significance toward further studying of the antigen as a vaccine candidate.

Among merozoite invasion proteins widely studied in P. falciparum and P. vivax, the 6-cysteine protein family is a mutli-stage conserved surface proteins which are expressed in both asexual (merozoites) and sexual (gametocytes) stages of the parasite (Angel et al., 2008; Garcia et al., 2009). The protein family in P. falciparum consists of five members; Pf12, Pf12P, Pf38, Pf41 and Pf92 proteins which are expressed at the asexual stages of the parasite’s life cycle and is characterized by the presence of six conserved cysteine residues called the s48/45 domains (Arredondo & Kappe, 2017; Garcia et al., 2009). The protein has a signal peptide but lacks the transmembrane domain (Garcia et al., 2009). The Pf41 has two high affinity binding peptides in the s48/45 domains and is anchored to the merozoite surface by forming an inverted heteroduplex with Pf12 (Taechalertpaisarn et al., 2012; Tonkin et al., 2013). Recent structural studies on Pf41 has revealed that there is an intra-domain insertion which is necessary for binding to Pf12 and this protects it from proteolytic cleavage (Parker, Peng & Boulanger, 2015). The P. vivax ortholog pv41 gene has been characterized (Angel et al., 2008). The Pv41 is localized at the merozoite surface and high immunogenicity observed from patient sera suggests its exposure to the host’s immune system (Cheng et al., 2013). Pf41 has been under positive natural selection (Tonkin et al., 2013) and recognized by serum from naturally infected individuals with seroprevalences in the range of 32–88% (Osier et al., 2014; Richards et al., 2013). Population genetic analysis has revealed positive balancing selection for C-terminal regions of the pv41 genes from China–Myanmar area indicating immune evasion by the parasite (Wang et al., 2014). Despite the significance of these studies, which has demonstrated the potential of P41 as a vaccine candidate, no study has been conducted in the P. knowlesi ortholog Pk41 protein.

In this study, the domains of Pk41 protein were characterized based on the amino acid sequence alignment to its orthologs in P. vivax (Pv41) and P. falciparum (Pf41). Genetic diversity, natural selection, number of haplotypes and haplotype diversity within 39 isolates [35 clinical isolates and four laboratory lines (along with the H-strain)] from Malaysia were determined using full-length Pk41 genes. Since this is the first study of Pk41 from clinical samples, the information obtained from this study will be helpful to understand the level of polymorphism within the functional domains in field isolates for future functional studies as well as rational design and formulation of a blood-stage vaccine against P. knowlesi.

Materials and Methods

Pk41 sequence data

Thirty-nine full-length pk41 gene sequences were obtained from published database from clinical isolates originating from Malaysian Borneo and three long-time isolated lines from Peninsular Malaysia (along with the H-strain, PKNH_0303000) and the Philippine Strain (Table S1) (Assefa et al., 2015). The genomes were downloaded from the European Nucleotide Archive (https://www.ebi.ac.uk/ena). A map showing the geographical location of the isolates used in the study is shown in Fig. S1. Signal peptide for the full-length Pk41 was predicted using Signal IP 3.0 prediction software (Petersen et al., 2011). Sequence data were aligned using the CLUSTAL-W program in MegAlign Lasergene v 7.0 (DNASTAR) and polymorphism and phylogenetic analyses were conducted in MEGA 5.0 software. In order to determine the relationship between Pk41 sequences (laboratory lines and clinical isolates from Sarawak, Malaysian Borneo), phylogenetic analyses were conducted using deduced amino acid sequences using Maximum Likelihood (ML) method based on Poisson correction model as described in MEGA 5.0 with 1,000 bootstrap replicates to test the robustness of the trees (Table S1). Orthologous members of P. falciparum (PF3D7_04049000), P. cynomolgi (PCYB_031270), P. coatneyi (PCOAH_00006000) and P. vivax Sal-1 (PVX_000995) were also included in the analyses. ML-based phylogenetic trees were also constructed based on the s48/41 domain I and II of Pk41 protein.

Sequence diversity and natural selection

Sequence diversity (π), which is defined as the average number of nucleotide differences per site between two sequences was determined by DnaSP v5.10 software (Librado & Rozas, 2009). Number of polymorphic sites, parsimony informative sites (sites that have a minimum of two nucleotides that are present at least twice), number of synonymous (silent mutations) and non-synonymous substitutions (replacement mutations or mutations leading to change in amino acids), number of haplotypes (H), singletons (a nucleotide variant that appears only once in among the sequences) and haplotype diversity within the pk41 sequences were also determined by DnaSP software. Graphical representation of nucleotide diversity was conducted using the same software with window length 50 bp and step size 12 bp.

Natural selection was determined by calculating the rates of synonymous substitutions per synonymous site (dS) and nonsynonymous substitutions per nonsynonymous site (dN) which were computed by using Nei & Gojobori’s method (1986). Juke-Cantor correction and their standard errors of these parameters were estimated by the bootstrap method with 1,000 pseudo replicates as implemented in the MEGA 5.0 program (Tamura et al., 2011). Additionally, the Tajima’s D, Fu & Li’s D* and F* neutrality tests were performed as implemented in DnaSP v5.10 software. Tajima’s D is expected to be zero under neutrality. When Tajima’s D values are positive and significant it indicate positive/balancing selection, whereas negative values suggest negative selection or population expansion. Graphical representation of Tajima’s D was also conducted using the same software. Significant positive values for Fu & Li’s D* and F* also indicates population contraction due to a selection event while negative values indicated population expansion and excess of singletons.

Genetic differentiation

The ARLEQUIN software v.3.5.1.3 (Excoffier, Laval & Schneider, 2007) was used to compute pairwise differences (FST) between P. knowlesi subpopulations which were identified though ML-based phylogenetic analysis. The FST values were determined with 10,100 permutations. FST is a comparison of the sum of genetic variability within and between populations based on the differences in allelic frequencies. FST values are interpreted as no (0), low (>0–0.05), moderate (0.05–0.15) and high (0.15–0.25) genetic differentiation.

Results

Pk41 sequence identity within ortholg members and diversity within P. knowlesi population

The signal peptide of the Pk41 protein was detected between amino acid positions 21 and 22 using the Signal IP server (Fig. S2). There was no transmembrane domain predicted within the alignment. Alignment and comparison of the amino acid sequences of the full-length P. knowlesi H reference strain Pk41 sequences with its ortholog in P. vivax Sal-1 reference and P. falciparum 3D7 reference strain showed 84.9% and 41.5% identities respectively. The sequence identity differed mainly due to the central region which had a glutamate rich region in P. vivax and P. knowlesi but absent in P. falciparum. The schematic structure of pk41 gene with domain coordinates and 6-cysteine residues in comparison with its orthologs in P. vivax and P. falciparum are shown in Fig. 1. Other orthologs in primate malarias that is P. cynomolgi B strain and P. coatneyi Hackeri strain showed 84.1% and 87.5% sequence identity respectively. Within the full-length pk41 sequences (n = 39), there were 36 polymorphic sites which led to 20 synonymous and 16 non-synonymous substitutions. There were 28 parsimony informative sites, of which three sites were of three variants and 18 singleton variable sites leading to 31 haplotypes (Table S2). All the 6-cysteine residues within the two s48/45 domains were conserved, indicating active functional binding to host erythrocytes (Figs. S3 and S4). In addition to non-synonymous SNPs, the Pk41 had a repeat region encoding Lysine-Glutamic acid (K-E)n region (Fig. 2). The number of repeats varied from two to six within the laboratory lines and Sarawak, Malaysian Borneo. In addition to the (K-E)n repeat units, variation of glutamic acid (E)n repeats within the central domain led to size variations of 1,182–1,233 bp within the isolates (Fig. 2).

Figure 1. Schematic structure of P41 proteins in P. falciparum, P. vivax and P. knowlesi.

Figure 1

The conserved two 6-Cys domains, labeled as s48/45 Domain I and II along with its molecular weight, are in shaded and dotted background, respectively. (K-E)n repeat region is shown in yellow. The position of cystiene residues are given. The arrow marks indicate amino acid co-ordinates for each domain.

Figure 2. Amino acid polymorphism within Pk41 proteins sequences from Malaysia.

Figure 2

The green shaded regions represents the polymorphism with s48/45 domains I and II and the polymorphic residues are shaded in orange. The yellow shaded region is the (K-E) repeat region and blue shaded region is the Glu-rich region and the polymorphism within them. The amino acid position colored in red represents the insertions within the clinical isolates from Malaysian Borneo.

The overall nucleotide diversity of full-length pk41 was higher (π = 0.00959 ± SD 0.0001) compared to its ortholog in P. vivax (Forero-Rodriguez, Garzon-Ospina & Patarroyo, 2014) (Table 1). However, high nucleotide diversity within pk41 could be attributed to the size variations observed within the (K-E)n repeat region and the Glu-rich region within the P. knowlesi clinical isolates. The graphical representation of nucleotide diversity using the sliding window plot (window length 50 bp and step size 12 bp) also revealed that the diversity was high only within the central domain region due to the size variations (Fig. 3A). The total number of insertion–deletion (InDel) sites within the central domain was found to be 172. Of the 172 sites, 55 sites could be analyzed and the indel diversity was found to be high [π (i) = 0.354]. Most SNPs identified within these s48/45 domains were synonymous (Table 1). Domain-wise analysis of the s48/45 domain I and domain II indicated that both domains had very low levels of polymorphisms similar to its ortholog in P. vivax (Forero-Rodriguez, Garzon-Ospina & Patarroyo, 2014). Only two non-synonymous mutations N123D and M124V were identified within the s48/45 domain I (Fig. 2), which had minor allele frequency (MAF) greater than 10%. The s48/45 domain II had five mutations, of which only two non-synonymous mutations with MAF > 10% (H271D and T272K) were identified (Fig. 2). The remaining three mutations V262I, K332R and T344A were singleton non-synonymous mutations (Fig. 2). The two s48/45 domains shared similar levels of nucleotide diversity and it ranged from π = 0.00582–0.00530. The domain I had a slightly higher haplotype diversity compared to domain II (Table 1).

Table 1. Estimates of nucleotide diversity, haplotype diversity and neutrality indices of pk41.

Domain No. samples SNPs Syn NonSyn No. haplotype Diversity ± SD Taj D dN-dS ± S.E Fu & Li’s D* Fu & Li’s F*
Haplotype Nucleotide
Full-length 39 36 20 16 31 0.982 ± 0.011 0.00959 ± 0.0001 −0.79 (P > 0.1) −0.02 ± 0.006 −1.29 (P > 0.1) −1.37 (P > 0.1)
S48/45 Domain I 12 10 2 14 0.870 ± 0.038 0.00582 ± 0.0009 −0.21 (P > 0.1) −0.03 ± 0.001 0.46 (P > 0.1) 0.28 (P > 0.1)
Glutamic-rich region (excluding InDel sites) 9 7 2 10 0.750 ± 0.060 0.00543 ± 0.0007 −1.31 (P > 0.1) 0.002 ± 0.0001 −2.06 (P > 0.1) −2.14 (P > 0.1)
S48/45 Domain II 10 5 5 9 0.727 ± 0.064 0.00530 ± 0.0007 −0.71 (P > 0.1) −0.013 ± 0.008 −1.48 (P > 0.1) −1.45 (P > 0.1)

Note:

SNPs, Single nucleotide polymorphisms; SD, Standard deviation: Syn, Synonymous substitutions; NonSyn, Non synonymous substitutions: SE, Standard error.

Figure 3. (A) Graphical representation of nucleotide diversity (π) within Pk41 genes and (B) Tajimas D value across the full-length Pk41 genes.

Figure 3

The three domains; s48/45 domain I, II and the central domains are marked and with double-sided arrows. Dotted lines were used to indicate the peaks in π graph and the D graph. The asterisk indicates high positive Tajimas D values within the domain. The π graph and the D graph were drawn with window length 50 and step size 12 in Dnasp.

Natural selection of Pk41

Analysis for natural selection of pk41 genes from Malaysia indicated that the gene is under purifying/negative selection, implying functional constraints. The overall dN-dS, Taj D, Fu and Li’s D* and F* values across the genes were negative (Table 1) but not significant. Domain-wise analysis of dN-dS also indicated that the s48/45 domains were under negative selection. Taj D values and Fu & Li’s D* and F* were also negative, but not significant indicating purifying selection and population expansion in each domain (Table 1). However, graphical representation of Taj D across the entire gene identified certain regions within the s48/45 domains, which showed high positive D values (Fig. 3B) indicating non-synonymous SNPs in the region could be under positive balancing selection. These results indicated that a higher number of samples will be necessary for a statistically significant negative selection at this locus.

Phylogenetic analysis

Phylogenetic analysis of the 39 full-length Pk41 deduced amino acid sequences with other Plasmodium species using ML method identified three distinct P. knowlesi sub-populations. Of the three subpopulations, two populations originated from Malaysian Borneo (Fig. 4). The four laboratory lines, the H-strain, the Malayan Strain, MR4 and the Philippine Strain, which originated from Peninsular Malaysia and Philippines formed the third subpopulation (Fig. 4). The pk41 sequences were obtained from a previously published genomic study (Assefa et al., 2015) where the clinical samples were found to be associated with the primary hosts of P. knowlesi, that is Macaca fascicularis and Macaca nemestrina. Thus, cluster 1 and cluster 2 identified in this study also associated with Macaca fascicularis and Macaca nemestrina, respectively. The ML method showed that the Pk41 was more closely related to P. coatneyi 41 compared to its ortholog in P. vivax, P. falciparum and P. cynomolgi. Independent domain-wise ML-based phylogenetic analysis (for s48/45 domain I and II) also indicated same pattern of three subpopulations (Figs. S5 and S6). Two distinct amino acid polymorphisms N123D and T272K in domain I and II of s48/45, respectively, were the core determinants in separating P. knowlesi clinical isolates into two subpopulations in Sarawak (Fig. 2). The minor alleles (N and T) were associated with Macaca nemestrina and the major allele (D and K) were associated with Macaca fascicularis. However, phylogenetic trees constructed based on the central domain did not indicate any pattern of subpopulations (data not shown).

Figure 4. Phylogenetic relationship of Pk41 proteins from clinical isolates of Malaysia and the ortholog in other Plasmodium species based on Maximum Likelihood method.

Figure 4

The two P. knowlesi Pk41 subpopulations identified in Malaysian Borneo are shown as cluster 1 and cluster 2 and the four laboratory lines formed the cluster 3 from Peninsular Malaysia. Numbers at the nodes indicate bootstrap values.

Genetic differentiation within P. knowlesi subpopulations

Pairwise population differentiation index (FST values) using ARLEQUIN software identified very high and significant genetic differentiation within the subpopulations originating from Sarawak, Malaysian Borneo (FST = 0.732, P < 0.000, between cluster 1 and cluster 2), Table S3. Similar high and significant genetic differentiation was also observed between parasites population of within the laboratory lines and Sarawak, Malaysian Borneo clusters (Table S3) suggesting that parasitic transmission is confined to each of the regions. Previous studies have established that these two distinct sympatric subpopulation clusters from Sarawak, Malaysian Borneo were associated with their primary hosts that is Macaca fascicularis and Macaca nemestrina (Assefa et al., 2015; Divis et al., 2017).

Discussion

Blood stage antigens, which are localized at the merozoite surface, play an important role in invasion into erythrocytes. These antigens are directly exposed to host immune response during merozoite egress, and thus are excellent vaccine candidates. A candidate antigen should optimally possess low polymorphism to be efficacious across different geographical locations and avoid allele specific immune response. Recent P. knowlesi studies on known vaccine candidates for example normocyte binding protein xa and xb (Ahmed et al., 2014), csp (Fong et al., 2015), msp-142 (Yap et al., 2017) showed high genetic diversity in field isolates of Malaysia. However, some merozoite surface proteins for example pkaarp (Muh et al., 2018) and pkmsp1p (Ahmed, Fauzi & Han, 2018a) showed low levels of polymorphisms, which signifies potential candidacy for vaccine studies. The 6-cysteine protein family is conserved across Plasmodium species and Pf41 and Pv41 are major blood stage antigens that generate protective immune response in patients (Crosnier et al., 2013; Hostetler et al., 2015). Low levels of polymorphisms in field isolates have been reported and may prove to be excellent vaccine candidates (Forero-Rodriguez, Garzon-Ospina & Patarroyo, 2014). The objectives of the present study were to genetically characterize the pk41 gene from clinical isolates from Sarawak, Malaysian Borneo and study the level of genetic diversity, natural selection acting on the full-length Pk41 and at its functional s48/45 domains. Sequence alignment of 39 full-length amino acid sequences of Pk41 showed that it shares approximately 84.9% sequence identity with its ortholog Pv41. Since P. knowlesi is a primate malaria parasite and closely related to P. coatneyi (Muehlenbein et al., 2015), its sequence identity was highest with Pco41. The overall nucleotide diversity of pk41 was low within the clinical isolates (36 SNPs) and were of similar levels to its ortholog species P. vivax (Forero-Rodriguez, Garzon-Ospina & Patarroyo, 2014). Despite the fact that phylogenetic analysis of Pk41 amino acid sequences revealed three distinct subpopulations in Malaysia, the non-synonymous SNPs in each of the s48/45 domains were very low indicating functional conservation. It is interesting to note that the diversity observed within the pk41 genes was due to the central domain which had two repeat regions that is (K-E)n repeat region and the glutamic acid (E)n repeat region. Insertions of two to three (K-E)n repeats were detected only in isolates originating from Sarawak, Malaysian Borneo, but not in the laboratory lines which originated from Peninsular Malaysia (H-strain) and Philippines. Variation of repeat numbers that is in (K-E)n and (E)n led to size variations within the clinical isolates. It is also important to note that this (K-E)n repeat region is unique only to P. knowlesi and absent in its ortholog P. vivax and the clinical isolate originating from Sarawak had higher number of repeats. The implications of these repeats only in clinical isolates could be due to recent evolution of P. knowlesi in human population and these repeat units could be potential targets for immune evasion. Thus, immunological studies considering these repeat units would be necessary to further characterize as a vaccine candidate.

Tests of natural selection (using dN-dS, Taj’s D and Li and Fu’s D* and F*) yielded negative values indicating that the Pk41 is under negative/purifying selection; however, the P-values were not significant indicating higher sample size would be necessary for a statistically significant result. Domain-wise analysis of natural selection also indicated negative selection. It is interesting to note that Pv41 and Pf41 have been found to be under positive natural selection in various geographical locations (Tonkin et al., 2013; Wang et al., 2014). The contrasting results observed in P. knowlesi could be because of the three subpopulations identified in this study and majority of the clinical isolates were from Sarawak where two subpopulations coexists. In addition to this, there were natural variations of repeat units (within the central domain) in the clinical isolates and not all codons could be analyzed for natural selection. Thus, it would be important to collect higher number of samples to determine natural selection within each subpopulations. In the current study, it is also worth mentioning the presence of non-synonymous SNPs within the s48/45 domain II of pk41 showing high Taj D values, which may indicate possible epitope binding regions under immune selection pressure. However, these would need further confirmation through immunological studies targeted at these regions. Reports of positive balancing selection within the C terminal s48/45 domain II of pv41 genes have been recently reported from an endemic region (Wang et al., 2014). Similar positive peaks for Taj D values for CTL epitope regions in P. falciparum and P. knowlesi TRAP protein have been identified (Ahmed, Lau & Quan, 2018b; Weedall et al., 2007).

The ML-based phylogenetic tree showed separation of the Pk41 proteins from Malaysian Borneo into two populations while the four laboratory lines (H-strain, Malayan strain, Philippine Strain and the MR4 strain) formed the third subpopulation originating from Peninsular Malaysia and Philippines as observed by Assefa et al. (2015). Interestingly, only s48/45 domain I and II contributed to the grouping of the parasites into three subpopulations indicating selective forces may play a decisive role in bifurcation of trees. Previous studies on blood vaccine candidates such as the PkNBPXa (Ahmed et al., 2016), PkAMA1 (Faber et al., 2015) and PkMSP1P (Ahmed, Fauzi & Han, 2018a) from Malaysian Borneo also documented geographical separation and the presence of two P. knowlesi subpopulations for these antigens.

Population differentiation index FST based on Pk41 between the parasite sub-populations had significantly higher genetic differentiation (FST > 0.7, Table S3) and this can be attributed to the sympatric nature of the parasite populations and their host associated factors as identified previously in a genomic study (Assefa et al., 2015). High genetic differentiation values between long-term isolated laboratory lines and Sarawak, Malaysian Borneo isolates were due to the geographical distance between the two regions, which is separated by the South China Sea (Fig. S1). These results indicate localized transmission in South Asian countries and Malaysian Borneo and the co-existence of the two sympatric subpopulations in Sarawak, Malaysian Borneo. However, a vaccine designed based on the low polymorphic s48/45 domains could still be effective for all the three subpopulations. Thus, further characterization through genetic as well as immunological studies is necessary.

Conclusions

The present study is the first to investigate genetic diversity and natural selection of the pk41 gene from clinical samples of Sarawak, Malaysia. Low level of genetic diversity was observed within the s48/45 domains of the gene, accompanied by extensive size variations due to the repeat regions in the central domain. Overall, the gene is probably under negative/purifying natural selection; however, certain regions in the s48/45 domain II showed high Tajima’s D values thus could be under balancing selection. Further genetic studies with higher number of clinical isolates (specifically form Peninsular Malaysia) as well as immunological studies characterizing the functional domains would be necessary to validate Pk41 as a potential vaccine candidate.

Supplemental Information

Supplemental Information 1. Map of Malaysia with geographical location of sample collection sites.

Geographical location of sequences used in the study. Note: the Philippine Strain (SRR2225573) used in the study originated from the Philippines.

DOI: 10.7717/peerj.6141/supp-1
Supplemental Information 2. Signal peptide prediction.

Signal peptide was predicted with cleavage site between pos. 21 and 22.

DOI: 10.7717/peerj.6141/supp-2
Supplemental Information 3. Conserved 6-cysteine residues within s48/45 domain I of Pk41 clinical isolates.

The conserved cysteine residues are highlighted in yellow.

DOI: 10.7717/peerj.6141/supp-3
Supplemental Information 4. Conserved 6-cysteine residues within s48/45 domain II of Pk41 clinical isolates.

The conserved cysteine residues are highlighted in yellow.

DOI: 10.7717/peerj.6141/supp-4
Supplemental Information 5. Phylogenetic relationship of Pk41 proteins (s48/45 domain I) from clinical isolates of Malaysia and the ortholog in other Plasmodium species based on Maximum Likelihood method.

The two P. knowlesi sub-populations identified based on s48/45 domain I in Malaysian Borneo are shown as cluster 1 and cluster 2 and the four laboratory lines formed the cluster 3 from Peninsular Malaysia. Numbers at the nodes indicate bootstrap values.

DOI: 10.7717/peerj.6141/supp-5
Supplemental Information 6. Phylogenetic relationship of Pk41 proteins (s48/45 domain II) from clinical isolates of Malaysia and the ortholog in other Plasmodium species based on Maximum Likelihood method.

The two P. knowlesi sub-populations identified based on s48/45 domain II in Malaysian Borneo are shown as cluster 1 and cluster 2 and the four laboratory lines formed the cluster 3 from Peninsular Malaysia. Numbers at the nodes indicate bootstrap values.

DOI: 10.7717/peerj.6141/supp-6
Supplemental Information 7. Accession number of Pk41 sequences used in the study and their geographical origin.

P: Peninsular.

DOI: 10.7717/peerj.6141/supp-7
Supplemental Information 8. The 31 pk41 nucleotide haplotypes identified with P. knowlesi clinical isolates.
DOI: 10.7717/peerj.6141/supp-8
Supplemental Information 9. Population differentiation values (FST) from each sub-populations of Malaysia based on Pk41 genes.
DOI: 10.7717/peerj.6141/supp-9

Funding Statement

This work was supported by grants from the National Research Foundation of Korea (NRF) (2018R1A2B6003535, 2018R1A6A1A03025124). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Md Atique Ahmed conceived and designed the experiments, performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, prepared figures and/or tables, authored or reviewed drafts of the paper, approved the final draft, extracted the Pk41 genes from the genomic dataset from clinical isolates of Malaysia.

Ki-Back Chu performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, approved the final draft.

Fu-Shi Quan conceived and designed the experiments, contributed reagents/materials/analysis tools, authored or reviewed drafts of the paper, approved the final draft.

Data Availability

References

  • Ahmed et al. (2014).Ahmed AM, Pinheiro MM, Divis PC, Siner A, Zainudin R, Wong IT, Lu CW, Singh-Khaira SK, Millar SB, Lynch S, Willmann M, Singh B, Krishna S, Cox-Singh J. Disease progression in Plasmodium knowlesi malaria is linked to variation in invasion gene family members. PLOS Neglected Tropical Diseases. 2014;8:e3086. doi: 10.1371/journal.pntd.0003086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ahmed & Cox-Singh (2015).Ahmed MA, Cox-Singh J. Plasmodium knowlesi—an emerging pathogen. ISBT Science Series. 2015;10:134–140. doi: 10.1111/voxs.12115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ahmed, Fauzi & Han (2018a).Ahmed MA, Fauzi M, Han ET. Genetic diversity and natural selection of Plasmodium knowlesi merozoite surface protein 1 paralog gene in Malaysia. Malaria Journal. 2018a;17:115. doi: 10.1186/s12936-018-2256-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ahmed et al. (2016).Ahmed MA, Fong MY, Lau YL, Yusof R. Clustering and genetic differentiation of the normocyte binding protein (nbpxa) of Plasmodium knowlesi clinical isolates from Peninsular Malaysia and Malaysia Borneo. Malaria Journal. 2016;15:241. doi: 10.1186/s12936-016-1294-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ahmed, Lau & Quan (2018b).Ahmed MA, Lau YL, Quan FS. Diversity and natural selection on the thrombospondin-related adhesive protein (TRAP) gene of Plasmodium knowlesi in Malaysia. Malaria Journal. 2018b;17:274. doi: 10.1186/s12936-018-2423-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Angel et al. (2008).Angel DI, Mongui A, Ardila J, Vanegas M, Patarroyo MA. The Plasmodium vivax Pv41 surface protein: identification and characterization. Biochemical and Biophysical Research Communications. 2008;377:1113–1117. doi: 10.1016/j.bbrc.2008.10.129. [DOI] [PubMed] [Google Scholar]
  • Arredondo & Kappe (2017).Arredondo SA, Kappe SHI. The s48/45 six-cysteine proteins: mediators of interaction throughout the Plasmodium life cycle. International Journal for Parasitology. 2017;47:409–423. doi: 10.1016/j.ijpara.2016.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Assefa et al. (2015).Assefa S, Lim C, Preston MD, Duffy CW, Nair MB, Adroub SA, Kadir KA, Goldberg JM, Neafsey DE, Divis P, Clark TG, Duraisingh MT, Conway DJ, Pain A, Singh B. Population genomic structure and adaptation in the zoonotic malaria parasite Plasmodium knowlesi. Proceedings of the National Academy of Sciences of the United States of America. 2015;112:13027–13032. doi: 10.1073/pnas.1509534112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Barber et al. (2017).Barber BE, Rajahram GS, Grigg MJ, William T, Anstey NM. World Malaria Report: time to acknowledge Plasmodium knowlesi malaria. Malaria Journal. 2017;16:135. doi: 10.1186/s12936-017-1787-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Barber et al. (2011).Barber BE, William T, Jikal M, Jilip J, Dhararaj P, Menon J, Yeo TW, Anstey NM. Plasmodium knowlesi malaria in children. Emerging Infectious Diseases. 2011;17:814–820. doi: 10.3201/eid1705.101489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Barry & Arnott (2014).Barry AE, Arnott A. Strategies for designing and monitoring malaria vaccines targeting diverse antigens. Front Immunol. 2014;5:359. doi: 10.3389/fimmu.2014.00359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Barry et al. (2013).Barry AE, Schultz L, Senn N, Nale J, Kiniboro B, Siba PM, Mueller I, Reeder JC. High levels of genetic diversity of Plasmodium falciparum populations in Papua New Guinea despite variable infection prevalence. American Journal of Tropical Medicine and Hygiene. 2013;88:718–725. doi: 10.4269/ajtmh.12-0056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cheng et al. (2013).Cheng Y, Lu F, Tsuboi T, Han ET. Characterization of a novel merozoite surface protein of Plasmodium vivax, Pv41. Acta Tropica. 2013;126:222–228. doi: 10.1016/j.actatropica.2013.03.002. [DOI] [PubMed] [Google Scholar]
  • Coelho et al. (2017).Coelho CH, Doritchamou JYA, Zaidi I, Duffy PE. Advances in malaria vaccine development: report from the 2017 malaria vaccine symposium. NPJ Vaccines. 2017;2:34. doi: 10.1038/s41541-017-0035-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cox-Singh et al. (2008).Cox-Singh J, Davis TM, Lee KS, Shamsul SS, Matusop A, Ratnam S, Rahman HA, Conway DJ, Singh B. Plasmodium knowlesi malaria in humans is widely distributed and potentially life threatening. Clinical Infectious Diseases. 2008;46:165–171. doi: 10.1086/524888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Crosnier et al. (2013).Crosnier C, Wanaguru M, McDade B, Osier FH, Marsh K, Rayner JC, Wright GJ. A library of functional recombinant cell-surface and secreted P. falciparum merozoite proteins. Molecular and Cellular Proteomics. 2013;12:3976–3986. doi: 10.1074/mcp.O113.028357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Daneshvar et al. (2009).Daneshvar C, Davis TM, Cox-Singh J, Rafa’ee MZ, Zakaria SK, Divis PC, Singh B. Clinical and laboratory features of human Plasmodium knowlesi infection. Clinical Infectious Diseases. 2009;49:852–860. doi: 10.1086/605439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Divis et al. (2017).Divis PC, Lin LC, Rovie-Ryan JJ, Kadir KA, Anderios F, Hisam S, Sharma RS, Singh B, Conway DJ. Three divergent subpopulations of the malaria parasite Plasmodium knowlesi. Emerging Infectious Diseases. 2017;23:616–624. doi: 10.3201/eid2304.161738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Excoffier, Laval & Schneider (2007).Excoffier L, Laval G, Schneider S. Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evolutionary Bioinformatics Online. 2007;1:47–50. [PMC free article] [PubMed] [Google Scholar]
  • Faber et al. (2015).Faber BW, Abdul Kadir K, Rodriguez-Garcia R, Remarque EJ, Saul FA, Vulliez-Le Normand B, Bentley GA, Kocken CH, Singh B. Low levels of polymorphisms and no evidence for diversifying selection on the Plasmodium knowlesi apical membrane antigen 1 gene. PLOS ONE. 2015;10:e0124400. doi: 10.1371/journal.pone.0124400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Fong et al. (2015).Fong MY, Ahmed MA, Wong SS, Lau YL, Sitam F. Genetic diversity and natural selection of the Plasmodium knowlesi circumsporozoite protein nonrepeat regions. PLOS ONE. 2015;10:e0137734. doi: 10.1371/journal.pone.0137734. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Forero-Rodriguez, Garzon-Ospina & Patarroyo (2014).Forero-Rodriguez J, Garzon-Ospina D, Patarroyo MA. Low genetic diversity in the locus encoding the Plasmodium vivax P41 protein in Colombia’s parasite population. Malaria Journal. 2014;13:388. doi: 10.1186/1475-2875-13-388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Garcia et al. (2009).Garcia J, Curtidor H, Pinzon CG, Vanegas M, Moreno A, Patarroyo ME. Identification of conserved erythrocyte binding regions in members of the Plasmodium falciparum Cys6 lipid raft-associated protein family. Vaccine. 2009;27:3953–3962. doi: 10.1016/j.vaccine.2009.04.039. [DOI] [PubMed] [Google Scholar]
  • Garnham (1966).Garnham PCC. Malaria parasites and other haemosporidia. Oxford: Blackwell Scientific Publications; 1966. [Google Scholar]
  • Genton et al. (2002).Genton B, Betuela I, Felger I, Al-Yaman F, Anders RF, Saul A, Rare L, Baisor M, Lorry K, Brown GV, Pye D, Irving DO, Smith TA, Beck HP, Alpers MP. A recombinant blood-stage malaria vaccine reduces Plasmodium falciparum density and exerts selective pressure on parasite populations in a phase 1–2b trial in Papua New Guinea. Journal of Infectious Diseases. 2002;185:820–827. doi: 10.1086/339342. [DOI] [PubMed] [Google Scholar]
  • Hostetler et al. (2015).Hostetler JB, Sharma S, Bartholdson SJ, Wright GJ, Fairhurst RM, Rayner JC. A library of Plasmodium vivax recombinant merozoite proteins reveals new vaccine candidates and protein–protein interactions. PLOS Neglected Tropical Diseases. 2015;9:e0004264. doi: 10.1371/journal.pntd.0004264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Librado & Rozas (2009).Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–1452. doi: 10.1093/bioinformatics/btp187. [DOI] [PubMed] [Google Scholar]
  • Lo et al. (2017).Lo E, Lam N, Hemming-Schroeder E, Nguyen J, Zhou G, Lee MC, Yang Z, Cui L, Yan G. Frequent spread of Plasmodium vivax malaria maintains high genetic diversity at the Myanmar–China border, without distance and landscape barriers. Journal of Infectious Diseases. 2017;216:1254–1263. doi: 10.1093/infdis/jix106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Muehlenbein et al. (2015).Muehlenbein MP, Pacheco MA, Taylor JE, Prall SP, Ambu L, Nathan S, Alsisto S, Ramirez D, Escalante AA. Accelerated diversification of nonhuman primate malarias in Southeast Asia: adaptive radiation or geographic speciation? Molecular Biology and Evolution. 2015;32:422–439. doi: 10.1093/molbev/msu310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Muh et al. (2018).Muh F, Ahmed MA, Han JH, Nyunt MH, Lee SK, Lau YL, Kaneko O, Han ET. Cross-species analysis of apical asparagine-rich protein of Plasmodium vivax and Plasmodium knowlesi. Scientific Reports. 2018;8:5781. doi: 10.1038/s41598-018-23728-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Neafsey et al. (2015).Neafsey DE, Juraska M, Bedford T, Benkeser D, Valim C, Griggs A, Lievens M, Abdulla S, Adjei S, Agbenyega T, Agnandji ST, Aide P, Anderson S, Ansong D, Aponte JJ, Asante KP, Bejon P, Birkett AJ, Bruls M, Connolly KM, D’Alessandro U, Dobano C, Gesase S, Greenwood B, Grimsby J, Tinto H, Hamel MJ, Hoffman I, Kamthunzi P, Kariuki S, Kremsner PG, Leach A, Lell B, Lennon NJ, Lusingu J, Marsh K, Martinson F, Molel JT, Moss EL, Njuguna P, Ockenhouse CF, Ogutu BR, Otieno W, Otieno L, Otieno K, Owusu-Agyei S, Park DJ, Pelle K, Robbins D, Russ C, Ryan EM, Sacarlal J, Sogoloff B, Sorgho H, Tanner M, Theander T, Valea I, Volkman SK, Yu Q, Lapierre D, Birren BW, Gilbert PB, Wirth DF. Genetic diversity and protective efficacy of the RTS,S/AS01 malaria vaccine. New England Journal of Medicine. 2015;373:2025–2037. doi: 10.1056/NEJMoa1505819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Nei & Gojobori (1986).Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Molecular Biology and Evolution. 1986;3:418–426. doi: 10.1093/oxfordjournals.molbev.a040410. [DOI] [PubMed] [Google Scholar]
  • Osier et al. (2014).Osier FH, Mackinnon MJ, Crosnier C, Fegan G, Kamuyu G, Wanaguru M, Ogada E, McDade B, Rayner JC, Wright GJ, Marsh K. New antigens for a multicomponent blood-stage malaria vaccine. Science Translational Medicine. 2014;6:247ra102. doi: 10.1126/scitranslmed.3008705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Osier et al. (2010).Osier FH, Weedall GD, Verra F, Murungi L, Tetteh KK, Bull P, Faber BW, Remarque E, Thomas A, Marsh K, Conway DJ. Allelic diversity and naturally acquired allele-specific antibody responses to Plasmodium falciparum apical membrane antigen 1 in Kenya. Infection and Immunity. 2010;78:4625–4633. doi: 10.1128/IAI.00576-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Parker, Peng & Boulanger (2015).Parker ML, Peng F, Boulanger MJ. The structure of Plasmodium falciparum blood-stage 6-Cys protein Pf41 reveals an unexpected intra-domain insertion required for Pf12 coordination. PLOS ONE. 2015;10:e0139407. doi: 10.1371/journal.pone.0139407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Patel et al. (2017).Patel P, Bharti PK, Bansal D, Raman RK, Mohapatra PK, Sehgal R, Mahanta J, Sultan AA, Singh N. Genetic diversity and antibody responses against Plasmodium falciparum vaccine candidate genes from Chhattisgarh, Central India: implication for vaccine development. PLOS ONE. 2017;12:e0182674. doi: 10.1371/journal.pone.0182674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Petersen et al. (2011).Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nature Methods. 2011;8:785–786. doi: 10.1038/nmeth.1701. [DOI] [PubMed] [Google Scholar]
  • Pinheiro et al. (2015).Pinheiro MM, Ahmed MA, Millar SB, Sanderson T, Otto TD, Lu WC, Krishna S, Rayner JC, Cox-Singh J. Plasmodium knowlesi genome sequences from clinical isolates reveal extensive genomic dimorphism. PLOS ONE. 2015;10:e0121303. doi: 10.1371/journal.pone.0121303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Richards et al. (2013).Richards JS, Arumugam TU, Reiling L, Healer J, Hodder AN, Fowkes FJ, Cross N, Langer C, Takeo S, Uboldi AD, Thompson JK, Gilson PR, Coppel RL, Siba PM, King CL, Torii M, Chitnis CE, Narum DL, Mueller I, Crabb BS, Cowman AF, Tsuboi T, Beeson JG. Identification and prioritization of merozoite antigens as targets of protective human immunity to Plasmodium falciparum malaria for vaccine and biomarker development. Journal of Immunology. 2013;191:795–809. doi: 10.4049/jimmunol.1300778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Soe et al. (2017).Soe TN, Wu Y, Tun MW, Xu X, Hu Y, Ruan Y, Win AYN, Nyunt MH, Mon NCN, Han KT, Aye KM, Morris J, Su P, Yang Z, Kyaw MP, Cui L. Genetic diversity of Plasmodium falciparum populations in southeast and western Myanmar. Parasite & Vectors. 2017;10:322. doi: 10.1186/s13071-017-2254-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Taechalertpaisarn et al. (2012).Taechalertpaisarn T, Crosnier C, Bartholdson SJ, Hodder AN, Thompson J, Bustamante LY, Wilson DW, Sanders PR, Wright GJ, Rayner JC, Cowman AF, Gilson PR, Crabb BS. Biochemical and functional analysis of two Plasmodium falciparum blood-stage 6-cys proteins: P12 and P41. PLOS ONE. 2012;7:e41937. doi: 10.1371/journal.pone.0041937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Takala et al. (2009).Takala SL, Coulibaly D, Thera MA, Batchelor AH, Cummings MP, Escalante AA, Ouattara A, Traore K, Niangaly A, Djimde AA, Doumbo OK, Plowe CV. Extreme polymorphism in a vaccine antigen and risk of clinical malaria: implications for vaccine development. Science Translational Medicine. 2009;1:2ra5. doi: 10.1126/scitranslmed.3000257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Tamura et al. (2011).Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Molecular Biology and Evolution. 2011;28:2731–2739. doi: 10.1093/molbev/msr121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Tonkin et al. (2013).Tonkin ML, Arredondo SA, Loveless BC, Serpa JJ, Makepeace KA, Sundar N, Petrotchenko EV, Miller LH, Grigg ME, Boulanger MJ. Structural and biochemical characterization of Plasmodium falciparum 12 (Pf12) reveals a unique interdomain organization and the potential for an antiparallel arrangement with Pf41. Journal of Biological Chemistry. 2013;288:12805–12817. doi: 10.1074/jbc.M113.455667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wang et al. (2014).Wang Y, Ma A, Chen SB, Yang YC, Chen JH, Yin MB. Genetic diversity and natural selection of three blood-stage 6-Cys proteins in Plasmodium vivax populations from the China–Myanmar endemic border. Infection, Genetics and Evolution. 2014;28:167–174. doi: 10.1016/j.meegid.2014.09.026. [DOI] [PubMed] [Google Scholar]
  • Weedall et al. (2007).Weedall GD, Preston BM, Thomas AW, Sutherland CJ, Conway DJ. Differential evidence of natural selection on two leading sporozoite stage malaria vaccine candidate antigens. International Journal for Parasitology. 2007;37:77–85. doi: 10.1016/j.ijpara.2006.09.001. [DOI] [PubMed] [Google Scholar]
  • WHO (2017).WHO . World Malaria Report. Geneva: World Health Organization; 2017. [Google Scholar]
  • William et al. (2013).William T, Rahman HA, Jelip J, Ibrahim MY, Menon J, Grigg MJ, Yeo TW, Anstey NM, Barber BE. Increasing incidence of Plasmodium knowlesi malaria following control of P. falciparum and P. vivax malaria in Sabah, Malaysia. PLOS Neglected Tropical Diseases. 2013;7:e2026. doi: 10.1371/journal.pntd.0002026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Willmann et al. (2012).Willmann M, Ahmed A, Siner A, Wong IT, Woon LC, Singh B, Krishna S, Cox-Singh J. Laboratory markers of disease severity in Plasmodium knowlesi infection: a case control study. Malaria Journal. 2012;11:363. doi: 10.1186/1475-2875-11-363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yap et al. (2017).Yap NJ, Goh XT, Koehler AV, William T, Yeo TW, Vythilingam I, Gasser RB, Lim YAL. Genetic diversity in the C-terminus of merozoite surface protein 1 among Plasmodium knowlesi isolates from Selangor and Sabah Borneo, Malaysia. Infection, Genetics and Evolution. 2017;54:39–46. doi: 10.1016/j.meegid.2017.06.019. [DOI] [PubMed] [Google Scholar]
  • Yusof et al. (2016).Yusof R, Ahmed MA, Jelip J, Ngian HU, Mustakim S, Hussin HM, Fong MY, Mahmud R, Sitam FA, Japning JR, Snounou G, Escalante AA, Lau YL. Phylogeographic evidence for 2 genetically distinct zoonotic Plasmodium knowlesi parasites, Malaysia. Emerging Infectious Diseases. 2016;22:1371–1380. doi: 10.3201/eid2208.151885. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Information 1. Map of Malaysia with geographical location of sample collection sites.

Geographical location of sequences used in the study. Note: the Philippine Strain (SRR2225573) used in the study originated from the Philippines.

DOI: 10.7717/peerj.6141/supp-1
Supplemental Information 2. Signal peptide prediction.

Signal peptide was predicted with cleavage site between pos. 21 and 22.

DOI: 10.7717/peerj.6141/supp-2
Supplemental Information 3. Conserved 6-cysteine residues within s48/45 domain I of Pk41 clinical isolates.

The conserved cysteine residues are highlighted in yellow.

DOI: 10.7717/peerj.6141/supp-3
Supplemental Information 4. Conserved 6-cysteine residues within s48/45 domain II of Pk41 clinical isolates.

The conserved cysteine residues are highlighted in yellow.

DOI: 10.7717/peerj.6141/supp-4
Supplemental Information 5. Phylogenetic relationship of Pk41 proteins (s48/45 domain I) from clinical isolates of Malaysia and the ortholog in other Plasmodium species based on Maximum Likelihood method.

The two P. knowlesi sub-populations identified based on s48/45 domain I in Malaysian Borneo are shown as cluster 1 and cluster 2 and the four laboratory lines formed the cluster 3 from Peninsular Malaysia. Numbers at the nodes indicate bootstrap values.

DOI: 10.7717/peerj.6141/supp-5
Supplemental Information 6. Phylogenetic relationship of Pk41 proteins (s48/45 domain II) from clinical isolates of Malaysia and the ortholog in other Plasmodium species based on Maximum Likelihood method.

The two P. knowlesi sub-populations identified based on s48/45 domain II in Malaysian Borneo are shown as cluster 1 and cluster 2 and the four laboratory lines formed the cluster 3 from Peninsular Malaysia. Numbers at the nodes indicate bootstrap values.

DOI: 10.7717/peerj.6141/supp-6
Supplemental Information 7. Accession number of Pk41 sequences used in the study and their geographical origin.

P: Peninsular.

DOI: 10.7717/peerj.6141/supp-7
Supplemental Information 8. The 31 pk41 nucleotide haplotypes identified with P. knowlesi clinical isolates.
DOI: 10.7717/peerj.6141/supp-8
Supplemental Information 9. Population differentiation values (FST) from each sub-populations of Malaysia based on Pk41 genes.
DOI: 10.7717/peerj.6141/supp-9

Data Availability Statement

The following information was supplied regarding data availability:

https://www.ebi.ac.uk, ERR274221, ERR274222, ERR366425, ERR366426, ERR985374, ERR985376, ERR985377, ERR985378, ERR985379, ERR985380, ERR985381, ERR985382, ERR985385, ERR985386, ERR985387, ERR985388, ERR985390, ERR985392, ERR985393, ERR985394, ERR985395, ERR985396, ERR985397, ERR985404, ERR985405, ERR985406, ERR985407, ERR985408, ERR985409, ERR985410, ERR985411, ERR985416, ERR985417, ERR985418, SRR2225571 (MR4), SRR2225573 (Philippine Strain), PKNOH_S02295300 (Malayan Strain Pk1A), PKNH_0303000 (H-strain), ERR985389.


Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES