Abstract
The merozoite surface protein-1 (MSP1) is a prime candidate for an asexual blood stage vaccine against malaria. However, polymorphism in this antigen could compromise the vaccine’s efficacy. Although the extent of sequence variation in MSP1 has been analyzed from various Plasmodium species, little is known about structural organization and diversity of this locus in Plasmodium malariae (PmMSP1). Herein, we have shown that PmMSP1 contained five conserved and four variable blocks based on analysis of the complete coding sequences. Variable blocks were characterized by short insertion and deletion variants (block II), polymorphic nonrepeat sequences (block IV), complex repeat structure with size variation (block VI) and degenerate octapeptide repeats (block VIII). Like other malarial MSP1s, evidences of intragenic recombination have been found in PmMSP1. The rate of nonsynonymous nucleotide substitutions significantly exceeded that of synonymous nucleotide substitutions in block IV, suggesting positive selection in this region. Codon-based analysis of deviation from neutrality has identified a codon under purifying selection located in close proximity to the homologous region of the 38 kDa/42 kDa cleavage site of P. falciparum MSP1. A number of predicted linear B-cell epitopes were identified across both conserved and variable blocks of the protein. However, polymorphism in repeat-containing blocks resulted in alteration of the predicted linear B-cell epitope scores across variants. Although a number of predicted HLA-class II-binding peptides were identified in PmMSP1, all variants of block IV seemed not to be recognized by common HLA-class II alleles among Thai population, suggesting that diversity in this positive selection region could probably affect host immune recognition. The data on structural diversity in PmMSP1 could be useful for further studies such as vaccine development and strain characterization of this neglected malaria parasite.
Subject terms: Microbiology, Molecular biology, Diseases
Introduction
Despite annual declines in global malaria cases caused by the two major human malaria parasites Plasmodium falciparum and P. vivax during the past 2 decades due to integrative control measures, an increase in the number of infections by the low prevalent species including P. malariae and P. ovale spp. has been observed in some African endemic areas, such as Tanzania, Gabon, Democratic Republic of Congo and Uganda1–4. Although P. malariae infection usually does not result in acute severe symptoms, repeated and long-term exposures may be associated with chronic glomerulonephritis in children and adolescents in some endemic areas, especially Sub-Saharan Africa and Papua New Guinea5–8. While more compelling evidences are required to document chloroquine-resistance in P. malariae, the blood stage infection of this Plasmodium species may persist for an unusually long period and can recrudesce after many years of dormancy9–11. Like other human malaria parasites, P. malariae has been incriminated in transfusion-transmitted malaria in which the prevalence seems to vary across endemic areas12,13. Meanwhile, the low parasite density of P. malariae among infected individuals has hampered efficient detection by conventional microscopy, especially when it co-infects with other malaria species14–16. On the basis of microscopy diagnosis, P. malariae infection accounted for approximately 0.1% of all malaria cases in Thailand17 whereas PCR could diagnose about five times higher than microscopic examination15,18–20. To achieve malaria control and elimination, it may require effective interventions against the low prevalent Plasmodium species including P. malariae.
One of the leading vaccine candidates against asexual blood stages of malaria parasites is merozoite surface protein-1 (MSP1) which is believed to play a crucial role in invasion of host erythrocytes by the merozoites and during their egression from infected cells after asexual reproductive maturation21,22. The MSP1 of P. falciparum (PfMSP1) is synthesized as a precursor protein during schizogony and subsequently processed into 4 polypeptides of 83, 30, 38 and 42 kDa. Prior to erythrocyte entry of the merozoites, secondary processing of the C-terminal 42-kDa fragment ensues, yielding 33- and 19-kDa protein fragments23. On the basis of amino acid sequence identity, PfMSP1 have been divided in to 17 blocks, containing five conserved, five semi-conserved and seven variable blocks24. The 19-kDa fragment containing two epidermal growth factor (EGF)-like domains has been considered to be a potential vaccine candidate because it is a target for invasion inhibitory antibodies while naturally acquired antibodies against this fragment have been associated with protection against symptomatic malaria among individuals living in malaria endemic areas25,26. Likewise, the tripeptide repeats in PfMSP1 could elicit protective antibodies among children in Sub-Saharan Africa27. Furthermore, erythrocyte-binding domains have been identified in the 83-, 38- and 33-kDa fragments of PfMSP128–31. Meanwhile, the MSP1 gene of P. vivax (PvMSP1) displays mosaic organization of variable blocks whereas those of P. ovale spp. (PoMSP1) and P. knowlesi (PkMSP1) exhibit structural variation that are different from that of PfMSP132–34.
To date, mainly partial sequences of the MSP1 gene of P. malariae (PmMSP1) have been determined using isolates from French Guiana, Cameroon, Brazil and Thai-Myanmar border which reveals conserved and variable blocks35–39. However, the organization of these blocks based on the complete coding sequences remains to be elucidated. Herein, we analyzed the complete coding sequence of PmMSP1 among clinical isolates from diverse endemic areas of Thailand. Results have shown that PmMSP1 contained four variable blocks flanked by five conserved blocks. Like other human malarial MSP1s, intragenic recombination and natural selection have influenced diversity at this locus24,32–34. Furthermore, analysis of predicted linear B-cell and helper T-cell epitopes has suggested that polymorphism in this protein could affect host immune recognition.
Results
Amplification and sequencing of PmMSP1
The complete coding region of PmMSP1 was amplified from 35 Thai isolates (PM1–PM35). The origins and years of sample collections are shown in Fig. 1. Of these, 15 isolates contained single infections of P. malariae and the remaining samples were co-infected with P. falciparum (n = 2), P. vivax (n = 17) and both P. falciparum and P. vivax (n = 1). However, the PCR primers used in this study were specific for amplification of PmMSP1 because direct sequencing of the PCR-amplified products yielded clear electropherogram without superimposed signals of the sequences of this locus. Therefore, no cross amplification of PfMSP1 and PvMSP1 was observed in isolates containing P. falciparum or P. vivax. The complete coding sequences of PmMSP1 in this study varied from 5088 to 5493 bp. In total, 20 alleles were identified in which alleles III, VI, X, XIII and XV contained more than one isolate (Fig. 1). Interestingly, the same alleles could be found from different sampling periods and from diverse endemic areas of the country. For example, allele XIII consisting of 5418 bp, occurred in five isolates from Mae Hong Son, Tak, Trat, Ranong and Yala Provinces collected during 1994, 2004, 2007 and 2008 (Fig. 1).
Structural organization of PmMSP1
To determine the structural organization of PmMSP1, nucleotide diversity was determined across the aligned complete coding sequences of 35 Thai isolates and the sequence from a Cameroonian patient (GenBank accession no. FJ824669) whose nucleotide and amino acid positions of the gene/protein were used as reference. Results revealed that the extent of nucleotide diversity was variable across PmMSP1 with two regions containing nucleotide diversity > 0.04 (Fig. 2A), a comparable level for variable blocks of PvMSP132. One was from codons 210 to 241, designated block IV, and the other was in block VI spanning amino acids 682 and 832. Short insertions and deletions (indels) were found between codons 57 and 69 at the N-terminal part of PmMSP1, designated block II, (Fig. 3). Meanwhile, Tandem Repeats Finder algorithm has identified two repeat-containing regions in PmMSP1, one corresponding to block VI and the other from codons 989 to 1032 (block VIII). Therefore, the remaining nonrepeat regions encompassing approximately 86% of the entire coding region in PmMSP1 with nucleotide diversity < 0.02 were assigned to conserved blocks, consisting of blocks I, III, V, VII and IX (Fig. 2B).
Diversity of indels in PmMSP1
Previous reports have shown that P. malariae and P. brasilianum possessed similar or almost indistinguishable genetic background38,40,41. To gain further insight into sequence diversity in PmMSP1, the previously reported partial sequences of PmMSP1 and the MSP1 sequences of P. brasilianum (PbrMSP1) were included for comparison between the corresponding regions38,39. Despite short indels in block II, nine variants were identified. Of these, six variants occurred in Thai isolates whereas five variants were found in PbrMSP1 in which alleles V and VI were shared between PmMSP1 and PbrMSP1 (Table 1).
Table 1.
Allele | Sequence | Total | Distribution | ||
---|---|---|---|---|---|
Species | Country* (n) | ||||
P.m | P.br | ||||
I | NKDGNT––TTN–--ANN | 21 | 21 | – | Thailand (21) |
II | NKDGNTSTTTN––-ANN | 8 | 8 | – | Thailand (8) |
III | NKDGNT––TTNANNANN | 1 | 1 | – | Thailand (1) |
IV | NKDGNTS-TTNANNANN | 1 | 1 | – | Thailand (1) |
V | NKDGNT––TTD–-–ANN | 9 | 7 | 2 |
Myanmar (3), Thailand (2), Cameroon (1), French Guiana (1) French Guiana (2) |
VI | ––––-–––-TN––-ANN | 7 | 3 | 4 | Thailand (2), Saudi Arabia (1), Brazil (4) |
VII | ––––-–––-TNANN––N | 2 | – | 2 | Brazil (2) |
VIII | ––––-–––-TDANN––N | 1 | – | 1 | Brazil (1) |
IX | NKDGST––TTD–-–ANN | 1 | – | 1 | French Guiana (1) |
P.m and P.br denote Plasmodium malariae and P. brasilianum, respectively. Thai isolates and GenBank accession numbers of isolates elsewhere are (I) PM2-PM8, PM10, PM14, PM16-PM19, PM22-PM26 and PM31-PM33; (II) PM9, PM11, PM12, PM21, PM28-PM30 and PM34; (III) PM1, (IV) PM35, (V) PM20, PM27, FJ824669, KX672046-KX672048, AF138879, AF138881 and AF138882; (VI) PM13, PM15, AF138878, FLQW01000468, KC906711, KC906714, KC906715, (VII) KC906713 and KC906716; (VIII) KC906712 and (IX) AF138880. *Bolds are countries for P. brasilianum.
Diversity of block IV in PmMSP1
Of 32 codons in block IV of PmMSP1, amino acid substitutions were found in 20 residues, resulting in 16 haplotypes based on analysis of isolates from Thailand and elsewhere including those belonging to PbrMSP1 (Table 2). Of these, 10 haplotypes were identified among Thai isolates in which three haplotypes were shared across endemic countries. All haplotypes of available PbrMSP1 sequences (n = 4) were distinct from those of PmMSP1. However, allele VI from four Brazilian isolates (GenBank accession nos. KR072269, KR072272, KR072278 and KR072279) and allele XV from a Peruvian Saimiri monkey (KR072284) were closely related with a single amino acid difference (I233K)38. It is noteworthy that seven of nine amino acid substitutions in block IV of PbrMSP1 were shared with those of PmMSP1 (Table 2).
Table 2.
Allele | Sequence | Total | Distribution | ||
---|---|---|---|---|---|
Species | Country* (n) | ||||
P.m | P.br | ||||
I | YDNIATTNKELEAPSGSGSDDEDIKNCDKKQK | 14 | 14 | Thailand (11), Brazil (3) | |
II | ......K......H.............A...E | 8 | 8 | Thailand (8) | |
III | .N...DE..K.....E...........NE... | 6 | 6 | Myanmar (3), Thailand (2), Cameroon (1) | |
IV | .............S.................. | 5 | 5 | Thailand (5) | |
V | ....T...N..VTSNVPRP.....T....... | 5 | 5 | Thailand (4), Saudi Arabia (1) | |
VI | ........N..K.L.E.RP.....T....... | 4 | 4 | Brazil (4) | |
VII | ........N..K...EPRP.....T....... | 3 | 3 | Brazil (3) | |
VIII | ........N..K...E.RP.....R....... | 1 | 1 | Brazil (1) | |
IX | ......K....................A...E | 1 | 1 | Thailand (1) | |
X | H.......N....S............Y..... | 1 | 1 | Thailand (1) | |
XI | ........N..K...E.RP.....T....... | 1 | 1 | Thailand (1) | |
XII | .N...DE..K.....E................ | 1 | 1 | Thailand (1) | |
XIII | .N...DE..K...S.E...........GI... | 1 | 1 | Thailand (1) | |
XIV | ........N..K.H.E.RT.....T....... | 2 | 2 | Brazil (2) | |
XV | ........N..K.L.E.RP....KT....... | 1 | 1 | Peru (1) | |
XVI | ........N..K.H.E.RT.....T.Y..... | 1 | 1 | Brazil (1) |
P.m and P.br denote Plasmodium malariae and P. brasilianum, respectively. Dots represent corresponding identical amino acids per allele I. Thai isolates and GenBank accession numbers of isolates elsewhere are (I) PM3-PM8, PM16-PM19, PM33, KR072273, KR072274 and KR072276; (II) PM13 and PM20-PM26; (III) PM1, PM27, KX672046–KX672048 and FJ824669; (IV) PM11, PM12 and PM28–PM30; (V) PM14, PM31, PM32, PM34 and FLQW01000468; (VI) KR072269, KR072272, KR072278 and KR072279; (VII) KR072271, KR072275 and KR072277; (VIII) KR072270; (IX) PM9; (X) PM10; (XI) PM15; (XII) PM2; (XIII) PM35; (XIV) KR072281 and KR072283; (XV) KR072284 and (XVI) KR072282. *Bolds are countries for P. brasilianum.
Diversity of repeats in PmMSP1
Block VI contained complex repeat motifs with multiple patterns of repeat arrays and arrangements. Together with previously reported sequences of PmMSP1 (n = 16) and PbrMSP1 (n = 5), 35 haplotypes have been identified in this block in which 19 haplotypes occurred among Thai isolates (Fig. 3). The N-terminal part of this block contains non-repetitive amino acid sequences with variable indels, resulting in eight to 40 residues in this region. On the basis of distinct repeats and arrangements, block VI could be classified into types A and B. Type A contained 17 alleles (A1.1–A1.17) whereas type B could be further subdivided into subtypes B1 and B2, containing eight and 10 alleles, respectively (Fig. 3). Interestingly, the amino acid sequence of allele A1.10 was shared between P. malariae from a Brazilian patient (KR072216) and P. brasilianum from a Peruvian Saimiri monkey (JX045641) whereas the remaining PbrMSP1 and most other PmMSP1 type A alleles seemed to be closely related. It is noteworthy that none of PbrMSP1 sequences belonged to type B. Meanwhile, the other repeat-containing region was located in block VIII spanning codons 989 and 1032 (residues after FJ824669), characterized by a degenerate octapeptide repeat motif, P(A)Q(T)P(S, T or Q)QA(S)A(S or T)L(S or V)P(V or -), with variation in the number of repeat units among isolates. Of 35 Thai isolates and 14 previously reported sequences, 13 haplotypes were identified in this block in which haplotype I was most common and occurred in isolates from Thailand, Myanmar and Brazil, followed by haplotype XIII which was shared between PmMSP1 and PbrMSP135,38,39 (Table 3).
Table 3.
Allele | Sequence | Distribution (n) | ||
---|---|---|---|---|
Species | Country* | |||
P.m | P.br | |||
I | PQPQAALPAQPQAALPAQPQAALPAQPQAAVPAQSQATVPAQSQAAVPATTQSSSVSAPT | 15 | Thailand (12), Myanmar (1), Brazil (2) | |
II | PQPQAALPAQPQAALPAQPQAAV----------------PAQSQAAVPATTQSSSVSAPT | 6 | Thailand (6) | |
III | PQQQSSS-AQPQAALPAQPQAAV--------PAQSQATVPAQSQAAVPATTQSSSVSAPT | 5 | Thailand (5) | |
IV | PQPQAALPAQPQAALPAQPQAALPAQPQAALPAQPQAAVPAQSQAAVPATT--------- | 4 | Thailand (4) | |
V | PQQQSSS-AQPQAAVPAQSQATV----------------PAQSQAAVPATTQSSSVSAPT | 4 | Thailand (3), Saudi Arabia (1) | |
VI | PQPQAALPAQPQAALPAQPQAAVPAQSQATV--------PAQSQAAVPATTQSSSVSAPT | 2 | Thailand (2) | |
VII | PQPQAALPAQPQAAVPAQSQATV----------------PAQSQAAVPATTQSSSVSAPT | 1 | Cameroon (1) | |
VIII | PQPQAALPAQPQAALPAQPQAALPAQPQAALPAQPQAAVPAQSQAAVPATTQSSSVSAPT | 1 | Thailand (1) | |
IX | PQQQSSS-AQPQAALPAQPQAAVPAQSQATVPAQSQAAVPAQSQAAVPATTQSSSVSAPT | 1 | Thailand (1) | |
X | PQPQAALPAQPQAAV––––––––––––––––––––––––––––––––PATTQSSSVSAPT | 1 | Thailand (1) | |
XI | PQPQAALPAQPQAALPAQPQAALPAQPQAAVPAQSQATVPAQSQAAVPATTQSSSVSAPT | 1 | Myanmar (1) | |
XII | PQPQAALPAQPQAALPAQPQAAVPAQSQAAL––––––––PAQSQAAVPATTQSSSVSAPT | 1 | Brazil (1) | |
XIII | PQPQAALPAQPQAALPAQPQAALPAQPQAAVPAQSQAALPAQSQAAVPATTQSSSVSAPT | 8 | 4 | Brazil (8), Brazil (3), Peru (1) |
P.m and P.br denote Plasmodium malariae and P. brasilianum, respectively. Dash indicates a deletion. Thai isolates and GenBank accession numbers of isolates elsewhere are (I) PM1, PM2, PM10, PM14, PM20–27; KX672048, KR072259 and KR072258; (II) PM3–PM7; (III) PM11, PM12 and PM28–PM30; (IV) PM16–PM19; (V) PM31, PM34, PM35 and FLQW01000468, (VI) PM32 and PM33; (VII) FJ824669; (VIII) PM9; (IX) PM13; (X) PM15; (XI) KX672047; (XII) KR072262; (XIII) KR072254–KR072257, KR072260–KR072263, KR072265–KR072268 and KY189272. *Bolds are countries for P. brasilianum.
Microheterogeneity in conserved blocks
The complete sequences of all 5 conserved blocks have been available from 35 Thai isolates and an isolate from Cameroon (FJ824669). All nucleotide substitutions in conserved blocks were dimorphic, i.e. either one or the other of any two bases occurred at given positions. In total, 37 mutations were observed in conserved blocks, resulting in three haplotypes in blocks I and III, nine in block V, four in block VII and 13 in block IX (Table 4). The levels of nucleotide diversity in conserved regions ranged from 0.00098 to 0.00232 in blocks I and VII, respectively, which was an order or two orders of magnitude less than those in variable blocks (blocks IV, VI and VIII). Since the 19-kDa fragment of PfMSP1 has been considered to be an asexual blood stage vaccine target26, microheterogeneity in this region is of concern for vaccine development. Analysis of the homologous region to the 19-kDa-fragment-encoding sequence in PmMSP1 has revealed 4 nucleotide substitutions: c.5045G>A (G1681E), c.5055A>C (E1684D), c.5060A>T (E1686V) and c.5074C>A (Q1691K) (positions after the FJ824669 sequence). In total, four haplotypes occurred in the putative 19-kDa fragment of PmMSP1, characterized by (1) G-E-E-Q, (2) E-E-E-K, (3) E-D-E-K and (4) E-E-V-K, in which haplotype I was found in the Cameroonian isolate (FJ824669) whereas the remaining haplotypes co-existed among P. malariae populations in Thailand. Of these four substituted residues, c.5044G>C+5045G>A (G1681Q) was found in three isolates from Brazil and constituted another haplotype characterized by Q-E-E-K although singletons were previously observed in other five positions of this region38.
Table 4.
Block | No. codons | M | S | H | h ± S.D | π ± S.E | dS ± S.E | dN ± S.E |
---|---|---|---|---|---|---|---|---|
I (conserved) | 56 | 2 | 2 | 3 | 0.160 ± 0.080 | 0.00098 ± 0.00071 | 0.01980 ± 0.02021 | 0.00501 ± 0.00503 |
II (indels) | 5–15 | – | – | 6 | 0.157 ± 0.077 | – | 0.00000 ± 0.00000 | 0.03810 ± 0.03041 |
III (conserved) | 140 | 2 | 2 | 3 | 0.398 ± 0.081 | 0.00103 ± 0.00090 | 0.00000 ± 0.00000 | 0.00392 ± 0.00271 |
IV (variable) | 32 | 30 | 27 | 10 | 0.838 ± 0.036 | 0.07763 ± 0.01894 | 0.01882 ± 0.01386 | 0.12613 ± 0.02543** |
V (conserved) | 440 | 10 | 10 | 9 | 0.740 ± 0.048 | 0.00144 ± 0.00061 | 0.00158 ± 0.00149 | 0.00397 ± 0.00130 |
VI (repeats) | 97–229 | – | – | 20 | 0.863 ± 0.040 | 0.12162 ± 0.01382 | – | – |
VII (conserved) | 156 | 10 | 10 | 4 | 0.340 ± 0.093 | 0.00232 ± 0.00088 | 0.01346 ± 0.00949 | 0.01282 ± 0.00454 |
VIII (repeats) | 28–60 | – | – | 10 | 0.567 ± 0.071 | 0.02852 ± 0.00954 | – | – |
IX (conserved) | 719 | 13 | 12 | 13 | 0.890 ± 0.027 | 0.00113 ± 0.00043 | 0.00115 ± 0.00110 | 0.00161 ± 0.00051 |
All | 1696–1831 | 72 | 68 | 21 | 0.944 ± 0.021 | 0.01204 ± 0.00162 | 0.00208 ± 0.00077 | 0.00447 ± 0.00065* |
M the number of mutations, S the number of segregating sites, H the number of haplotypes, h haplotype diversity, π nucleotide diversity, dS number of synonymous substitutions per synonymous site, dN number of nonsynonymous substitutions per nonsynonymous site, S.D. standard deviation, S.E. standard error. Analysis includes 35 Thai isolates and the FJ824669 sequence. Z-tests of the hypothesis that mean dS equals that of mean dN: * p < 0.05; **p < 0.0005.
Neutrality test
To test for departure from neutrality, nucleotide substitutions in nonrepeat regions were analyzed by comparing the rate of synonymous substitutions per synonymous site (dS) and that of nonsynonymous substitutions per nonsynonymous site (dN) for each block of PmMSP1. Results revealed that dN exceeded dS in conserved blocks II, V and IX, and variable block IV. However, significant difference between dN and dS was observed only in block IV (Z-test, p < 0.0005) (Table 4). Meanwhile, codon-based detection of deviation from neutrality by the FUBAR method has shown evidence of positive selection in blocks I (E26K), IV (P223L/S/H and K241E), V (P294R) and IX (P1045Q/R and E1684D). On the other hand, evidence of purifying selection was found at codon 1374 (GAT⟶GAC, p.D1374) in conserved block IX of PmMSP1, a homologous residue located in close proximity to the 38 kDa/42 kDa cleavage site in PfMSP1 (Supplemental Fig. S1).
Recombination
Evidence of intragenic recombination in the PmMSP1 gene was determined from 35 Thai isolates by using the RDP4 package which revealed 21 potential recombination sites across the coding region of this gene (Table 5). Recombination breakpoints were detected more commonly in repeats or variable blocks (28 of 42 sites, 66.7%) than in conserved blocks. On the other hand, no recombination event was detected in conserved blocks I, II and V. Recombination breakpoints spanned 41–3978 bp with an average length of 784 bp.
Table 5.
Event no. | Recombination breakpoints | Methods (p value) | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Between positions* | Between blocks | RDP | GENECONV | Bootscan | Maxchi | Chimaera | SiSscan | 3Seq | |||
1 | 2187 | 2728 | VI | VII | 4.69 × 10–12 | 1.07 × 10–9 | 1.72 × 10–10 | 7.26 × 10–14 | 1.93 × 10–14 | 1.79 × 10–19 | 2.03 × 10–21 |
2 | 2192 | 2841 | VI | VII | 3.02 × 10–4 | 4.76 × 10–3 | NS | 2.26 × 10–5 | 1.66 × 10–6 | 4.72 × 10–11 | 5.47 × 10–4 |
3 | 706 | 4684 | IV | IX | NS | 4.47 × 10–5 | 9.81 × 10–6 | 5.22 × 10–5 | 2.39 × 10–5 | 6.55 × 10–5 | 1.90 × 10–9 |
4 | 1556 | 4190 | V | IX | NS | 1.51 × 10–3 | 1.15 × 10–4 | 9.31 × 10–5 | NS | 9.78 × 10–5 | 1.19 × 10–8 |
5 | 2192 | 2353 | VI | VI | 4.47 × 10–4 | NS | NS | 3.65 × 10–4 | 5.06 × 10–5 | 8.60 × 10–5 | 8.78 × 10–8 |
6 | 1361 | 2979 | V | VIII | 1.87 × 10–7 | 6.47 × 10–6 | NS | NS | NS | NS | NS |
7 | 2146 | 2192 | VI | VI | NS | 0.02319 | NS | 6.99 × 10–7 | 9.12 × 10–3 | NS | NS |
8 | 2467 | 3057 | VI | VIII | 6.45 × 10–5 | 2.38 × 10–5 | 4.07 × 10–6 | 5.40 × 10–3 | 4.42 × 10–3 | NS | 2.97 × 10–7 |
9 | 2377 | 2462 | VI | VI | NS | 3.89 × 10–5 | 7.90 × 10–6 | NS | NS | NS | 9.50 × 10–5 |
10 | 2303 | 2369 | VI | VI | NS | NS | NS | 9.83 × 10–4 | 2.78 × 10–3 | NS | 1.08 × 10–5 |
11 | 644 | 1599 | IV | V | NS | NS | NS | 4.60 × 10–3 | NS | NS | 1.44 × 10–5 |
12 | 2482 | 2869 | VII | VII | NS | 6.08 × 10–3 | 2.19 × 10–3 | NS | NS | NS | 8.02 × 10–5 |
13 | 797 | 1844 | V | V | NS | 1.98 × 10–2 | 5.39 × 10–3 | NS | NS | NS | 8.63 × 10–4 |
14 | 2294 | 2466 | VI | VI | NS | NS | NS | NS | NS | 1.82 × 10–13 | 1.59 × 10–3 |
15 | 644 | 2147 | IV | VI | NS | NS | NS | 1.48 × 10–2 | NS | NS | 2.18 × 10–3 |
16 | 2197 | 2302 | VI | VI | NS | NS | NS | NS | NS | NS | 2.62 × 10–3 |
17 | 2212 | 2328 | VI | VI | NS | NS | NS | NS | NS | NS | 5.35 × 10–3 |
18 | 1847 | 2185 | V | VI | 8.05 × 10–3 | NS | NS | NS | NS | NS | 1.35 × 10–2 |
19 | 2357 | 2840 | VI | VII | NS | NS | NS | NS | NS | NS | 1.35 × 10–2 |
20 | 2172 | 2213 | VI | VI | NS | NS | NS | 2.18 × 10–3 | NS | NS | 2.63 × 10–2 |
21 | 644 | 1599 | IV | V | NS | NS | NS | NS | NS | NS | 2.73 × 10–2 |
*Positions after the FJ824669 sequence.
Phylogenetic analysis
Analysis of the complete coding sequences of PmMSP1 has revealed two distinct clades in the phylogenetic tree (Fig. 4). The maximum likelihood tree inferred from the sequences of block VI per se has revealed 2 clades corresponding to characteristic repeats assigned to types A and B. It is noteworthy that the bifurcating clusters of taxa in the clade belonging to type B were in line with the isolates bearing types B1 and B2 repeats (Figs. 3, 4). On the other hand, the tree inferred from the sequences excluding block VI showed a different topology.
Predicted linear B-cell epitopes
The graphical presentation from BepiPred 2.0 analysis has revealed a number of potential linear B-cell epitopes across PmMSP1, spanning both conserved and variable blocks (Fig. 5A). Short indels in block II did not affect predicted B-cell epitopes encompassing this region. Interestingly, amino acid substitutions in variable block IV seemed not to affect predicted linear B-cell epitopes in all variants (Fig. 5B). On the other hand, the predicted epitope scores were variable among different alleles of blocks VI and VIII (Fig. 5C,D). Variation in the predicted scores was more pronounced among variants in block VI in which some regions were below the cutoff threshold value for being linear B-cell epitopes.
Predicted helper T-cell epitopes
Analysis of HLA-class II-binding peptides in PmMSP1 based on common HLA-DR alleles in Thai population (allele frequencies > 10%) including HLA-DRB1*12:02, -DRB1*15:02, -DQB1*05:01, -DQB1*05:02, -DQB1*03:01, -DQB1*03:03, -DQA1*01:01, -DQA1*01:02, -DQA1*03:02 and -DQA1*06:01 has predicted a number of potential binding peptides predominantly outside blocks VI and VIII which contained repeats (Supplemental Fig. S2). Block IV did not receive adequate scores for being HLA-class II-binding peptides (percentile rank < 10 and MHC binding affinity IC50 < 1000 nM) for these common HLA class II alleles42,43. However, searching for potential HLA-class II-binding peptides among alleles spanning block IV from residues 207 to 221 has shown that alleles II, IV and V had percentile ranks less than 10 and MHC binding affinity IC50 < 1000 nM for some uncommon HLA class II alleles in Thailand44. On the other hand, a potential HLA-class II-binding peptide was identified in one of nine alleles (allele V) of block IV encompassing residues 211–225 (Table 6). Taken together, these peptide variants could be potential helper T-cell epitopes in this molecule albeit being recognized by some uncommon HLA class II alleles among Thai population44.
Table 6.
Amino acid residue† | Allele | Peptides and their variants | Prevalence (%, n = 35) | HLA§ | Allele Frequency* | IC50# | Peptide rank# |
---|---|---|---|---|---|---|---|
207–221 | I | KKEYNNIADENKKLE | 11.43 | None | – | – | – |
II | KKEYDNIATTNKELE | 45.71 |
DQA1*04:01/ DQB1*04:02 |
0.0021/ 0.0032 |
747 | 9.9 | |
III | KKEYDNIATKNKELE | 25.71 | None | – | – | – | |
IV | KKEYDNITTTNNELV | 11.43 | DRB1*13:02 | 0.0138 | 43.7 | 5.5 | |
DRB1*04:05 | 0.0489 | 235.4 | 10 | ||||
V | KKEHDNIATTNNELE | 2.86 |
DQA1*04:01/ DQB1*04:02 |
0.0021/ 0.0032 |
478 | 6.4 | |
DQA1*03:01/ DQB1*03:02 |
0.0457/ 0.0426 |
878 | 9.9 | ||||
VI | KKEYDNIATTNNELK | 2.86 | None | – | – | – | |
211–225 | I | NNIADENKKLEAPSE | 8.57 | None | – | – | – |
II | DNIATTNKELEAPSG | 31.43 | None | – | – | – | |
III | DNIATKNKELEAHSG | 22.86 | None | – | – | – | |
IV | DNIATTNKELEASSG | 14.29 | None | – | – | – | |
V | DNITTTNNELVTSNV | 11.43 | DRB1*13:02 | 0.0138 | 55.9 | 6.6 | |
VI | DNIATTNNELEASSG | 2.86 | None | – | – | – | |
VII | DNIATKNKELEAPSG | 2.86 | None | – | – | – | |
VIII | DNIATTNNELKAPSE | 2.86 | None | – | – | – | |
IX | NNIADENKKLEASSE | 2.86 | None | – | – | – |
Discussion
In this study, we have shown that the complete coding sequence of PmMSP1could be partitioned into five conserved and four variable blocks. Like other malarial MSP1s, conserved blocks of PmMSP1 exhibited microheterogeneity of sequences with dimorphic nucleotide substitutions24,32–34,45,46. Comparative analysis has revealed that short indels in block II of PmMSP1 seemed to be homologous to a short indel region at the 5′ portion of PvMSP1. Likewise, variable nonrepeat block IV of PmMSP1 were found to be homologous to variable nonrepeat blocks of PoMSP1, and repeat domains of PkMSP1 and PvMSP1 (Supplemental Fig. 3). Likewise, repeat blocks VI and VIII of PmMSP1 were homologous to blocks VIII and X of PocMSP1 and PowMSP1, blocks IV and VI of PkMSP1, and blocks VI and VIII of PvMSP1 (Supplemental Fig. 3). Meanwhile, the distantly related PfMSP1 also contained repeats in blocks VIII homologous to block VI of PmMSP1. Although variable and semi-conserved blocks of PfMSP1 consisted of two distinct parental alleles (MAD20 and K1), sequences of these regions were highly conserved within each allelic family24,45. Therefore, intraspecific conserved blocks of these malarial MSP1 genes seemed to be largely found in corresponding locations. Taken together, the similarity in primary structural organization of MSP1s across Plasmodium species may suggest that this locus has evolved from a common ancestral sequence whereas the lack of homologous regions in some domains of the genes among species could imply post-speciation evolution of individual MSP1 lineages. Consistently, it has been suggested that positive selection could influence lineage-specific evolutionary history of some human and simian malarial MSP1s47.
It is noteworthy that the levels of nucleotide diversity of PkMSP1, PvMSP1 and PfMSP1 were comparable among Thai isolates. On the other hand, the level of nucleotide diversity of PmMSP1 was significantly less than those of PkMSP1, PvMSP1 and PfMSP1 but remarkably greater than those of PoMSP119,32,33,48–50. Consistent findings were observed when analysis was performed separately for synonymous (πS) and nonsynonymous sites (πN) (Supplemental Table 2). The extent of nucleotide diversity among MSP1s of different Plasmodium species in Thailand could be due to evolutionary and population genetic forces on parasite populations such as mutation, recombination and population processes. Meanwhile, the neutral theory of molecular evolution predicts that the level of nucleotide diversity is proportional to the mutation rate (μ) and effective population size (Ne) under mutation-drift equilibrium51. Since the mutation rates of malarial MSP1 genes seemed to be similar across species52–54, variation in the levels of nucleotide diversity of these loci could be due to the difference in effective population sizes among Plasmodium species in Thailand. Although some regions or residues in malarial MSP1s were deviated from selective neutrality, the remaining majority of sequences seemed to be under neutral evolution. Therefore, the level of nucleotide diversity may roughly reflect the number of breeding individuals in the population. On the other hand, the low level of nucleotide diversity in PmMSP1 could represent the low transmission rate and probably from bottleneck effects due to malaria control measures as previously noted38,46. Our previous surveys of malaria in Thailand have shown that the prevalence of P. malariae and P. knowlesi in Thailand was comparable15,18–20. Therefore, it is likely that the higher level of nucleotide diversity of PkMSP1 than that of PmMSP1 could stem from a hidden large reservoir of P. knowlesi in its macaque natural hosts in this country33,55. On the one hand, the haplotype diversity of most malarial MSP1 genes in Thailand was relatively high (> 0.9), implying that distinct or rare haplotypes were abundant in the populations (Supplemental Table S2). On the other hand, we observed some predominant PmMSP1 haplotypes in this country, i.e. haplotypes III, XIII and X, which occurred across endemic provinces and between long time intervals of sample collections (Fig. 1), suggesting that the parasites bearing these haplotypes could probably have reproductive advantage.
Conserved blocks in PmMSP1 displayed microheterogeneity of sequences in which nucleotide substitutions seems to have evolved neutrally because block-wise analysis revealed that dS was not significantly different from dN (Table 4). However, codon-based analysis has identified four positively selected codons in conserved blocks, suggesting that natural selection has influenced evolution of particular codons. Interestingly, one of these codons (residue E1684D) was located between the two EGF-like domains at the C-terminal part of PmMSP1 in which the homologous region in PfMSP1 has been a target for naturally acquired antibodies associated with clinical protection against falciparum malaria26. Intriguingly, positive selection in the EGF-like domain of PmMSP1 could probably be driven by host immune pressure. On the other hand, evidence for purifying selection was detected at codon 1374 (GAT ⟶ GAC) that was located in close proximity to the canonical 38 kDa/42 kDa cleavage site in PfMSP156. Importantly, cleavage at this site has been shown to be a rate-limiting processing step, suggesting its pivotal role for MSP1 proteolytic maturation57,58. Therefore, deviation from selective neutrality occurred at particular residues in conserved regions of PmMSP1.
The variable nonrepeat block IV of PmMSP1 spanned 32 codons with 21 amino acid substitutions, resulting in 16 alleles among Thai and global isolates (Table 2). The significant difference in dN exceeding dS in this block implies that positive selection could influence diversity in this region (Table 4). On the basis of amino acid alignment, block IV of PmMSP1 was homologous to block III of PfMSP1, a portion of the 83-kDa fragment which forms a flexible wing domain of the protein as demonstrated by single-particle cryo-electron microscopy31. Several lines of evidence have suggested that MSP1 could be detected as monomeric and dimeric forms58–60. It has been shown that dimerization of PfMSP1 involves the interaction between the 83-kDa and 42-kDa fragments31. Although the significance of dimerization of PfMSP1 remains unknown, the protective capability against falciparum malaria conferred by natural antibodies to the 83-kDa fragment could suggest the functional importance of this region61. Importantly, in silico analysis has shown that block IV of PmMSP1 contained both B-cell and helper T-cell epitopes. Consistently, recombinant proteins derived from various regions of PmMSP1 including the N-terminal fragment elicited strong immunogenicity in mice62 and were highly recognized in serum samples of primates and non-human primates from malaria endemic areas63,64. Although allelic variation in block IV of this protein seemed not to drastically change the propensity of being B-cell epitopes as predicted by the IEDB analysis resource (Fig. 5B), amino acid substitutions in this region were unlikely recognized by common HLA class II alleles among Thai population (Fig. 4B, Table 6, Supplemental Fig. S2). Importantly, mutations in block IV may reduce or totally abolish predicted binding capability of the peptides to some uncommon HLA class II alleles in Thai population (Table 6; Supplemental Fig. S2). Undoubtedly, further studies are required to address the immunological significance of helper T-cell epitopes in block IV of PmMSP1. Therefore, it seemed that positive selection in block IV could probably be driven by host immune pressure.
Repetitive amino acid sequences have been observed in several malarial antigens including MSP1s. Our analysis has revealed two repeat-containing regions in blocks VI and VIII of PmMSP1. Unlike block VIII that contained degenerate octapeptide motifs, repeats in block VI were more complex, characterized by a repertoire of different repeat arrays and arrangements. Meanwhile, the RDP4 package has identified 21 recombination breakpoints in PmMSP1. Interestingly, about half of recombination events involved block VI whereas about one-third of the breakpoints occurred within this block. Besides slip-strand mispairing mechanism that could generate sequence and size variation in repeat sequences, recombination may contribute to shuffle of repeat units in block VI. Although a number of linear B-cell epitopes were predicted in this block (Fig. 5A), in silico analysis has suggested that variation in repeat sequences could affect antibody recognition (Fig. 5C). Meanwhile, phylogenetic tree inferred from the block VI sequences of PmMSP1 has revealed two distinct clades, corresponding to repeat sequence types A and B of this block. Importantly, variation in the number of repeat units could affect intensity of antibody reactivity whereas distinct variants of repetitive antigens may abolish specific antibody response as shown by antibody recognition of repeat antigens in block II of PfMSP165,66. Therefore, sequence divergence of repetitive regions in PmMSP1 could probably enhance host immune evasion by the parasites.
One of the shared features of PmMSP1 and PvMSP1 was the presence of indels near the N-terminus of the proteins. Although indels in block II spanned 17 codons, 6 alleles have been identified among Thai isolates (Table 1). Indels are commonly found in both coding and noncoding regions of prokaryotes and eukaryotes genomes while they may occur within repeats and nonrepeat regions67,68. The generation of indels related with repeats could be due to polymerase slippage69,70. On the other hand, the formation of indels in nonrepeat regions required pre-existing palindromic or quasi-palindromic sequences, provoking a double-stranded break intermediate during DNA replication while the ensuing repair process was imperfect71–74. It is noteworthy that quasi-palindromic repeats were identified around indels of PmMSP1 and PvMSP1, supporting the mechanisms for indel formation in nonrepeats of these genes (Supplemental Fig. S4). Although analysis of natural selection on these indels was not possible due to unknown ancestral state of this region, the lack of frame-shift mutation following indels in both PmMSP1 and PvMSP1 could imply selective constraint on the protein structure and/or function.
Several lines of evidence have suggested that P. malariae and P. brasilianum were de facto either con-species or the same parasites38,40,41. A repertoire of alleles in block VI constituting the most polymorphic region of the gene has been identified among PmMSP1 and PbrMSP1 (Fig. 2; Table 4). Importantly, allele A1.10 of block VI was shared between the MSP1 genes of P. malariae and P. brasilianum whereas allele XIII of block VIII has been previously reported to occur in both species38. Like other genes or non-coding loci containing repeats in malarial genomes, variation in repeat sequences and the number of repeat units could be generated by the process of slip-strand mispairing mechanism75,76. Therefore, it is unlikely that identical complex repeats could have arisen from homoplasy. Furthermore, shared alleles between PmMSP1 and PbrMSP1 have been observed in variable blocks II (alleles V and VI) and VIII (allele XIII) whereas a single codon difference was observed between alleles VI and XV in variable block IV (alleles VI and XV) (Tables 1, 2, 3). Taken together, it is likely that P. malariae and P. brasilianum could be identical species or at least con-species as previously noted38,41.
In conclusion, analysis of the complete coding sequences of PmMSP1 from clinical isolates has revealed structural organization of this locus. Besides structural similarity across human malarial MSP1s, evidences of intragenic recombination and natural selection have been identified in PmMSP1. The information from this study could be useful for further studies such as vaccine development and strain characterization of P. malariae based on this molecule.
Materials and methods
Parasite isolates
Thirty-five Plasmodium malariae isolates were obtained from symptomatic malaria patients during surveys of Plasmodium species distribution in Thailand during 1994 and 2016 (Fig. 1). Either finger-pricked or venous blood samples were taken from each subject and spotted onto filter papers or preserved in EDTA, respectively. Both thin and thick blood films were prepared from fresh blood and stained with Giemsa solution for microscopic examination of malaria parasites. DNA was extracted from each blood sample using Qiagen DNA mini kit (Qiagen, Hilden, Germany) per the manufacturer’s recommendation and kept at − 40 °C until use. Definite species identification was performed by species-specific nested PCR targeting 18S rRNA, mitochondrial cytochrome b or cytochrome oxidase I as previously described15,19,20.
PCR amplification and sequencing of the PmMSP1 gene
The complete coding sequence of PmMSP1 was amplified by nested PCR using outer primers: Pmmsp1F0 (5′-TACTCTATATTATCAAGTTTAATTC-3′) and Pmmsp1R0 (5′-CATTCGTATCCTTCTTTTCTGT-3′), and inner primers: Pmmsp1F01 (5′-GTTTAATTCAAAAATGAAAGCAC-3′) and Pmmsp1R01 (5′-TCTTTTTTTCTTAAAGTAAGTTAAAC-3′). Amplification reaction and condition were as previously described34. All amplification reactions were done in an Applied Biosystem GeneAmpH PCR System 9700 thermocycler (PE Biosystems, Foster City, CA). PCR products were analyzed by 1% agarose gel electrophoresis. The PCR products were purified by using a QIAamp PCR purification kit (Qiagen) and used as templates for sequencing. Sequencing primers were deployed to obtain overlapping sequences of the gene in which both directions were determined directly from the PCR-purified templates (Supplemental Table S1). Validation of singletons and indels in the sequences was performed by sequencing of the PCR products from independent amplification reactions using the same genomic DNA as templates.
Data analysis
Alignment of the PmMSP1 nucleotide sequences was performed by using the default option of the MUSCLE program and manually edited77. Indels in coding regions were determined from multiple alignments of amino acid sequences to maintain the reading frame. The sequence of the first complete PmMSP1 gene from a Cameroonian patient was used as reference (GenBank accession number FJ824669)36. Tandem repeats were analyzed by scanning each sequence using window sizes per the default option of the Tandem Repeats Finder version 4.0 algorithm78. Nucleotide diversity was computed from the average number of nucleotide differences per site between two sequences in the sample and the standard errors were estimated by 1000 boostrap pseudoreplicates79. A sliding window analysis of nucleotide diversity was performed by using window length of 100 nucleotides and step size of 15 sites. Haplotype diversity and its sampling variance were determined by using the DnaSP program80. The number of synonymous substitutions per synonymous site and the number of nonsynonymous substitutions per nonsynonymous site was computed using Nei and Gojobori’s method79 with Juke and Cantor correction81. Standard errors of these parameters were estimated by the bootstrap method with 1000 pseudoreplicates using the MEGA 6.0 program82. Differences between the nucleotide diversity values were determined by a two-tailed Z-test. Deviation from selective neutrality at individual codons was identified using fast unconstrained Bayesian approximation (FUBAR) method implemented in the Datamonkey Web-Server83,84. To minimize the interfering signals from recombination on selection of individual codons, the data generated by elimination of recombination segments was deployed for analysis85. Determination of intragenic recombination was performed by using the Recombination Detection Program version 4 (RDP4)86. Phylogenetic trees were constructed by using the Maximum Likelihood method based on the General Time Reversible model with a discrete Gamma distribution to model evolutionary rate differences among sites84. The final tree is the one with the highest log likelihood value. Bootstrap supports for the branching patterns were estimated from 1000 pseudoreplicates of the sample data. Prediction of linear B-cell epitopes was done by using the BepiPred linear epitope prediction 2.0 implemented in the Immune Epitope Database (IEDB) And Analysis Resource87. The threshold for linear B-cell epitopes was more than or equal to the average predicted residue score of the protein. The HLA-class II-binding peptides were predicted by using the IEDB recommended 2.22 algorithm with a default 12–18 residues option88. The criterion for being HLA-class II-binding peptides included the percentile rank ≤ 10 and the IC50 threshold for MHC binding affinity ≤ 1000 nM43. The common HLA class II haplotypes among Thai population were based on allele frequency ≥ 0.1 according to the previous report44.
Ethical approval
The study protocol was approved by the Institutional Review Board on Human Research of Faculty of Medicine, Chulalongkorn University (IRB No. 384/60 and COA No. 805/2018). Written informed consent was obtained from participants or from parents or guardians prior to blood sample collections. All procedures were performed in accordance to the relevant guidelines and regulations.
Supplementary Information
Acknowledgements
We are grateful to all patients who donated their blood samples for this study. This study received financial support from Ratchadapiseksompotch Fund, Faculty of Medicine, Chulalongkorn University (Grant no. RA60/126) to S.J. and C.P.
Author contributions
S.J. and C.P. designed the study and funding acquisition. C.P., S.S., N.K. and S.J. contributed to sample collection. C.P., N.K. and S.S. performed the experiments. C.P. retrieved GenBank sequences. R.R. prepared Fig. 1 and Supplemental Fig. S3. C.P. and S.J. performed data analysis. C.P. drafted the manuscript. S.J. reviewed and finalized the manuscript. All authors approved the manuscript.
Data availability
Thirty-five complete coding sequences of PmMSP1 have been deposited in NCBI GenBank under accession numbers OM525734–OM525768. The datasets generated during and/or analyses during the current study are available from the corresponding author upon request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Chaturong Putaporntip, Email: p.chaturong@gmail.com.
Somchai Jongwutiwes, Email: jongwutiwes@gmail.com.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-022-19049-z.
References
- 1.Betson M, Clifford S, Stanton M, Kabatereine NB, Stothard JR. Emergence of nonfalciparum Plasmodium infection despite regular artemisinin combination therapy in an 18-month longitudinal study of Ugandan children and their mothers. J. Infect. Dis. 2018;217:1099–1109. doi: 10.1093/infdis/jix686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Groger M, et al. Prospective clinical and molecular evaluation of potential Plasmodium ovale curtisi and wallikeri relapses in a high-transmission setting. Clin. Infect. Dis. 2019;69:2119–2126. doi: 10.1093/cid/ciz131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Yman V, et al. Persistent transmission of Plasmodium malariae and Plasmodium ovale species in an area of declining Plasmodium falciparum transmission in eastern Tanzania. PLoS Negl. Trop. Dis. 2019;13:e0007414. doi: 10.1371/journal.pntd.0007414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hawadak J, Dongang Nana RR, Singh V. Global trend of Plasmodium malariae and Plasmodium ovale spp. malaria infections in the last two decades (2000–2020): A systematic review and meta-analysis. Parasit. Vectors. 2021;14:297. doi: 10.1186/s13071-021-04797-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gilles HM, Hendrickse RG. Nephrosis in Nigerian children. Role of Plasmodium malariae, and effect of antimalarial treatment. Br. Med. J. 1963;2:27–31. doi: 10.1136/bmj.2.5348.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ward PA, Kibukamusoke JW. Evidence for soluble immune complexes in the pathogenesis of the glomerulonephritis of quartan malaria. Lancet. 1969;1:283–285. doi: 10.1016/S0140-6736(69)91038-1. [DOI] [PubMed] [Google Scholar]
- 7.Hendrickse RG, Adeniyi A. Quartan malarial nephrotic syndrome in children. Kidney Int. 1979;16:64–74. doi: 10.1038/ki.1979.103. [DOI] [PubMed] [Google Scholar]
- 8.Silva GBDJ, Pinto JR, Barros EJG, Farias GMN, Daher EF. Kidney involvement in malaria: An update. Rev. Inst. Med. Trop. Sao Paulo. 2017;59:e53. doi: 10.1590/S1678-9946201759053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Maguire JD, et al. Chloroquine-resistant Plasmodium malariae in south Sumatra, Indonesia. Lancet. 2002;360:58–60. doi: 10.1016/S0140-6736(02)09336-4. [DOI] [PubMed] [Google Scholar]
- 10.Collins WE, Jeffery GM. Extended clearance time after treatment of infections with Plasmodium malariae may not be indicative of resistance to chloroquine. Am. J. Trop. Med. Hyg. 2002;67:406–410. doi: 10.4269/ajtmh.2002.67.406. [DOI] [PubMed] [Google Scholar]
- 11.Collins WE, Jeffery GM. Plasmodium malariae: Parasite and disease. Clin. Microbiol. Rev. 2007;20:579–592. doi: 10.1128/CMR.00027-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Verra F, et al. A systematic review of transfusion-transmitted malaria in non-endemic areas. Malar. J. 2018;17:36. doi: 10.1186/s12936-018-2181-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Aschar M, et al. The hidden Plasmodium malariae in blood donors: A risk coming from areas of low transmission of malaria. Rev. Inst. Med. Trop. Sao Paulo. 2020;62:e100. doi: 10.1590/s1678-9946202062100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Putaporntip C, Buppan P, Jongwutiwes S. Improved performance with saliva and urine as alternative DNA sources for malaria diagnosis by mitochondrial DNA-based PCR assays. Clin. Microbiol. Infect. 2011;17:1484–1491. doi: 10.1111/j.1469-0691.2011.03507.x. [DOI] [PubMed] [Google Scholar]
- 15.Putaporntip, C. et al. Cryptic Plasmodium inui and P. fieldi infections among symptomatic malaria patients in Thailand. Clin. Infect. Dis.(In press) (2022). [DOI] [PubMed]
- 16.Cunha MG, et al. Mixed Plasmodium malariae infections were underdetected in a malaria endemic area in the Amazon Region, Brazil. Am. J. Trop. Med. Hyg. 2021;105:1184–1186. doi: 10.4269/ajtmh.21-0296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Thimasarn K, Jatapadma S, Vijaykadga S, Sirichaisinthop J, Wongsrichanalai C. Epidemiology of malaria in Thailand. J. Travel Med. 1995;2:59–65. doi: 10.1111/j.1708-8305.1995.tb00627.x. [DOI] [PubMed] [Google Scholar]
- 18.Putaporntip C, et al. Differential prevalence of Plasmodium infections and cryptic Plasmodium knowlesi malaria in humans in Thailand. J. Infect. Dis. 2009;199:1143–1150. doi: 10.1086/597414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Jongwutiwes S, et al. Plasmodium knowlesi malaria in humans and macaques, Thailand. Emerg. Infect. Dis. 2011;17:1799–1806. doi: 10.3201/eid1710.110349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Putaporntip C, et al. Plasmodium cynomolgi co-infections among symptomatic malaria patients, Thailand. Emerg. Infect. Dis. 2021;27:590–593. doi: 10.3201/eid2702.191660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Blackman MJ, Carruthers VB. Recent insights into apicomplexan parasite egress provide new views to a kill. Curr. Opin. Microbiol. 2013;16:459–464. doi: 10.1016/j.mib.2013.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Das S, et al. Processing of Plasmodium falciparum merozoite surface protein msp1 activates a spectrin-binding function enabling parasite egress from RBCs. Cell Host Microbe. 2015;18:433–444. doi: 10.1016/j.chom.2015.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Holder AA. The precursor to major merozoite surface antigens: Structure and role in immunity. Prog. Allergy. 1988;41:72–97. [PubMed] [Google Scholar]
- 24.Tanabe K, Mackay M, Goman M, Scaife JG. Allelic dimorphism in a surface antigen gene of the malaria parasite Plasmodium falciparum. J. Mol. Biol. 1987;195:273–287. doi: 10.1016/0022-2836(87)90649-8. [DOI] [PubMed] [Google Scholar]
- 25.Blackman MJ, Heidrich HG, Donachie S, McBride JS, Holder AA. A single fragment of a malaria merozoite surface protein remains on the parasite during red cell invasion and is the target of invasion-inhibiting antibodies. J. Exp. Med. 1990;172:379–382. doi: 10.1084/jem.172.1.379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Egan AF, et al. Clinical immunity to Plasmodium falciparum malaria is associated with serum antibodies to the 19-kDa C-terminal fragment of the merozoite surface antigen, PfMSP-1. J. Infect. Dis. 1996;173:765–769. doi: 10.1093/infdis/173.3.765. [DOI] [PubMed] [Google Scholar]
- 27.Conway DJ, et al. A principal target of human immunity to malaria identified by molecular population genetic and immunological analyses. Nat. Med. 2000;6:689–692. doi: 10.1038/76272. [DOI] [PubMed] [Google Scholar]
- 28.Goel VK, et al. Band 3 is a host receptor binding merozoite surface protein 1 during the Plasmodium falciparum invasion of erythrocytes. Proc. Natl. Acad. Sci. USA. 2003;100:5164–5169. doi: 10.1073/pnas.0834959100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Boyle MJ, Richards JS, Gilson PR, Chai W, Beeson JG. Interactions with heparin-like molecules during erythrocyte invasion by Plasmodium falciparum merozoites. Blood. 2010;115:4559–4568. doi: 10.1182/blood-2009-09-243725. [DOI] [PubMed] [Google Scholar]
- 30.Baldwin MR, Li X, Hanada T, Liu SC, Chishti AH. Merozoite surface protein 1 recognition of host glycophorin A mediates malaria parasite invasion of red blood cells. Blood. 2015;125:2704–2711. doi: 10.1182/blood-2014-11-611707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Dijkman PM, et al. Structure of the merozoite surface protein 1 from Plasmodium falciparum. Sci. Adv. 2021;7:eabg0465. doi: 10.1126/sciadv.abg0465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Putaporntip C, et al. Mosaic organization and heterogeneity in frequency of allelic recombination of the Plasmodium vivax merozoite surface protein-1 locus. Proc. Natl. Acad. Sci. USA. 2002;99:16348–16353. doi: 10.1073/pnas.252348999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Putaporntip C, Thongaree S, Jongwutiwes S. Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand. Infect. Genet. Evol. 2013;18:213–219. doi: 10.1016/j.meegid.2013.05.019. [DOI] [PubMed] [Google Scholar]
- 34.Putaporntip C, Hughes AL, Jongwutiwes S. Low level of sequence diversity at merozoite surface protein-1 locus of Plasmodium ovale curtisi and P. ovale wallikeri from Thai isolates. PLoS One. 2013;8:e58962. doi: 10.1371/journal.pone.0058962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Fandeur T, Volney B, Peneau C, de Thoisy B. Monkeys of the rainforest in French Guiana are natural reservoirs for P. brasilianum/P. malariae malaria. Parasitology. 2000;120(I):11–21. doi: 10.1017/S0031182099005168. [DOI] [PubMed] [Google Scholar]
- 36.Birkenmeyer L, Muerhoff AS, Dawson GJ, Desai SM. Isolation and characterization of the MSP1 genes from Plasmodium malariae and Plasmodium ovale. Am. J. Trop. Med. Hyg. 2010;82:996–1003. doi: 10.4269/ajtmh.2010.09-0022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Araújo MS, et al. Natural Plasmodium infection in monkeys in the state of Rondônia (Brazilian Western Amazon) Malar. J. 2013;12:180. doi: 10.1186/1475-2875-12-180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Guimarães LO, et al. Merozoite surface protein-1 genetic diversity in Plasmodium malariae and Plasmodium brasilianum from Brazil. BMC Infect. Dis. 2015;15:529. doi: 10.1186/s12879-015-1238-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Li P, et al. Plasmodium malariae and Plasmodium ovale infections in the China-Myanmar border area. Malar. J. 2016;15:557. doi: 10.1186/s12936-016-1605-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Guimarães LO, et al. The genetic diversity of Plasmodium malariae and Plasmodium brasilianum from human, simian and mosquito hosts in Brazil. Acta Trop. 2012;124:27–32. doi: 10.1016/j.actatropica.2012.05.016. [DOI] [PubMed] [Google Scholar]
- 41.Lalremruata A, et al. Natural infection of Plasmodium brasilianum in humans: Man and monkey share quartan malaria parasites in the Venezuelan Amazon. EBioMedicine. 2015;2:1186–1192. doi: 10.1016/j.ebiom.2015.07.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Andreatta M, et al. An automated benchmarking platform for MHC class II binding prediction methods. Bioinformatics. 2018;34:1522–1528. doi: 10.1093/bioinformatics/btx820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Paul S, Grifoni A, Peters B, Sette A. Major histocompatibility complex binding, eluted ligands, and immunogenicity: Benchmark testing and predictions. Front. Immunol. 2020;10:3151. doi: 10.3389/fimmu.2019.03151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Satapornpong P, et al. Genetic diversity of HLA class I and class II alleles in Thai populations: Contribution to genotype-guided therapeutics. Front. Pharmacol. 2020;11:78. doi: 10.3389/fphar.2020.00078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Jongwutiwes S, Tanabe K, Kanbara H. Sequence conservation in the C-terminal part of the precursor to the major merozoite surface proteins (MSP1) of Plasmodium falciparum from field isolates. Mol. Biochem. Parasitol. 1993;59:95–100. doi: 10.1016/0166-6851(93)90010-U. [DOI] [PubMed] [Google Scholar]
- 46.Jongwutiwes S, Putaporntip C, Hughes AL. Bottleneck effects on vaccine-candidate antigen diversity of malaria parasites in Thailand. Vaccine. 2010;28:3112–3117. doi: 10.1016/j.vaccine.2010.02.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Sawai H, et al. Lineage-specific positive selection at the merozoite surface protein 1 (msp1) locus of Plasmodium vivax and related simian malaria parasites. BMC Evol. Biol. 2010;10:52. doi: 10.1186/1471-2148-10-52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Tanabe K, et al. Allelic dimorphism-associated restriction of recombination in Plasmodium falciparum msp1. Gene. 2007;397:153–160. doi: 10.1016/j.gene.2007.04.033. [DOI] [PubMed] [Google Scholar]
- 49.Tanabe K, et al. Within-population genetic diversity of Plasmodium falciparum vaccine candidate antigens reveals geographic distance from a Central sub-Saharan African origin. Vaccine. 2013;31:1334–1339. doi: 10.1016/j.vaccine.2012.12.039. [DOI] [PubMed] [Google Scholar]
- 50.Tanabe K, et al. Plasmodium falciparum: Genetic diversity and complexity of infections in a isolated village in Western Thailand. Parasitol. Int. 2015;64:260–266. doi: 10.1016/j.parint.2013.09.011. [DOI] [PubMed] [Google Scholar]
- 51.Kimura M. The Neutral Theory of Molecular Evolution. Cambridge University Press; 1983. [Google Scholar]
- 52.Hughes AL. Positive selection and interallelic recombination at the merozoite surface antigen-1 (MSA-1) locus of Plasmodium falciparum. Mol. Biol. Evol. 1992;9:381–393. doi: 10.1093/oxfordjournals.molbev.a040730. [DOI] [PubMed] [Google Scholar]
- 53.Hughes AL, Verra F. Extensive polymorphism and ancient origin of Plasmodium falciparum. Trends Parasitol. 2002;18:348–351. doi: 10.1016/S1471-4922(02)02290-0. [DOI] [PubMed] [Google Scholar]
- 54.Putaporntip C, Jongwutiwes S, Iwasaki T, Kanbara H, Hughes AL. Ancient common ancestry of the merozoite surface protein 1 of Plasmodium vivax as inferred from its homologue in Plasmodium knowlesi. Mol. Biochem. Parasitol. 2006;146:105–108. doi: 10.1016/j.molbiopara.2005.11.001. [DOI] [PubMed] [Google Scholar]
- 55.Putaporntip C, et al. Ecology of malaria parasites infecting Southeast Asian macaques: Evidence from cytochrome b sequences. Mol. Ecol. 2010;19:3466–3476. doi: 10.1111/j.1365-294X.2010.04756.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Withers-Martinez C, et al. Plasmodium subtilisin-like protease 1 (SUB1): Insights into the active-site structure, specificity and function of a pan-malaria drug target. Int. J. Parasitol. 2012;42:597–612. doi: 10.1016/j.ijpara.2012.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Child MA, Epp C, Bujard H, Blackman MJ. Regulated maturation of malaria merozoite surface protein-1 is essential for parasite growth. Mol. Microbiol. 2010;78:187–202. doi: 10.1111/j.1365-2958.2010.07324.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Das S, et al. Processing of Plasmodium falciparum merozoite surface protein msp1 activates a spectrin-binding function enabling parasite egress from RBCs. Cell Host Microbe. 2015;18:433–444. doi: 10.1016/j.chom.2015.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Sanders PR, et al. Identification of protein complexes in detergent-resistant membranes of Plasmodium falciparum schizonts. Mol. Biochem. Parasitol. 2007;154:148–157. doi: 10.1016/j.molbiopara.2007.04.013. [DOI] [PubMed] [Google Scholar]
- 60.Lin CS, et al. The merozoite surface protein 1 complex is a platform for binding to human erythrocytes by Plasmodium falciparum. J. Biol. Chem. 2014;289:25655–25669. doi: 10.1074/jbc.M114.586495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Tolle R, et al. A prospective study of the association between the human humoral immune response to Plasmodium falciparum blood stage antigen gp190 and control of malarial infections. Infect. Immun. 1993;61:40–47. doi: 10.1128/iai.61.1.40-47.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Elizardez YB, et al. Recombinant proteins of Plasmodium malariae merozoite surface protein 1 (PmMSP1): Testing immunogenicity in the BALB/c model and potential use as diagnostic tool. PLoS One. 2019;14:e0219629. doi: 10.1371/journal.pone.0219629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Monteiro EF, et al. Naturally acquired humoral immunity against malaria parasites in non-human primates from the Brazilian Amazon, Cerrado and Atlantic Forest. Pathogens. 2020;9:525. doi: 10.3390/pathogens9070525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Monteiro EF, et al. Antibody profile comparison against MSP1 antigens of multiple Plasmodium species in human serum samples from two different Brazilian populations using a multiplex serological assay. Pathogens. 2021;10:1138. doi: 10.3390/pathogens10091138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Locher CP, Tam LQ, Chang SP, McBride JS, Siddiqui WA. Plasmodium falciparum: gp195 tripeptide repeat-specific monoclonal antibody inhibits parasite growth in vitro. Exp. Parasitol. 1996;84:74–83. doi: 10.1006/expr.1996.0091. [DOI] [PubMed] [Google Scholar]
- 66.McBride JS, Walliker D, Morgan G. Antigenic diversity in the human malaria parasite Plasmodium falciparum. Science. 1982;217:254–257. doi: 10.1126/science.6178159. [DOI] [PubMed] [Google Scholar]
- 67.Pascarella S, Argos P. Analysis of insertions/deletions in protein structures. J. Mol. Biol. 1992;224:461–471. doi: 10.1016/0022-2836(92)91008-D. [DOI] [PubMed] [Google Scholar]
- 68.Montgomery SB, et al. The origin, evolution, and functional impact of short insertion-deletion variants identified in 179 human genomes. Genome Res. 2013;23:749–761. doi: 10.1101/gr.148718.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Streisinger G, et al. Frameshift mutations and the genetic code. Cold Spring Harb. Symp. Quant. Biol. 1966;31:77–84. doi: 10.1101/SQB.1966.031.01.014. [DOI] [PubMed] [Google Scholar]
- 70.Levinson G, Gutman GA. Slipped-strand mispairing: A major mechanism for DNA sequence evolution. Mol. Biol. Evol. 1987;4:203–221. doi: 10.1093/oxfordjournals.molbev.a040442. [DOI] [PubMed] [Google Scholar]
- 71.Chu G. Double strand break repair. J. Biol. Chem. 1997;272:24097–24100. doi: 10.1074/jbc.272.39.24097. [DOI] [PubMed] [Google Scholar]
- 72.McVey M, Larocque JR, Adams MD, Sekelsky JJ. Formation of deletions during double-strand break repair in Drosophila DmBlm mutants occurs after strand invasion. Proc. Natl. Acad. Sci. USA. 2004;101:15694–15699. doi: 10.1073/pnas.0406157101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Lee JA, Carvalho CM, Lupski JR. A DNA replication mechanism for generating nonrecurrent rearrangements associated with genomic disorders. Cell. 2007;131:1235–1247. doi: 10.1016/j.cell.2007.11.037. [DOI] [PubMed] [Google Scholar]
- 74.Hastings PJ, Ira G, Lupski JR. A microhomology-mediated break-induced replication model for the origin of human copy number variation. PLoS Genet. 2009;5:e1000327. doi: 10.1371/journal.pgen.1000327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Jongwutiwes S, Tanabe K, Hughes MK, Kanbara H, Hughes AL. Allelic variation in the circumsporozoite protein of Plasmodium falciparum from Thai field isolates. Am. J. Trop. Med. Hyg. 1994;51:659–668. doi: 10.4269/ajtmh.1994.51.659. [DOI] [PubMed] [Google Scholar]
- 76.Seethamchai S, et al. Variation in intronic microsatellites and exon 2 of the Plasmodium falciparum chloroquine resistance transporter gene during modification of artemisinin combination therapy in Thailand. Infect. Genet. Evol. 2018;65:35–42. doi: 10.1016/j.meegid.2018.07.015. [DOI] [PubMed] [Google Scholar]
- 77.Edgar RC. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Benson G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–580. doi: 10.1093/nar/27.2.573. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Nei M. Molecular Evolutionary Genetics. Columbia University Press; 1987. [Google Scholar]
- 80.Librado P, Rozas J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–1452. doi: 10.1093/bioinformatics/btp187. [DOI] [PubMed] [Google Scholar]
- 81.Jukes TH, Cantor CR. Evolution of protein molecules. In: Munro HN, editor. Mammalian Protein Metabolism. Academic Press; 1969. pp. 21–132. [Google Scholar]
- 82.Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 2013;30:2725–2729. doi: 10.1093/molbev/mst197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Murrell B, et al. FUBAR: A fast, unconstrained Bayesian AppRoximation for inferring selection. Mol. Biol. Evol. 2013;30:1196–1205. doi: 10.1093/molbev/mst030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Weaver S, et al. Datamonkey 2.0: A modern web application for characterizing selective and other evolutionary processes. Mol. Biol. Evol. 2018;35:773–777. doi: 10.1093/molbev/msx335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Kosakovsky Pond SL, Frost SDW. Datamonkey: Rapid detection of selective pressure on individual sites of codon alignments. Bioinformatics. 2005;21:2531–2533. doi: 10.1093/bioinformatics/bti320. [DOI] [PubMed] [Google Scholar]
- 86.Martin DP, Murrell B, Golden M, Khoosal A, Muhire B. RDP4: Detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015;1:vev003. doi: 10.1093/ve/vev003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Jespersen MC, Peters B, Nielsen M, Marcatili P. BepiPred-2.0: Improving sequence-based B-cell epitope prediction using conformational epitopes. Nucleic Acids Res. 2017;45:W24–W29. doi: 10.1093/nar/gkx346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Vita R, et al. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 2019;47:D339–D343. doi: 10.1093/nar/gky1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Thirty-five complete coding sequences of PmMSP1 have been deposited in NCBI GenBank under accession numbers OM525734–OM525768. The datasets generated during and/or analyses during the current study are available from the corresponding author upon request.