Skip to main content
PLOS Neglected Tropical Diseases logoLink to PLOS Neglected Tropical Diseases
. 2024 Jul 11;18(7):e0011879. doi: 10.1371/journal.pntd.0011879

Plasmodium vivax genomic surveillance in the Peruvian Amazon with Pv AmpliSeq assay

Johanna Helena Kattenberg 1,*,#, Luis Cabrera-Sosa 2,3,4,#, Erick Figueroa-Ildefonso 2,3,¤, Mathijs Mutsaers 1, Pieter Monsieurs 1, Pieter Guetens 1, Berónica Infante 2,3, Christopher Delgado-Ratto 4, Dionicia Gamboa 2,3,5,, Anna Rosanas-Urgell 1,
Editor: Andrés Aranda-Diaz6
PMCID: PMC11265702  PMID: 38991038

Abstract

Background

Plasmodium vivax is the most predominant malaria species in Latin America, constituting 71.5% of malaria cases in 2021. With several countries aiming for malaria elimination, it is crucial to prioritize effectiveness of national control programs by optimizing the utilization of available resources and strategically implementing necessary changes. To support this, there is a need for innovative approaches such as genomic surveillance tools that can investigate changes in transmission intensity, imported cases and sources of reintroduction, and can detect molecular markers associated with drug resistance.

Methodology/Principal findings

Here, we apply a modified highly-multiplexed deep sequencing assay: Pv AmpliSeq v2 Peru. The tool targets a newly developed 41-SNP Peru barcode for parasite population analysis within Peru, the 33-SNP vivaxGEN-geo panel for country-level classification, and 11 putative drug resistance genes. It was applied to 230 samples from the Peruvian Amazon (2007–2020), generating baseline surveillance data. We observed a heterogenous P. vivax population with high diversity and gene flow in peri-urban areas of Maynas province (Loreto region) with a temporal drift using all SNPs detected by the assay (nSNP = 2909). In comparison, in an indigenous isolated area, the parasite population was genetically differentiated (FST = 0.07–0.09) with moderate diversity and high relatedness between isolates in the community. In a remote border community, a clonal P. vivax cluster was identified, with distinct haplotypes in drug resistant genes and ama1, more similar to Brazilian isolates, likely representing an introduction of P. vivax from Brazil at that time. To test its applicability for Latin America, we evaluated the SNP Peru barcode in P. vivax genomes from the region and demonstrated the capacity to capture local population clustering at within-country level.

Conclusions/Significance

Together this data shows that P. vivax transmission is heterogeneous in different settings within the Peruvian Amazon. Genetic analysis is a key component for regional malaria control, offering valuable insights that should be incorporated into routine surveillance.

Author summary

Latin America is aiming towards malaria elimination. Genomic surveillance is crucial for a country’s malaria strategy, aiding in understanding and stopping the spread of the disease. While widely used for another malaria species (Plasmodium falciparum), limited tools exist for tracking P. vivax, a significant player in malaria-endemic areas outside of Africa, and the primary cause of malaria in Latin America.

In this study, we used a new tool, Pv AmpliSeq v2 Peru assay, to examine the genetic makeup of malaria parasites in the Peruvian Amazon. This tool helps us see how the parasites from different areas are connected and tracks markers that could indicate resistance to drugs. We found that the parasites from remote areas in the Amazon were genetically different from parasites in areas surrounding the main city of Iquitos, and parasites in a remote border community were genetically more similar to Brazilian parasites.

We also show that the Pv AmpliSeq v2 Peru assay can be used to study parasites from other countries in Latin America, highlighting the broader application in the region. Considering that parasites are not constrained by borders and can easily spread between neighboring countries, a regional approach can be crucial for malaria elimination.

Introduction

Malaria continues to impact millions of people around the world, particularly those living in low- and middle-income countries. P. vivax infections make up 18.0% to 71.5% of cases in regions outside Africa, with the highest proportions in the Americas [1], mainly affecting populations living in remote areas with poor access to healthcare services. The Amazon Basin reports the majority of malaria cases in the Americas, including Peru (> 26000 malaria cases in 2022, 88% caused by P. vivax) [2].

Challenges to malaria control and elimination in the Amazon region are diverse. P. vivax infections in the region are often asymptomatic (58–93%), below the detection threshold of microscopy (61–96%, sub-patent infections) [3], while a significant proportion of these infections carry gametocytes, the parasite stages responsible for transmission to the vector [4]. Moreover, the highest P. vivax incidence occurs in hard-to-reach areas, such as remote indigenous communities [5], and cross-border regions [6] where human mobility, often linked to economic activities, facilitates transmission and reintroduction [7]. In addition, low density infections typically require PCR-based tools for diagnosis, which are difficult to implement in remote areas. Although isothermal amplification assays -such as loop-mediated isothermal amplification (LAMP)- have the potential for point-of-care diagnosis [810], they have not been broadly implemented. Therefore, these asymptomatic and sub-patent infections typically are undiagnosed and untreated, maintaining malaria transmission [4].

Despite these P. vivax specific challenges, the region has made significant efforts to control malaria, with a reduction of the number of cases by 60% from 2000 to 2021 [1]. Therefore, many countries are moving towards malaria elimination in the next 5–10 years by implementing National Malaria Control Programs (NMCPs). In Peru, the Ministry of Health (MINSA) launched a National Malaria Elimination Program (NMEP) in 2022, intending to reduce the number of cases by 90% in 2030 [11].

P. vivax transmission is not uniform across the Peruvian Amazon region and varies significantly between different ecological niches suggesting that transmission is also influenced by ecological and environmental factors such as temperature and rainfall [12]. Since efforts to control P. vivax malaria in Peru must consider its complexity and heterogeneity of transmission, well-structured and integrated genomic surveillance systems can significantly contribute to pinpoint priorities and guide NMCP/NMEPs decision-making [1315]. These systems empower countries to detect areas with the greatest need for control measures, identify program deficiencies and suboptimal strategies, and determine the amount of connectivity between regions. Through genomic analyses, the P. vivax parasite population of Latin America has been described as a distinct population from the rest of the world, presenting relatively lower genetic diversity due to relatively recent introductions in the region [1618]. In the case of the Peruvian Amazon, Iquitos city acts as a source from where parasites spread to other areas through human mobility related to socio-economic activities [19,20]. Additionally, there is substantial gene flow between parasite populations from geographically distant areas in the region [21].

Previously, we have developed a highly-multiplexed amplicon sequencing (AmpliSeq) assay for malaria genomic surveillance of P. falciparum in Peru (Pf AmpliSeq Peru, [22]) and P. vivax in Vietnam (Pv AmpliSeq v1 Vietnam, [23]). Our assays include validated and candidate genes associated with antimalarial resistance, SNP-barcodes to serve different functions, i.e. a global P. vivax SNP-barcode for prediction of origin, and country or region-specific SNP-barcodes for population genetics analysis such as transmission dynamics and connectivity. The AmpliSeq assays have the advantage, in addition to cover a large panel of genetic variants of interest, to be easily adaptable for diverse epidemiological contexts, and have been successfully used to identify temporal changes in the parasite population (before and after NMCP interventions), monitor drug resistant markers, assess connectivity, and identify sources of reintroduction [22,23].

In this study, we modified the Pv AmpliSeq v1 Vietnam targeted NGS assay to Pv AmpliSeq v2 Peru, incorporating a newly designed Peru-specific SNP-barcode (named Pv Peru barcode from here on) with in-country resolution instead of the Vietnam specific barcode. The Peru barcode was validated in silico using a published dataset of 399 genomes from Latin America [18].

We analyzed 230 samples collected between 2007 and 2020 across 11 districts in the Peruvian Amazon and generated new genetic data as baseline information for molecular surveillance in Peru. While previous population genetic analysis in the Loreto region have focused in Maynas Province, we have included samples from remote communities in 2 additional provinces: Loreto and Mariscal Ramon Castilla. This tool provides valuable information to effectively guide malaria control and elimination efforts in Peru by facilitating the generation of high-quality data and strategic information for malaria surveillance in Latin America.

Methods

Ethics statement

Samples from previous studies with written consent for future use for malaria research were used with protocols registered in the Decentralized System of Information and Follow-up to Research (SIDISI) of the University Directorate of Research, Science and Technology at Universidad Peruana Cayetano Heredia (UPCH), to be then evaluated by UPCH Institutional Research Ethics Committee (CIEI) prior to its execution (SIDISI codes 52707, 61703, 66235, 101518, 102725). In Yavari, Ramon Castilla and Trompeteros districts, samples were collected as part of routine surveillance activities by MINSA. MINSA authorities transferred these samples to the UPCH team for research activities (SIDISI project 102725). Secondary use of all samples for the purpose of genomic surveillance of malaria was approved through the Institutional Review Board of the Institute of Tropical Medicine Antwerp (reference 1417/20).

SNP selection for barcode design

To design a SNP barcode with in-country resolution in Peru, raw whole genome sequencing data (Fast Q files) of P. vivax isolates from Peru generated in-house (n = 30 from [23,24]) was combined with online available Peruvian P. vivax genomes (n = 100 from [2528]) and jointly genotyped after variant calling as described elsewhere [18]. Briefly, FASTQ files were aligned to the PvP01 reference genome version 46 from PlasmoDB [29] and variants were called using GATK4 HaplotypeCaller. Allele frequencies of the selected SNPs in the design were assessed in the global genome dataset (n = 1474) [18], as well as in genomes from Latin America (n = 399) from that dataset.

To design the SNP Peru barcode, unlinked biallelic SNPs were filtered from the genomic dataset by LD-pruning in 5–6 iterations by scanning over the genome in 500 bp windows to remove uninformative SNPs with pairwise LD>0.2 using the python package scikit-allel. Subsequently, the contributions of the SNPs to genetic clusters were determined using discriminant analysis of principal components (DAPC) [30] with K-means inferred populations (n = 10) using the adegenet package in R. DAPC was performed multiple times (n = 20) with cross-validation, and associated allele loadings between simulations were compared to determine the most contributing alleles. Finally, 49 SNPs with high allele loadings in the DAPC were selected with spread over the chromosomes (aiming for 2–4 SNPs per chromosome, which gave good population differentiation in our previous studies [22,23] and distance (>500 bp) from drug resistant amplicons targeted in the assay to avoid linkage). The Illumina Concierge team (Illumina, San Diego, USA) used DesignStudio software with the PvP01 reference genome to add amplicons for the new barcode to the existing Pv AmpliSeq v1 Vietnam custom panel design [23] without the Vietnam SNP-barcode. Out of the 49 selected SNPs, amplicon design was successful for 41 SNPs, resulting in the Pv Peru Barcode.

Samples and study settings

In order to generate baseline genetic surveillance data that includes parasite populations from peri-urban areas, remote areas and border communities across a wide area in the Peruvian Amazon, we selected samples retrospectively from prior studies in Peru. P. vivax qPCR-positive samples (n = 230) were selected based on geographical origin and parasite density. We applied a parasite density cut-off of ≥5 parasites/μL by qPCR, to prioritize samples with sufficient template for expected successful library preparation as determined previously [23]. Samples were from 11 districts in 3 provinces in the Loreto region: Loreto, Maynas, and Mariscal Ramon Castilla (Fig 1).

Fig 1. Map of included study sites in the Peruvian Amazon.

Fig 1

A. Provinces in the Loreto region. Three provinces were included in this study (Loreto: orange, Maynas: green, Mariscal Ramon Castilla: blue). Loreto province, includes samples from the Trompeteros district. The remote indigenous Nueva Jerusalen community is settled there. In Mariscal Ramon Castilla province, samples were included from 3 districts containing border communities with Colombia and Brazil. The area within the red square is shown in B. B. Maynas province, with many districts with peri-urban communities. Maps were created with an in-house script in R using the R-packages used R libraries SF, ggspatial, ggrepel. The base layer of the map was obtained from: https://ide.inei.gob.pe/#geo.

The samples included here were collected in previously published studies from 2007–2008 (n = 41) [31], 2016 and 2017 (n = 54) [32], from population-based cross-sectional surveys in Mazan district in 2018 (n = 13) [33,34], and n = 32 samples collected in 2014 by passive case detection (PCD) at the health center at San Juan district in Iquitos (S1 Table). In addition, samples from Ramon Castilla, Yavari and Trompeteros districts were collected in collaboration with MINSA authorities as part of surveillance activities in remote border communities in the Loreto region. Collections in Ramon Castilla (n = 2) and Yavari (n = 20) were conducted in December 2018 through active case detection (ACD) and PCD, respectively. Samples from Trompeteros (n = 68) were collected during 3 weeks at the end of November as part of ACD visits and during April—May 2020 by PCD (Fig 2 and S1 Table).

Fig 2. Overview of samples from Peru over time (year) and district.

Fig 2

Samples were collected in 11 districts (y-axis) between 2007 and 2020 (year on x-axis). The size of the bubbles indicates the number of samples collected in a specific year and district.

DNA extractions and quantification

DNA from all samples was extracted using the E.Z.N.A Blood DNA Mini Kit (Omega Bio-tek, Georgia, USA) or QIAmp DNA Blood Mini Kit (Qiagen, Germany) following the manufacturer’s instructions. For DNA extraction using the E.Z.N.A kit, we used 40μL of whole blood, or a ~0.72 cm2 piece of dried blood spot (DBS); DNA was eluted in a final volume of 50 μL. For DNA extraction using the QIAmp kit, we used two punches of DNA (0.6 cm diameter); DNA was eluted in a final volume of 100μL. P. vivax parasitaemia was quantified by 18S rRNA qPCR [35] and/or Pv mtCOX1 qPCR [36].

AmpliSeq library preparation and bioinformatics

Pv AmpliSeq library preparation was performed using AmpliSeq Library PLUS for Illumina kit (Illumina), AmpliSeq Custom Panel design (i.e. the primer pools as per the new design with amplicons of 158–384 bp lengths; S2 Table) and AmpliSeq CD Indexes (Illumina) as per the manufacturer’s instructions. Library preparation was performed on 7 μL of undiluted DNA as previously described [23], following similar steps as the protocol for P. falciparum AmpliSeq [37]. Briefly, target regions were amplified in two reactions, and then combined for final library preparation. Libraries were quantified using Qubit v3 High sensitivity DNA kit (Invitrogen, Massachusetts, USA), and subsequently diluted to 2nM with low Tris-EDTA buffer, and pooled (pooling 144 libraries per sequencing run). Denatured library pool (7 pM) was loaded on a MiSeq system (Illumina) for 2x300 paired-end sequencing (Miseq Reagent Kit v3, Illumina) with 1–5% PhiX spike-in (Illumina).

FASTQ files were processed with an in-house analysis pipeline on a Unix operating system computer, as previously described [23], where trimmed reads were aligned to the PvP01 reference genome using Burrows-Wheeler aligner followed by GATK haplotypecaller generating a jointly-called VCF file with variants (SNPs and INDELs) detected in the targeted regions (scripts are available at https://github.com/Ekattenberg/Plasmodium-AmpliSeq-Pipeline). Variants were hard filtered (QUAL>30, overall DP>100, MQ>50, QD>1.0, SOR<4, GT depth >5) and annotated with SnpEff (v4.3T) [38], resulting in 3862 high quality genotypes (incl. all variant types, e.g. SNPs and indels). Per locus filtered depth of coverage (format field DP in the vcf) was used to calculate the median depth of all loci per sample or per amplicon. Aligned coverage was calculated as the number of bases passed filter divided by the number of bases (59815 bp) targeted in the Pv AmpliSeq v2 Peru.

A total of 274 unique samples from 5 different projects, 84 replicates and 69 controls were attempted with the Pv AmpliSeq assay (Fig A in S1 Text). For the analysis of genomic surveillance use cases in Peru, we included 230/274 (83.9%) samples (Fig 1) with good quality data (<25% missing genotype calls for all variants, mean coverage >15), and retained only one library of replicates (with lowest missingness).

Data analysis

Different SNP subsets were used in the data analysis as specified for each analysis below. The Pv Peru barcode was first evaluated in silico in a WGS dataset from [18], using the barcode SNPs and a larger set of biallelic SNPs in the WGS data that were also present in the newly generated Pv AmpliSeq Peru data (n = 730). In the WGS data not all the SNPs from the Pv AmpliSeq Peru (n = 2909 in total) were present in this dataset, partly as the MAF filter applied in that dataset could have removed SNPs that are rarely observed in other regions in the world. For the analysis of the Pv AmpliSeq Peru data, we used the 41 SNP loci. In addition, while the Pv Peru Barcode is appropriate to be used for population genetic analyses, higher geographical resolution with the AmpliSeq assay data can be obtained by using all biallelic variants or SNPs in the vcf (nloci = 3862, including nSNP = 2909). Using all biallelic variants in the assay also allows the use of models such as identity-by-descent, which requires at least 200 biallelic SNPs [39]. We use the biallelic SNPs as we previously saw these variants had a lowest error rate compared to indels [22]. However, for the DAPC analysis indels are also included, as these can be biologically relevant variants.

Genetic diversity, expressed as expected heterozygosity (He), was calculated using polymorphic barcode (40/41 SNPs) genotypes from the vcf with the adegenet package in R [40,41]. Nucleotide diversity (pi) was determined by sliding across the target regions in the genome in 500-bp windows using vcftools and a vcf file with all biallelic SNPs detected by the assay. To compare the median of the genetic diversity parameters across different districts and periods, Kruskal-Wallis rank sum test was performed. For pairwise comparisons between groups, Wilcoxon test with the False Discovery Rate (FDR) or Benjamini-Hochberg procedure as correction for multiple testing was used. Statistical tests were performed with the R package stats. P-values < 0.05 were considered significant. PCA and DAPC with cross-validation was performed to infer population structure based on haplotypes across districts and years [30]. Associated allele loadings for the first four components in the DAPC were determined.

Genetic differentiation, expressed as fixation index (FST), was calculated using all biallelic SNPs (n = 2909) detected by the assay with the R package hierfstat [42]. Within-host infection complexity was assessed using within-sample F-statistic (Fws) with the R package moimix [43] using all biallelic SNPs (n = 2909) detected by the assay. Fws ≥ 0.95 was considered a proxy for a monoclonal infection as in Auburn et al. 2012 [44].

We created a list of variants of interest (S3 Table) that included variants in genes reported in the literature as potentially associated with P. vivax antimalarial resistance. The list was supplemented with non-synonymous variants detected in the target genes that contributed to the variation in the DAPC. Haplotypes were created by combining genotypes of variants of interest.

To measure pairwise identity-by-descent (IBD) between samples, PED and MAP file formats were generated from VCF data using VCFtools. The level of IBD-sharing was calculated employing the isoRelate package in R [45], following previously described settings [23]. Specifically, we used all biallelic SNPs (n = 2909) identified by the AmpliSeq assay, applying filtering criteria of the MAF = 0.001 and SNP and individual missingness thresholds (0.6 and 0.3, respectively), resulting in 1219 SNPs for subsequent IBD analysis. Furthermore, IBD thresholds were set to include a minimum of 15 SNPs per segment and a segment length of 3000 bp, aimed at mitigating false-positive calls using an error of 0.001. Network plots (at thresholds of 99%, 50% and 10% IBD) were created using the igraph package in R to visualize the relatedness between samples (95% and 50%) and connectivity between districts (10%).

A likelihood-based classifier was used to predict the origin of P. vivax isolates using the 33-SNP vivaxGEN-geo barcode, using the vivaxgen geo framework (https://geo.vivaxgen.org/) and reference dataset [46].

Results

Design and in silico evaluation of the Pv Peru-barcode using WGS

We designed a 41-SNP Pv Peru Barcode, with in-country resolution (i.e. able to separate between distinct P. vivax populations in Peru based on variability of allele frequencies between districts and provinces) using P. vivax genomes from Peru (n = 130) [18]. SNPs in the barcode were selected based on their contribution to geographically distinct genetic clusters in the DAPC. Allele frequencies (AF) of the 41 SNPs in the barcode were evaluated in silico in P. vivax genomes (n = 1474) originating from 31 countries, including 130 Peruvian genomes [18]. Minor allele frequencies (MAF) of all SNPs varied between 0.01–0.49, with a median of 0.16 (S4 Table). Within genomes from Peru, most SNPs (63%) had a MAF>10%, with 10 alleles observed at MAF<5%. The MAF in isolates from multiple countries in Latin America (including Mexico, Panama, Colombia, Brazil) were similar (median MAF 0.09, range 0.01–0.49) to the MAF found in Peru. The Pv Peru Barcode was capable of separately clustering the Latin American population in PCA analysis (Fig 3A and 3C). The resolution for population structuring increased when using n = 753 loci (i.e. the overlap in SNPs between the WGS data and all biallelic SNPs detected in the Pv AmpliSeq v2 Peru target region, mimicking the resolution for population structure achievable with the assay), separating the samples from different countries with distinct subpopulations (Fig 3B and 3D), matching patterns previously observed in the whole genome dataset (Fig B in S1 Text). Altogether, this demonstrates a wider applicability of the Pv AmpliSeq v2 Peru assay to the Latin American region and the optimal resolution of the Pv Peru barcode to differentiate parasite populations by country in Latin America.

Fig 3. Principal Component Analysis of P.

Fig 3

vivax genomes dataset (23), filtered on the SNPs in the Pv Peru Barcode only (A & C), and all biallelic SNPs detected by the Pv AmpliSeq v2 Peru assay in the Peruvian Amazon samples (B&D). The samples (dots) are colored according to the originating population (regions or country), using the classifications from Adam et al., 2022 [68].

Barcode and Pv AmpliSeq v2 Peru assay performance

The Pv Peru Barcode was experimentally validated with the Pv AmpliSeq v2 Peru assay using P. vivax isolates (n = 230) from several regions in Peru (Figs 1 and 2). All 41 targeted SNPs in the barcode were amplified successfully, with a median MAF of 0.24 (range: 0.01–0.49, mean ± standard deviation: 0.25 ± 0.14) in all samples from Peru. For the SNP PvP01_13_v1_32509, only the reference allele was detected. Most of the samples performed well with the Pv Peru barcode: 185 (80.4%) and 213 (92.6%) samples had <10% and < = 25% missing genotypes at the barcode positions, respectively. There were 7 (3%) samples without barcode data due to an experimental error (Pv Vietnam barcode primers from the Pv AmpliSeq v1 Vietnam assay were used instead of the Pv Peru barcode), however the samples were kept for the variant of interest and prediction of origin analyses as they passed the criteria for all positions.

We detected a median of 14 (range 4–28) biallelic SNPs per barcode amplicon in all samples, which could potentially be used as microhaplotypes in a haplotype-based approach (S4 Table). The Pv AmpliSeq v2 Peru assay generated a high number of reads per sample (median 119033, interquartile range [IQR] 59535–243132 reads/sample after trimming low-quality reads), with a median of 99.4% (IQR 97.7–99.9%) of reads aligning in pairs to the PvP01 reference genome. The median depth of coverage of aligned high-quality reads past the filter (DP) was 335 reads (IQR 121 to 859) (Fig C in S1 Text). There were 4 (1.8%) amplicons with low mean DP-values (<10) (pvama1_4, pfdhfr_8, pvmdr1_1, and PvP01_14_v1_2841138), and 1 amplicon (0.44%) had high mean DP-values (>200) (pvcrt_11).

Primer specificity was confirmed using uninfected human blood samples (n = 2) and a 3D7 laboratory strain (n = 3) as negative controls. The few sequencing reads generated mapped predominantly outside the assay target regions, and only a mean 1.1% ± 0.8% of all variants in these regions were called. All called variants were below the filtering and inclusion thresholds. In addition, previously Pv AmpliSeq v1 Vietnam genotyped samples (n = 24, from [18,23]), were repeated in the Pv AmpliSeq v2 Peru assay. When comparing genotypes at variants of interest loci that were included in both the AmpliSeq v1 and AmpliSeq v2 versions (n = 26 loci), the majority of pairwise comparisons between the replicates, resulted in identical genotypes. Except in the case of 5 genotypes (in 3 samples), where mixed genotypes (more than one allele present) were not detected in both replicates. These samples also had more than one allele at other loci (indicative of multiple strain infections), and therefore 10/15 mixed genotypes were identified in both replicates.

Spatial-temporal patterns in transmission intensity

Complexity of infection (COI) and genetic diversity–expressed as expected heterozygosity (He) and nucleotide diversity (pi)–were used as a proxy of transmission intensity. Both parameters were compared between parasite isolates from different districts and years (Fig 3). He was estimated using the SNPs-barcode positions only, and pi was measured over all biallelic variants detected in the samples. Moderate levels of He were found in all districts, except in Yavari, which had a significantly lower diversity (mean He = 0.0121; p < 0.0001) (Fig 4A). Pi was significantly higher in Iquitos compared to its surrounding districts Mazan and San Juan Bautista (adjusted p<0.0001). In addition, similar to the observed patterns in He, pi was significantly lower in Yavari (adjusted p<0.0001) compared to all other districts (Fig 4C). Pi was higher in 2007–2008 than in later years (adjusted p<0.001, Fig 4D), alongside a period of intensification of malaria control and reduction of cases in Peru. No temporal trends were observed in He (Fig 4B). Diversity in samples from 2018 was lower than other years, however this is likely a result of the low diversity in Yavari in 2018, which made up the majority of samples in this year. In addition, we found polyclonal infections in 5 districts, all of them at a low proportion (10–35.3%) (Fig D in S1 Text).

Fig 4. Molecular markers for transmission intensity.

Fig 4

Violin plot of Expected heterozygosity (He) of 40-SNP barcode positions (A & B), and nucleotide diversity (pi) (C and D) measured across the targeted region in 5000 bp windows by district (A& C) and years (B & D).

Population structure and connectivity

The geographic differentiation and structure of parasite populations was investigated at the district level using all variants detected (nloci = 3862, including nSNP = 2909) to increase the resolution for fine scale population structure detected with the 41 Pv Perubarcode SNPs. Estimation of pairwise genetic differentiation (Fst) using all biallelic SNPs (nSNP = 2909) showed little differentiation (Fst: 0.02–0.07) between districts; except Yavari, which was highly differentiated (Fst: 0.39–0.55) from all other districts and in particular from Iquitos (Fig 5A).

Fig 5. Population structure and genetic differentiation.

Fig 5

Heatmap of pairwise FST values between 5 districts in the Peruvian Amazon estimated with the hierfstat package in R (A). Scatter plot of discriminant analysis eigenvalues 1 and 2 (LD1 and LD2) using all variants detected by the Pv AmpliSeq v2 Peru assay of Peru samples (n = 230) grouped by district (B) or district and year (C), and Maynas samples (n = 124) grouped by period (D).

We explored the population structure using DAPC and all variants detected with the AmpliSeq assay (nloci = 3862). The first two components (PC) separate the communities in Maynas province from remote communities in Loreto and Mariscal Ramon Castilla provinces: Yavari (PC1), Trompeteros (PC2) and Ramon Castilla (PC2) (Fig 5B). Yavari and Trompeteros were also separated by the first two PCs when using PCA (Fig E in S1 Text). This observed structure was not affected by sampling in different years, as clusters from different time points, but same areas, overlap in the DAPC (Fig 5C). However, within Maynas province, a temporal differentiation was found separating the samples in 3 clusters: 2007–08, 2014 and 2016–18 (Fig 5D).

From the variants contributing most to the first two axes of the DAPC analysis with all samples (Fig 5B), 10 out of 38 highest contributing alleles were SNPs in the barcode amplicons. Similarly, 12 out of 30 highest contributing alleles in the Maynas DAPC analysis (Fig 5D) were Pv Peru Barcode positions or other SNPs on the barcode amplicons (S5 Table), highlighting the contribution of these genomic regions to the population structure with sufficient resolution for within-country analysis in Peru. Highly-contributing alleles in genes of drug resistance interest, especially pvmdr2 and upstream variants of pvcrt, were found, including both missense mutations, potentially under drug-induced selection pressure, as well as synonymous mutations, which do not impact the phenotype but reflect the population genetic background (S5 Table).

We assessed the connectivity of parasites in the Peruvian Amazon across space and time by measuring the genetic relatedness between isolates. We analyzed the pairwise IBD between samples within and between districts and years (Fig 6 and Fig F in S1 Text). High relatedness was seen mainly between samples from the same district, with a clonal population (sequences sharing IBD > 99%) in Yavari (Fig 6A). At moderate levels of IBD-sharing (50%), there was a lot of relatedness within Trompeteros samples (Fig 6B). At even lower levels of IBD (10%), samples from Yavari, Trompeteros, Mazan and San Juan Bautista become connected (i.e. high amount of sample pairs with at least 10% IBD), regardless of their geographical distance within these districts (Fig 6C). Many clusters of high IBD (50% or 90%) were found within one year, however, some sample pairs from different years with high relatedness were observed (Fig F in S1 Text).

Fig 6. Parasite relatedness.

Fig 6

Network of individual relatedness at (A) intermediate levels of relatedness (50% IBD threshold) and (B) very high levels of relatedness, indicating clonal infections (95% IBD threshold) colored by district. (C) Network of connectivity showing pairwise IBD-sharing between isolates using low levels of relatedness (10% IBD threshold) and summing pairwise comparisons by districts.

Variants of interest

We investigated the haplotypes in the genes that have a potential association with drug resistance in P. vivax and orthologue species as previously described [23]. Haplotypes were constructed of variants of interest (S6 Table), including variants with a potential association with resistance and non-synonymous variants contributing to the DAPC in Fig 5A.

High variability in the genes pvmdr1, pvdhps, pvdhfr, pvmdr2, pvp13K, pvmrp1, and pvmrp2 was found in most districts (Fig G in S1 Text), whereas low variability was observed in pvcrt (only one sample with an amino acid change at position 275, F>V) and pvK13 (only one haplotype). In pvdhps, we observed the sulphadoxine-resistance associated mutation A383G in 61.7% (142/230) of samples, alongside other phenotypically uncharacterized mutations (S6 Table). In pvdhfr, we observed the pyrimethamine-resistance associated mutations S58R (125/230, 54.3%) and S117N (3/230, 1.3%), alongside other uncharacterized mutations, including S58K (S6 Table). Two observed different nucleotide changes (codons AGA and AGG) resulted in the S58R amino acid change in pvdhfr. The A553G mutation in pvdhps and mutations F57L, T61M, S117T in pvdhfr were absent from the samples from Peru. In pvmdr1, we observed the CQ resistance-associated mutations Y967F (CQ and AQ) and F1076L (CQ) in 15.2% (35/230) of samples in Peru, always combined in the same haplotype that also carried the S698G mutation (S6 Table).

The DAPC analysis identified differences between populations (Fig 5A and 5B), including mutations in pvdmt2 (S277Y), pvmdr2 (G305S, S1038N, D1447E, A1450V, and T1480A), pvmrp1 (H1586Y, I1478V, G1419A and K36Q), pvmpr2 (N1251Y, P330S), and large indels in pvdhfr (at amino acid positions 618 and 639) and pvp13k (indels at amino acid positions 86, 106–108, 834, and mutations E1456K and E810G), but the phenotypes of these variants have not been characterized. Yaravi samples presented haplotypes in most of the genes (pvmdr1, pvdhps, pvdhfr, pvmdr2, pvp13K, pvmrp1, and pvmrp2) that were found exclusively in this location or were rare in other areas.

We also investigated the variability in the antigenic pvama1 gene, with 6 observed haplotypes of 6 non-synonymous variants detected in the amplicons targeting the highly variable regions in this gene (S6 Table). Moderate variability was found with all haplotypes detected in all years (Fig H in S1 Text). A high proportion of samples with Haplotype 19 was observed in 2007–08 (39%), which decreased in later years (16% in 2014, 24% in 2016–2018 and 4% in 2019–2020). In contrast, haplotype 4 was predominant in 2016–2018 (24%), mainly corresponding to isolates from border districts Yavari and Ramon Castilla (Fig E in S1 Text).

Predicting the origin of infections

We predicted the origin of samples from different districts in the Loreto region using the SNP vivaxGEN-geo barcode included in the panel. As expected, most samples from Mazan, San Juan Bautista, Iquitos and Trompeteros were predicted to originate from Peru (predicted value ranging between 50–75%), with a small proportion of samples with principal predictions to originate from Brazil (predicted value ranging between 22–35%) and Colombia (predicted value ranging between 3–25%) (Fig 7). Samples from Yavari, Ramon Castilla and San Pablo—all border communities—were more similar to Brazilian isolates (Fig 7).

Fig 7. Prediction of origin of P.vivax.

Fig 7

The origin of samples was predicted using the vivaxgen geo framework (https://geo.vivaxgen.org/), which returns 3 predictions of origin for each sample with a score for each prediction. Here we plot a heatmap of the frequency (y-axis) of the highest-scoring prediction of origin for the samples from each district of the sample collection site (x-axis). The majority of samples from Mazan, San Juan Bautista, Iquitos and Trompeteros were predicted to originate from Peru, while bordering communities Ramon Castilla, San Pablo and Yavari were predicted to originate from Brazil.

Discussion

In this study, we designed a Pv Peru Barcode capable in combination with the Pv AmpliSeq panel [23] to differentiate parasite populations from not only Peru, but the wider region of Latin America at a high resolution. We successfully applied the Pv AmpliSeq v2 Peru assay to a retrospective sample collection (n = 230) from 11 districts in the Peruvian Amazon. We observed marked differences between peri-urban, remote and border communities, with an overall heterogenous P. vivax population with high diversity in the Loreto region, but moderate to low diversity at the community level. This pattern of low-local but relatively high-regional genetic diversity has been described previously using microsatellites [47] and is likely caused by the effect of random genetic drift in small populations that remain relatively isolated from each other. Rare alleles disappear (decreased local diversity translated in a reduction of He, and high relatedness IBD), while differentiation between sites increases. In contrast, in peri-urban communities close to Iquitos -the regional capital- a relatively high genetic diversity is observed with connectivity within and between districts and a temporal drift of the parasite population, in concordance with previous reports [19,48].

The observed connectivity between the peri-urban settings in Maynas province is likely due to economic activities and travel to the economic centers in the areas [49]. The presence of malaria corridors in the Peruvian Amazon due to human mobility and other socioeconomic factors has been previously reported [7,21]. The effects of human movement on malaria transmission in peri-urban areas of the Peruvian Amazon have been well-studied, though less is known about how migration and mobility affect the spread of malaria to and from remote indigenous and border communities.

In the remote isolated area of Trompeteros (district in Loreto province and inhabited by an indigenous population), a genetically distinct parasite population was observed with moderate diversity, high relatedness between isolates in the community and limited connectivity to other districts in Loreto province. While in the remote community of Yavari district (in Mariscal Ramon Castilla province and along the border with Brazil), a highly differentiated clonal P. vivax population was observed that carried distinct haplotypes in drug resistant genes and pvama1. This population was predicted to be more similar to Brazilian isolates and showed little connectivity with populations in the other provinces in Peru. This highly differentiated clonal population may have been introduced from Brazil causing an outbreak in Yavari. Indeed, 22% (36/163) of samples collected from symptomatic patients in the community, tested positive for P. vivax by PCR analysis. However, due to the limited epidemiological data at that time, an outbreak could not be officially confirmed. However, in addition of typically clonal P. falciparum outbreaks [5052], P. vivax outbreaks have also been reported in Peru [32]. An alternative explanation is that, due to the vicinity of the Yavari community to the Brazilian border, the parasite population in Yavari shares ancestry with populations in Brazil. This would also result in a similar genetic pattern as observed.

Predictions of origin of parasites in this region are also limited by the low number of South American isolates in the reference dataset in the applied likelihood model, which could be improved by including more recent genomes from this region from other studies [16,18]. There were 7 (3%) samples from Peru with predicted origins from Vietnam, Afghanistan and Iran. While it is possible that cases were imported from areas outside of South America, it is more likely that this is a result of incorrect predictions due to missing data in the 33-SNP vivaxGEN-geo barcode loci used for the prediction (3 samples had 18–30% missingness in this barcode compared to the proportion of missingness in all variants: 4–28%), or inaccuracies in the reference dataset, as this was also observed in our earlier study [23].

The presence of highly related parasites observed in these remote districts offers compelling evidence of local transmission in these isolated and hard-to-reach communities, which are accessible only after extensive riverine and road travel spanning many hours or days from Iquitos. Direct interactions with inhabitants during field work reveal occasional movement to neighboring communities for work or social engagements, yet such mobility rarely extends to more distant regions like Maynas. Nonetheless, our analysis underscores the relatedness between P. vivax parasites in these remote communities and those nearer to economic centers. Future investigations should prioritize studying patterns of human mobility and associated socio-epidemiological factors within these hard-to-reach communities to gain deeper insights into the factors driving malaria transmission.

Similar spatiotemporal transmission dynamics have been observed in various malaria-endemic Latin American countries. In Brazil, the population structure is notably complex, characterized by numerous clades and a high prevalence of monoclonal infections. Notably, a clear genetic separation exists between the northern states (Amapá and Pará), with high genetic diversity, and a highly clonal P. simium cluster originating from São Paulo [16]. Additionally, within the Mâncio Lima municipality, a cluster primarily comprised of monoclonal infections has persisted from 2014 to 2016, indicating stable transmission dynamics in this area [53]. In contrast, Colombia displays a high proportion of monoclonal infections, alongside two subpopulations with significant spatiotemporal shifts, particularly evident in the Córdoba region where one subpopulation has increased in prevalence over time [54]. Similarly, the P. vivax population in Panama is highly clonal, with a dominant lineage persisting across the country over a decade (2007–2019), suggesting stable transmission despite elimination efforts and varying transmission levels across regions [55]. Conversely, Venezuela has experienced an increase in polyclonality and genetic diversity of parasites in 2018 compared to 2003 [56]. Overall, these findings underscore the impact of environmental factors and human activities, such as rural-urban mobility, on shaping parasite populations in these regions, highlighting the challenges in malaria control and elimination strategies.

The differences in gene flow and transmission dynamics resulting in distinct genetic patterns are important to guide NMEP strategies for malaria control and elimination. First, it is important to inform implementation of local vs. regional interventions in order to ensure progress towards malaria elimination in all areas. Second, gene flow between the Yavari population and neighboring Brazil, but also with isolates from other Peruvian districts, shows the importance of increasing surveillance in border areas (not only with Brazil, but also Colombia and Ecuador) and creating connections with health authorities from these countries to collectively better control malaria [57]. In addition, specific attention can be directed towards addressing stable local transmission in hard-to-reach indigenous communities, such as in Trompeteros. The risk of malaria resurgence following elimination in these areas is mitigated by geographical remoteness and cultural differences. Nevertheless, also due to remoteness, MINSA interventions frequently fail to reach these communities, and when they do, are typically temporary. Consequently, the adoption of district or community-level strategies with targeted interventions, as previously proposed [58], may be advantageous in addressing malaria control challenges in Peru’s heterogeneous settings.

Other regions of Peru, such as the Amazonas region (neighboring Loreto) or the Northern Coast, were not included in this study, highlighting the need for further investigations to fully characterize the entire P. vivax population in Peru. This is particularly crucial given the observed low local but high regional diversity. However, compared to other previous studies, our study included more districts and remote areas spanning a larger period [1921]. To fully assess the diversity of Peruvian parasites, a more systematic sampling approach is warranted, possibly linked with activities conducted by the NMCP and/or NMEP. Regular sample collection should be conducted through multiple sentinel sites across the country, including hard-to-reach communities. Additionally, employing current active case detection strategies may be necessary when number of cases increases over a defined threshold at the community level.

We detected distinct haplotypes in genes putatively associated with drug resistance in Peruvian parasites. However, in contrast to P. falciparum, no validated markers of resistance have been validated for P. vivax, with the exception of SP-resistance [59]. Therefore, the observed haplotypes are not characterized in clinical or in vitro studies for CQ or ART resistance in P. vivax, which are the first-line drugs for P. vivax and P. falciparum malaria in Peru, making it difficult to interpret these results considering the national treatment guidelines. In addition, while the WHO recommends routinely monitoring antimalarial efficacy every 2 years [60], the latest treatment efficacy studies (TES) for P. vivax in Peru were conducted about 10 years ago [61,62]. Ex-vivo assays represent an additional approach to characterize resistance profiles in Plasmodium parasites. These assays need to be conducted quickly after sample collection and require laboratory facilities in the vicinity to the collection sites [63]. For instance, reports from Porto Velho and Mâncio Lima in Brazil spanning from 2012 to 2015 indicated the absence of chloroquine resistance during that period [64,65]. Another study with samples from Iquitos collected between 2015 and 2019 reported chloroquine resistance in 3.3% (1/30) of isolates [66]. However, these assays are not suitable to analyze low parasitemia infections, common in P. vivax infections, thereby complicating routine analysis [63].

The Pv AmpliSeq assay has been successfully applied in this study to low density samples from Peru, using the previously established threshold of a minimum of 5 parasites per microliter of blood [22]. Accurate quantification, through standardized qPCR protocols and the use of a Ct-value cut-offs (e.g. a Ct-value of 34 cycles), which provide a more reliable representation of parasite DNA quantity compared to challenging-to-standardize diluted samples with known parasite densities, is crucial for precise sample selection in efficient surveillance protocols. In addition, maintaining DNA integrity in DBS samples, which are well-suited for sample collection in remote communities due to their ease of transportation to reference laboratories [67], is important to ensure optimal performance of the assay.

Asymptomatic infections are a critical challenge for P. vivax elimination in the Amazon basin, particularly in Brazil and Peru. Though most of these infections commonly have low density parasitemia, 15–31% of asymptomatic infections are patent [3]. In this study, we have included samples from asymptomatic patients obtained in active case detection (Iquitos, San Juan Bautista, Trompeteros) and by population-based survey (Mazan), showing the potential of the Pv AmpliSeq assay to analyze both asymptomatic and symptomatic infections, as we also demonstrated for the Pf AmpliSeq assay in Peru [22]. Asymptomatic infections should be included in molecular surveillance efforts given the large contribution of these infections into the malaria burden and transmission in Peru.

In summary, our study elucidated the heterogeneity of P. vivax transmission across diverse settings in the Peruvian Amazon by applying the Pv AmpliSeq v2 Peru assay. Results reveal local patterns of low diversity and connectivity, that are important for the NMEP to improve and tailor strategies in accordance with local epidemiology, ecology, and human population. Genomic surveillance of malaria using the Pv AmpliSeq v2 Peru assay can be a valuable tool for routine application in the country and potentially in other countries in South America with similar allele frequencies of the barcode SNPs. To ensure sustained progress in malaria control and eventual long-term elimination, it is essential to prioritize the effectiveness of NMCP/NMEPs through the optimization of resource utilization and the strategic implementation of necessary changes. Informed decision-making, guided by relevant data and an increasing recognition of the potential of genetic surveillance tools, is crucial for effectively addressing the unique challenges associated with P. vivax control and elimination efforts.

Supporting information

S1 Table. Summary of samples and studies included in the Pv AmpliSeq assay.

(XLSX)

pntd.0011879.s001.xlsx (20.1KB, xlsx)
S2 Table. Pv AmpliSeq v2 Peru design.

Primer sequences and target regions in the AmpliSeq design.

(XLSX)

pntd.0011879.s002.xlsx (57.3KB, xlsx)
S3 Table. Variants of interest list.

(XLSX)

pntd.0011879.s003.xlsx (20.9KB, xlsx)
S4 Table. Allele frequencies of barcode positions and microhaplotypes.

Allele frequencies were determined in a WGS dataset and are reported for worldwide, Latin American countries only, and Peru.

(XLSX)

pntd.0011879.s004.xlsx (36.3KB, xlsx)
S5 Table. Variants contributing to the population differentiation in the DAPC.

(XLSX)

pntd.0011879.s005.xlsx (23.3KB, xlsx)
S6 Table. Haplotypes and haplotype frequencies of the variants of interest observed in Peru and control samples.

(XLSX)

pntd.0011879.s006.xlsx (54.1KB, xlsx)
S1 Text

Fig A. Flowchart describing number and type of samples and control attempted with the Pv AmpliSeq assay in this study. Fig B. Global Plasmodium vivax population structure. First two principal components of PCA analysis of 1474 high-quality P. vivax genomes using LD-pruned SNPs across the core genome as previously described in Kattenberg et al 2024 (https://doi.org/10.1002/ece3.11103). The samples (dots) are colored according to the originating population (here region), following classifications from Adam et al., 2022 (https://doi.org/10.12688/WELLCOMEOPENRES.17795.1). Fig C. Distribution of depth of coverage for each amplicon in the Pv AmpliSeq v2 Peru assay. Fig D. Proportion of polyclonal infections detected in each district. Within-host infection complexity was used as a measure of complexity of infections, using within-sample F-statistic (Fws) ≥ 0.95 as proxy for a monoclonal infection. Fig E. Scatter plot of principal components 1 & 2 (A) and 3 & 4 (B) using all variants detected by the Pv AmpliSeq v2 Peru assay of Peru samples (n = 230) grouped by district. Fig F. Parasite relatedness. Network of individual relatedness at (A) intermediate levels of relatedness (50% IBD threshold) and (B) Very high levels of relatedness, indicating clonal infections (95% IBD threshold) colored by years. Fig G. Parasite relatedness and haplotypes of variants of interest. Network of individual relatedness at intermediate levels of relatedness (50% IBD threshold) colored by haplotypes for the different genes. Fig H. Distribution of pvama1 haplotypes by year and district.

(DOCX)

pntd.0011879.s007.docx (3.5MB, docx)
S1 Data. Database of samples.

List of samples and data included in the study.

(XLSX)

pntd.0011879.s008.xlsx (288.5KB, xlsx)

Acknowledgments

First, we want to thank all the participants, field workers, and related staff of the different projects included in this work for making the sample collection possible and their material available for this study. We would also like to acknowledge Prof. Dr. Jean-Pierre Van geertruyden (Global Health Institute, UAntwerp) and Dr. Hugo Valdivia (Naval Medical Research Unit SOUTH) for their contribution on Trompeteros data and sample acquisition. Finally, we want to thank Mr. Carlos Acosta for his help in the preparation of the map in Fig 1.

Data Availability

Raw data (FASTQ files) are available at the SRA under BioProject accession number PRJNA1055117 and individual sample accession numbers are listed in supplementary data file S1 Data. Data preparation scripts are available at https://github.com/Ekattenberg/Plasmodium-AmpliSeq-Pipeline. All other data are included in the manuscript and supporting information.

Funding Statement

This work was funded by the Belgium Development Cooperation (DGD) under the Framework Agreement Program between the DGD and ITM (FA4 Peru, 2017–2021 & FA5 Peru, 2022–2026) (AR-U, DG). Sample collections in Trompeteros and Yavari were supported by VLIR-UOS (project PE2018TEA470A102; University of Antwerp) (CD-R). LC-S is supported by a doctoral scholarship of PROCIENCIA in the framework of the project EC-165-2020-FONDECYT-Programa de Doctorado en Ciencias mención Bioquímica y Biología molecular-UPCH. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.World Health Organization. World malaria report 2022. Geneva: World Health Organization; 2022. [Google Scholar]
  • 2.Ministerio de Salud. Número de casos de malaria, Perú 2019–2022. 2022. Available from: https://www.dge.gob.pe/portal/docs/vigilancia/sala/2022/SE52/malaria.pdf. [Google Scholar]
  • 3.Ferreira MU, Corder RM, Johansen IC, Kattenberg JH, Moreno M, Rosas-Aguirre A, et al. Relative contribution of low-density and asymptomatic infections to Plasmodium vivax transmission in the Amazon: pooled analysis of individual participant data from population-based cross-sectional surveys. Lancet Reg Health Am. 2022;9. Epub 2022/06/07. doi: 10.1016/j.lana.2021.100169 ; PubMed Central PMCID: PMC9161731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Rovira-Vallbona E, Contreras-Mancilla JJ, Ramirez R, Guzmán-Guzmán M, Carrasco-Escobar G, Llanos-Cuentas A, et al. Predominance of asymptomatic and sub-microscopic infections characterizes the Plasmodium gametocyte reservoir in the Peruvian Amazon. PLoS Negl Trop Dis. 2017;11(7):e0005674. Epub 2017/07/04. doi: 10.1371/journal.pntd.0005674 ; PubMed Central PMCID: PMC5510906 following competing interests: Joseph M. Vinetz is member of the Editorial Board of PLoS NTD. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Robortella DR, Calvet AA, Amaral LC, Fantin RF, Guimarães LFF, França Dias MH, et al. Prospective assessment of malaria infection in a semi-isolated Amazonian indigenous Yanomami community: Transmission heterogeneity and predominance of submicroscopic infection. PLoS One. 2020;15(3):e0230643. Epub 2020/03/20. doi: 10.1371/journal.pone.0230643 ; PubMed Central PMCID: PMC7081991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Palma-Cuero M, Machado MB, Graça JTB, Anjos NBD, Pereira RS, Suárez-Mutis MC. Malaria at international borders: challenges for elimination on the remote Brazil-Peru border. Rev Inst Med Trop Sao Paulo. 2022;64:e29. Epub 2022/04/07. doi: 10.1590/S1678-9946202264029 ; PubMed Central PMCID: PMC8993150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Carrasco-Escobar G, Castro MC, Barboza JL, Ruiz-Cabrejos J, Llanos-Cuentas A, Vinetz JM, et al. Use of open mobile mapping tool to assess human mobility traceability in rural offline populations with contrasting malaria dynamics. PeerJ. 2019;7:e6298. Epub 2019/01/31. doi: 10.7717/peerj.6298 ; PubMed Central PMCID: PMC6346981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nolasco O, Montoya J, Rosales Rosas AL, Barrientos S, Rosanas-Urgell A, Gamboa D. Multicopy targets for Plasmodium vivax and Plasmodium falciparum detection by colorimetric LAMP. Malar J. 2021;20(1):225. Epub 2021/05/21. doi: 10.1186/s12936-021-03753-8 ; PubMed Central PMCID: PMC8135177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Serra-Casas E, Guetens P, Chiheb D, Gamboa D, Rosanas-Urgell A. A pilot evaluation of alternative procedures to simplify LAMP-based malaria diagnosis in field conditions. Acta Trop. 2019;200:105125. Epub 2019/08/09. doi: 10.1016/j.actatropica.2019.105125 . [DOI] [PubMed] [Google Scholar]
  • 10.Serra-Casas E, Manrique P, Ding XC, Carrasco-Escobar G, Alava F, Gave A, et al. Loop-mediated isothermal DNA amplification for asymptomatic malaria detection in challenging field settings: Technical performance and pilot implementation in the Peruvian Amazon. PLoS One. 2017;12(10):e0185742. Epub 2017/10/06. doi: 10.1371/journal.pone.0185742 ; PubMed Central PMCID: PMC5628891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ministerio de Salud. Documento Técnico: Plan hacia la Eliminación de la Malaria 2022–2030. 2022. Available from: https://bvs.minsa.gob.pe/local/fi-admin/RM-034-2022%20MINSA.pdf. [Google Scholar]
  • 12.Rosas-Aguirre A, Gamboa D, Manrique P, Conn JE, Moreno M, Lescano AG, et al. Epidemiology of Plasmodium vivax Malaria in Peru. Am J Trop Med Hyg. 2016;95(6 Suppl):133–44. Epub 2016/11/02. doi: 10.4269/ajtmh.16-0268 ; PubMed Central PMCID: PMC5201219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Auburn S, Barry AE. Dissecting malaria biology and epidemiology using population genetics and genomics. Int J Parasitol. 2017;47(2–3):77–85. Epub 2016/11/09. doi: 10.1016/j.ijpara.2016.08.006 . [DOI] [PubMed] [Google Scholar]
  • 14.Dalmat R, Naughton B, Kwan-Gett TS, Slyker J, Stuckey EM. Use cases for genetic epidemiology in malaria elimination. Malar J. 2019;18(1):163. Epub 2019/05/09. doi: 10.1186/s12936-019-2784-0 ; PubMed Central PMCID: PMC6503548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Noviyanti R, Miotto O, Barry A, Marfurt J, Siegel S, Thuy-Nhien N, et al. Implementing parasite genotyping into national surveillance frameworks: feedback from control programmes and researchers in the Asia-Pacific region. Malar J. 2020;19(1):271. Epub 2020/07/29. doi: 10.1186/s12936-020-03330-5 ; PubMed Central PMCID: PMC7385952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Ibrahim A, Manko E, Dombrowski JG, Campos M, Benavente ED, Nolder D, et al. Population-based genomic study of Plasmodium vivax malaria in seven Brazilian states and across South America. Lancet Reg Health Am. 2023;18:100420. Epub 2023/02/28. doi: 10.1016/j.lana.2022.100420 ; PubMed Central PMCID: PMC9950661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Rougeron V, Daron J, Fontaine MC, Prugnolle F. Evolutionary history of Plasmodium vivax and Plasmodium simium in the Americas. Malar J. 2022;21(1):141. Epub 2022/05/04. doi: 10.1186/s12936-022-04132-7 ; PubMed Central PMCID: PMC9066938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kattenberg JH, Monsieurs P, De Meyer J, De Meulenaere K, Sauve E, de Oliveira TC, et al. Population genomic evidence of structured and connected Plasmodium vivax populations under host selection in Latin America. Ecol Evol. 2024;14(3):e11103. Epub 2024/03/26. doi: 10.1002/ece3.11103 ; PubMed Central PMCID: PMC10961478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Delgado-Ratto C, Gamboa D, Soto-Calle VE, Van den Eede P, Torres E, Sánchez-Martínez L, et al. Population Genetics of Plasmodium vivax in the Peruvian Amazon. PLoS Negl Trop Dis. 2016;10(1):e0004376. Epub 2016/01/15. doi: 10.1371/journal.pntd.0004376 ; PubMed Central PMCID: PMC4713096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Villena FE, Sanchez JF, Nolasco O, Braga G, Ricopa L, Barazorda K, et al. Drug resistance and population structure of Plasmodium falciparum and Plasmodium vivax in the Peruvian Amazon. Sci Rep. 2022;12(1):16474. Epub 2022/10/02. doi: 10.1038/s41598-022-21028-3 ; PubMed Central PMCID: PMC9526214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Manrique P, Miranda-Alban J, Alarcon-Baldeon J, Ramirez R, Carrasco-Escobar G, Herrera H, et al. Microsatellite analysis reveals connectivity among geographically distant transmission zones of Plasmodium vivax in the Peruvian Amazon: A critical barrier to regional malaria elimination. PLoS Negl Trop Dis. 2019;13(11):e0007876. Epub 2019/11/12. doi: 10.1371/journal.pntd.0007876 ; PubMed Central PMCID: PMC6874088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kattenberg JH, Fernandez-Miñope C, van Dijk NJ, Llacsahuanga Allcca L, Guetens P, Valdivia HO, et al. Malaria Molecular Surveillance in the Peruvian Amazon with a Novel Highly Multiplexed Plasmodium falciparum AmpliSeq Assay. Microbiol Spectr. 2023;11(2):e0096022. Epub 2023/02/26. doi: 10.1128/spectrum.00960-22 ; PubMed Central PMCID: PMC10101074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Kattenberg JH, Nguyen HV, Nguyen HL, Sauve E, Nguyen NTH, Chopo-Pizarro A, et al. Novel highly-multiplexed AmpliSeq targeted assay for Plasmodium vivax genetic surveillance use cases at multiple geographical scales. Front Cell Infect Microbiol. 2022;12:953187. Epub 2022/08/30. doi: 10.3389/fcimb.2022.953187 ; PubMed Central PMCID: PMC9403277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.De Meulenaere K, Prajapati SK, Villasis E, Cuypers B, Kattenberg JH, Kasian B, et al. Band 3-mediated Plasmodium vivax invasion is associated with transcriptional variation in PvTRAg genes. Front Cell Infect Microbiol. 2022;12:1011692. Epub 2022/10/18. doi: 10.3389/fcimb.2022.1011692 ; PubMed Central PMCID: PMC9563252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Cowell AN, Valdivia HO, Bishop DK, Winzeler EA. Exploration of Plasmodium vivax transmission dynamics and recurrent infections in the Peruvian Amazon using whole genome sequencing. Genome Med. 2018;10(1):52. Epub 2018/07/06. doi: 10.1186/s13073-018-0563-0 ; PubMed Central PMCID: PMC6032790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Dharia NV, Bright AT, Westenberger SJ, Barnes SW, Batalov S, Kuhen K, et al. Whole-genome sequencing and microarray analysis of ex vivo Plasmodium vivax reveal selective pressure on putative drug resistance genes. Proc Natl Acad Sci U S A. 2010;107(46):20045–50. Epub 2010/11/03. doi: 10.1073/pnas.1003776107 ; PubMed Central PMCID: PMC2993397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Flannery EL, Wang T, Akbari A, Corey VC, Gunawan F, Bright AT, et al. Next-Generation Sequencing of Plasmodium vivax Patient Samples Shows Evidence of Direct Evolution in Drug-Resistance Genes. ACS Infect Dis. 2015;1(8):367–79. Epub 2016/01/01. doi: 10.1021/acsinfecdis.5b00049 ; PubMed Central PMCID: PMC4692371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hupalo DN, Luo Z, Melnikov A, Sutton PL, Rogov P, Escalante A, et al. Population genomics studies identify signatures of global dispersal and drug resistance in Plasmodium vivax. Nat Genet. 2016;48(8):953–8. Epub 2016/06/28. doi: 10.1038/ng.3588 ; PubMed Central PMCID: PMC5347536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Auburn S, Böhme U, Steinbiss S, Trimarsanto H, Hostetler J, Sanders M, et al. A new Plasmodium vivax reference sequence with improved assembly of the subtelomeres reveals an abundance of pir genes. Wellcome Open Res. 2016;1:4. Epub 2016/12/23. doi: 10.12688/wellcomeopenres.9876.1 ; PubMed Central PMCID: PMC5172418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11:94. Epub 2010/10/19. doi: 10.1186/1471-2156-11-94 ; PubMed Central PMCID: PMC2973851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Gamboa D, Ho MF, Bendezu J, Torres K, Chiodini PL, Barnwell JW, et al. A large proportion of P. falciparum isolates in the Amazon region of Peru lack pfhrp2 and pfhrp3: implications for malaria rapid diagnostic tests. PLoS One. 2010;5(1):e8091. Epub 2010/01/30. doi: 10.1371/journal.pone.0008091 ; PubMed Central PMCID: PMC2810332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Rosas-Aguirre A, Moreno M, Moreno-Gutierrez D, Llanos-Cuentas A, Saavedra M, Contreras-Mancilla J, et al. Integrating Parasitological and Entomological Observations to Understand Malaria Transmission in Riverine Villages in the Peruvian Amazon. J Infect Dis. 2021;223(12 Suppl 2):S99–s110. Epub 2021/04/28. doi: 10.1093/infdis/jiaa496 ; PubMed Central PMCID: PMC8079135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Rosado J, Carrasco-Escobar G, Nolasco O, Garro K, Rodriguez-Ferruci H, Guzman-Guzman M, et al. Malaria transmission structure in the Peruvian Amazon through antibody signatures to Plasmodium vivax. PLoS Negl Trop Dis. 2022;16(5):e0010415. Epub 2022/05/10. doi: 10.1371/journal.pntd.0010415 ; PubMed Central PMCID: PMC9119515 following competing interests: MTW and IM are inventors on patent PCT/US17/67926 on a system, method, apparatus and diagnostic test for Plasmodium vivax. No other authors declare a conflict of interest. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Villasis E, Garro K, Rosas-Aguirre A, Rodriguez P, Rosado J, Gave A, et al. PvMSP8 as a Novel Plasmodium vivax Malaria Sero-Marker for the Peruvian Amazon. Pathogens. 2021;10(3). Epub 2021/04/04. doi: 10.3390/pathogens10030282 ; PubMed Central PMCID: PMC7999794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Mangold KA, Manson RU, Koay ES, Stephens L, Regner M, Thomson RB Jr, et al. Real-time PCR for detection and identification of Plasmodium spp. J Clin Microbiol. 2005;43(5):2435–40. Epub 2005/05/06. doi: 10.1128/JCM.43.5.2435-2440.2005 ; PubMed Central PMCID: PMC1153761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Gruenberg M, Moniz CA, Hofmann NE, Wampfler R, Koepfli C, Mueller I, et al. Plasmodium vivax molecular diagnostics in community surveys: pitfalls and solutions. Malar J. 2018;17(1):55. Epub 2018/01/31. doi: 10.1186/s12936-018-2201-0 ; PubMed Central PMCID: PMC5789620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kattenberg JH, Van Dijk NJ, Fernández-Miñope CA, Guetens P, Mutsaers M, Gamboa D, et al. Molecular Surveillance of Malaria Using the PF AmpliSeq Custom Assay for Plasmodium falciparum Parasites from Dried Blood Spot DNA Isolates from Peru. Bio Protoc. 2023;13(5):e4621. Epub 2023/03/14. doi: 10.21769/BioProtoc.4621 ; PubMed Central PMCID: PMC9993081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6(2):80–92. Epub 2012/06/26. doi: 10.4161/fly.19695 ; PubMed Central PMCID: PMC3679285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Taylor AR, Jacob PE, Neafsey DE, Buckee CO. Estimating Relatedness Between Malaria Parasites. Genetics. 2019;212(4):1337–51. Epub 2019/06/19. doi: 10.1534/genetics.119.302120 ; PubMed Central PMCID: PMC6707449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24(11):1403–5. Epub 2008/04/10. doi: 10.1093/bioinformatics/btn129 . [DOI] [PubMed] [Google Scholar]
  • 41.Jombart T, Ahmed I. adegenet 1.3–1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27(21):3070–1. Epub 2011/09/20. doi: 10.1093/bioinformatics/btr521 ; PubMed Central PMCID: PMC3198581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Goudet J, Jombart T, Kamvar Z, Archer E, Hardy O. Package ‘hierfstat’ 2022. Available from: https://cran.r-project.org/web/packages/hierfstat/hierfstat.pdf.
  • 43.Lee S, Bahlo M. moimix: an R package for assessing clonality in high-througput sequencing data. moimix: an R package for assessing clonality in high-throughput sequencing data. 2016. [Google Scholar]
  • 44.Auburn S, Campino S, Miotto O, Djimde AA, Zongo I, Manske M, et al. Characterization of within-host Plasmodium falciparum diversity using next-generation sequence data. PLoS One. 2012;7(2):e32891. Epub 2012/03/07. doi: 10.1371/journal.pone.0032891 ; PubMed Central PMCID: PMC3290604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Henden L, Lee S, Mueller I, Barry A, Bahlo M. Identity-by-descent analyses for measuring population dynamics and selection in recombining pathogens. PLoS Genet. 2018;14(5):e1007279. Epub 2018/05/24. doi: 10.1371/journal.pgen.1007279 ; PubMed Central PMCID: PMC5988311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Trimarsanto H, Amato R, Pearson RD, Sutanto E, Noviyanti R, Trianty L, et al. A molecular barcode and web-based data analysis tool to identify imported Plasmodium vivax malaria. Commun Biol. 2022;5(1):1411. Epub 2022/12/24. doi: 10.1038/s42003-022-04352-2 ; PubMed Central PMCID: PMC9789135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Taylor JE, Pacheco MA, Bacon DJ, Beg MA, Machado RL, Fairhurst RM, et al. The evolutionary history of Plasmodium vivax as inferred from mitochondrial genomes: parasite genetic diversity in the Americas. Mol Biol Evol. 2013;30(9):2050–64. Epub 2013/06/05. doi: 10.1093/molbev/mst104 ; PubMed Central PMCID: PMC3748350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.McCollum AM, Soberon V, Salas CJ, Santolalla ML, Udhayakumar V, Escalante AA, et al. Genetic variation and recurrent parasitaemia in Peruvian Plasmodium vivax populations. Malar J. 2014;13:67. Epub 2014/02/27. doi: 10.1186/1475-2875-13-67 ; PubMed Central PMCID: PMC3941685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Carrasco-Escobar G, Gamboa D, Castro MC, Bangdiwala SI, Rodriguez H, Contreras-Mancilla J, et al. Micro-epidemiology and spatial heterogeneity of P. vivax parasitaemia in riverine communities of the Peruvian Amazon: A multilevel analysis. Sci Rep. 2017;7(1):8082. Epub 2017/08/16. doi: 10.1038/s41598-017-07818-0 ; PubMed Central PMCID: PMC5556029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Baldeviano GC, Okoth SA, Arrospide N, Gonzalez RV, Sánchez JF, Macedo S, et al. Molecular Epidemiology of Plasmodium falciparum Malaria Outbreak, Tumbes, Peru, 2010–2012. Emerg Infect Dis. 2015;21(5):797–803. Epub 2015/04/22. doi: 10.3201/eid2105.141427 ; PubMed Central PMCID: PMC4412223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Montenegro CC, Bustamante-Chauca TP, Pajuelo Reyes C, Bernal M, Gonzales L, Tapia-Limonchi R, et al. Plasmodium falciparum outbreak in native communities of Condorcanqui, Amazonas, Perú. Malar J. 2021;20(1):88. Epub 2021/02/14. doi: 10.1186/s12936-021-03608-2 ; PubMed Central PMCID: PMC7880654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Okoth SA, Chenet SM, Arrospide N, Gutierrez S, Cabezas C, Matta JA, et al. Molecular Investigation into a Malaria Outbreak in Cusco, Peru: Plasmodium falciparum BV1 Lineage is Linked to a Second Outbreak in Recent Times. Am J Trop Med Hyg. 2016;94(1):128–31. Epub 2015/10/21. doi: 10.4269/ajtmh.15-0442 ; PubMed Central PMCID: PMC4710416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.de Oliveira TC, Corder RM, Early A, Rodrigues PT, Ladeia-Andrade S, Alves JMP, et al. Population genomics reveals the expansion of highly inbred Plasmodium vivax lineages in the main malaria hotspot of Brazil. PLoS Negl Trop Dis. 2020;14(10):e0008808. Epub 2020/10/29. doi: 10.1371/journal.pntd.0008808 ; PubMed Central PMCID: PMC7592762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Sutanto E, Pava Z, Echeverry DF, Lopera-Mesa TM, Montenegro LM, Yasnot-Acosta MF, et al. Genomics of Plasmodium vivax in Colombia reveals evidence of local bottle-necking and inter-country connectivity in the Americas. Sci Rep. 2023;13(1):19779. Epub 2023/11/14. doi: 10.1038/s41598-023-46076-1 ; PubMed Central PMCID: PMC10643449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Buyon LE, Santamaria AM, Early AM, Quijada M, Barahona I, Lasso J, et al. Population genomics of Plasmodium vivax in Panama to assess the risk of case importation on malaria elimination. PLoS Negl Trop Dis. 2020;14(12):e0008962. Epub 2020/12/15. doi: 10.1371/journal.pntd.0008962 ; PubMed Central PMCID: PMC7769613 was unable to confirm their authorship contributions. On their behalf, the corresponding author has reported their contributions to the best of their knowledge. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Pacheco MA, Forero-Pena DA, Schneider KA, Chavero M, Gamardo A, Figuera L, et al. Malaria in Venezuela: changes in the complexity of infection reflects the increment in transmission intensity. Malar J. 2020;19(1):176. Epub 2020/05/10. doi: 10.1186/s12936-020-03247-z ; PubMed Central PMCID: PMC7206825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Gunderson AK, Kumar RE, Recalde-Coronel C, Vasco LE, Valle-Campos A, Mena CF, et al. Malaria Transmission and Spillover across the Peru-Ecuador Border: A Spatiotemporal Analysis. Int J Environ Res Public Health. 2020;17(20). Epub 2020/10/18. doi: 10.3390/ijerph17207434 ; PubMed Central PMCID: PMC7600436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Gosling R, Chimumbwa J, Uusiku P, Rossi S, Ntuku H, Harvard K, et al. District-level approach for tailoring and targeting interventions: a new path for malaria control and elimination. Malar J. 2020;19(1):125. Epub 2020/04/02. doi: 10.1186/s12936-020-03185-w ; PubMed Central PMCID: PMC7106871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Buyon LE, Elsworth B, Duraisingh MT. The molecular basis of antimalarial drug resistance in Plasmodium vivax. Int J Parasitol Drugs Drug Resist. 2021;16:23–37. Epub 2021/05/07. doi: 10.1016/j.ijpddr.2021.04.002 ; PubMed Central PMCID: PMC8113647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.World Health Organization. Methods for surveillance of antimalarial drug efficacy. Geneva: World Health Organization; 2009. [Google Scholar]
  • 61.Durand S, Cabezas C, Lescano AG, Galvez M, Gutierrez S, Arrospide N, et al. Efficacy of three different regimens of primaquine for the prevention of relapses of Plasmodium vivax malaria in the Amazon Basin of Peru. Am J Trop Med Hyg. 2014;91(1):18–26. Epub 2014/04/23. doi: 10.4269/ajtmh.13-0053 ; PubMed Central PMCID: PMC4080559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Graf PC, Durand S, Alvarez Antonio C, Montalvan C, Galves Montoya M, Green MD, et al. Failure of Supervised Chloroquine and Primaquine Regimen for the Treatment of Plasmodium vivax in the Peruvian Amazon. Malar Res Treat. 2012;2012:936067. Epub 2012/06/16. doi: 10.1155/2012/936067 ; PubMed Central PMCID: PMC3371340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Ferreira MU, Nobrega de Sousa T, Rangel GW, Johansen IC, Corder RM, Ladeia-Andrade S, et al. Monitoring Plasmodium vivax resistance to antimalarials: Persisting challenges and future directions. Int J Parasitol Drugs Drug Resist. 2021;15:9–24. Epub 2020/12/29. doi: 10.1016/j.ijpddr.2020.12.001 ; PubMed Central PMCID: PMC7770540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Aguiar AC, Pereira DB, Amaral NS, De Marco L, Krettli AU. Plasmodium vivax and Plasmodium falciparum ex vivo susceptibility to anti-malarials and gene characterization in Rondônia, West Amazon, Brazil. Malar J. 2014;13:73. Epub 2014/03/04. doi: 10.1186/1475-2875-13-73 ; PubMed Central PMCID: PMC3945814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Ladeia-Andrade S, Menezes MJ, de Sousa TN, Silvino ACR, de Carvalho JF, Salla LC Jr., et al. Monitoring the Efficacy of Chloroquine-Primaquine Therapy for Uncomplicated Plasmodium vivax Malaria in the Main Transmission Hot Spot of Brazil. Antimicrob Agents Chemother. 2019;63(5). Epub 2019/02/21. doi: 10.1128/AAC.01965-18 ; PubMed Central PMCID: PMC6496112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Medrano Zavala A. Resistencia de Plasmodium vivax y Plasmodium falciparum a medicamentos antimaláricos mediante la inmunodetección de lactato deshidrogenasa del parásito, Iquitos, 2015–2019. Peru: Universidad Nacional Mayor de San Marcos; 2023. [Google Scholar]
  • 67.Hansson H, Saidi Q, Alifrangis M. Preservation and Extraction of Malaria Parasite DNA from Dried Blood Spots. Methods Mol Biol. 2022;2470:27–36. Epub 2022/07/27. doi: 10.1007/978-1-0716-2189-9_4 . [DOI] [PubMed] [Google Scholar]
  • 68.Adam I, Alam MS, Alemu S, Amaratunga C, Amato R, Andrianaranjaka V, et al. An open dataset of Plasmodium vivax genome variation in 1,895 worldwide samples. Wellcome Open Res. 2022;7:136. Epub 2022/06/03. doi: 10.12688/wellcomeopenres.17795.1 ; PubMed Central PMCID: PMC9127374. [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0011879.r001

Decision Letter 0

Susan Madison-Antenucci, Andrés Aranda-Diaz

12 Apr 2024

Dear Dr. Kattenberg,

Thank you very much for submitting your manuscript "Plasmodium vivax genomic surveillance in the Peruvian Amazon with Pv AmpliSeq assay" for consideration at PLOS Neglected Tropical Diseases. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers.

The reviewers agree that your manuscript meets the criteria for publication and have provided feedback to improve the clarity throughout the sections of the paper. I would like to highlight that added clarity on the sources of data for each of the figures and text and the conclusions derived from those analysis will ensure that the novel aspects of the tool presented in the manuscript are contrasted to previous work and available data, addressing some of the major comments from the reviewers.

Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Andrés Aranda-Diaz, Ph.D.

Guest Editor

PLOS Neglected Tropical Diseases

Susan Madison-Antenucci

Section Editor

PLOS Neglected Tropical Diseases

***********************

Reviewer's Responses to Questions

Key Review Criteria Required for Acceptance?

As you describe the new analyses required for acceptance, please consider the following:

Methods

-Are the objectives of the study clearly articulated with a clear testable hypothesis stated?

-Is the study design appropriate to address the stated objectives?

-Is the population clearly described and appropriate for the hypothesis being tested?

-Is the sample size sufficient to ensure adequate power to address the hypothesis being tested?

-Were correct statistical analysis used to support conclusions?

-Are there concerns about ethical or regulatory requirements being met?

Reviewer #1: Number of samples you attend vs successfully genotyped not mentioned. This is very important to see your new Pv AmpliSeq panel robustness. Moreover, the following statement “For the analysis of genomic surveillance use cases in Peru, we included 230 samples with good quality data (<25% missing genotype calls for all variants, mean coverage >15), and retained only one library of replicates (with lowest missingness)” not clear. What is minimum sample and SNPs missingness rate in your data set. How many of samples and SNPs/loci/genotypes removed from total based on your filtering criteria. The association between sequence coverage and parasite density is critical but not mentioned!

Reviewer #2: -Are the objectives of the study clearly articulated with a clear testable hypothesis stated?

Yes

-Is the study design appropriate to address the stated objectives?

Yes

-Is the population clearly described and appropriate for the hypothesis being tested?

Yes

-Is the sample size sufficient to ensure adequate power to address the hypothesis being tested?

Yes

-Were correct statistical analysis used to support conclusions?

Yes

-Are there concerns about ethical or regulatory requirements being met?

No

Reviewer #3: Yes

--------------------

Results

-Does the analysis presented match the analysis plan?

-Are the results clearly and completely presented?

-Are the figures (Tables, Images) of sufficient quality for clarity?

Reviewer #1: Yes, well addressed. Few comments attached below.

Reviewer #2: -Does the analysis presented match the analysis plan?

Yes

-Are the results clearly and completely presented?

Yes

-Are the figures (Tables, Images) of sufficient quality for clarity?

They seem good but too low res to conclude now. Labeling throughout does seem limited and small in font.

Reviewer #3: Yes - mostly, just some minor clarifications needed as described in the report

--------------------

Conclusions

-Are the conclusions supported by the data presented?

-Are the limitations of analysis clearly described?

-Do the authors discuss how these data can be helpful to advance our understanding of the topic under study?

-Is public health relevance addressed?

Reviewer #1: Yes. Here are some comments. From FST vs IBD sharing and HE VS FWS. Worthy to mention which metrics are robust for Pv genomic surveillance and more correlated with transmission intensity. Which metrics useful for malaria control efforts?

Reviewer #2: -Are the conclusions supported by the data presented?

Yes

-Are the limitations of analysis clearly described?

Yes

-Do the authors discuss how these data can be helpful to advance our understanding of the topic under study?

Yes

-Is public health relevance addressed?

Yes

Reviewer #3: Yes

--------------------

Editorial and Data Presentation Modifications?

Use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity. If the only modifications needed are minor and/or editorial, you may wish to recommend “Minor Revision” or “Accept”.

Reviewer #1: “Minor Revision”

Reviewer #2: Minor revision

Reviewer #3: Minor Revision

--------------------

Summary and General Comments

Use this section to provide overall comments, discuss strengths/weaknesses of the study, novelty, significance, general execution and scholarship. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. If requesting major revision, please articulate the new experiments that are needed.

Reviewer #1: This manuscript by Johanna Helena Kattenberg and colleagues examined Plasmodium vivax genomic surveillance in the Peruvian Amazon using multiplex and robust Pv AmpliSeq panel. The study is timely study as Pv highly prevalent in hard-to-reach community of Peru and it is a major impediment to achieve malaria elimination goals. Overall, the manuscript is well written however it lacks novelty and key gaps to be addressed by the study. Detecting previously reported population differentiation is not good enough.

Reviewer #2: Methods comments:

What was the success rate? You mention 230 samples, but how many did you attempt to use initially?

I think you could add a few more words on the in-house variant calling pipeline. Is this a (targeted) mapping + GATK haplotype calling procedure as commonly applied to WGS reads, or are additional amplicon denoising strategies applied?

Whats the target size range (excl adapters) and how many samples do you sequence in single 600-cycle run?

Why do you select Fws ≥ 0.95 as a proxy for a monoclonal infection?

Would it be useful to show sampling month in Fig 2 (e.g., as pie slices) and in S1 Table, since suggesting outbreak dynamics in some areas of text?

Results comments:

Fig. 1A: Could be expanded with larger labels and replace 1B,C,D - which currently provide little extra info.

Since text talks of peri-urban Pv connectivity (as opposed to simply spatial proximity-based connectivity) it would be useful to see some representation of urban extent (around Iquitos etc) in the map. Currently also the grey lines are not clearly explained. Rivers etc? Also, it could be useful to use some of the same province-level coloring in components of Figs 5-6? (since it seems most relatedness patterns related to within versus between province membership).

Fig. 3: I suggest to label the genome/assay regions underlying each PCA directly in the subplots.

Figs 5A-B: Currently the node colors are not visible enough. Could make them bigger and better associate coloring with geography, e.g., add a small map and/or associate coloring with main map in Fig. 1.

How are p-values calculated for differences in Pi for Iquitos compared to its surrounding districts and for 2007-2008 vs. later years?

Also please label Fig. 6A-B thresholds directly in the plot.

Please clarify meaning of left vs. right in Fig. 6C. Thickness of curves on the left indicates how many samples from the left-indicated district show IBD > 0.10 IBD to any sample in the district indicated at right? Etc.

Conclusions/Discussion comments:

I think a few sentences putting results in the context of transmission dynamics observed in other South American Pv regions would be useful. For example, the study suggests relatively clear regional clustering within Peru, signals of Brazilian diversity and outbreaking in Mariscal Ramon Castilla, and temporal turnover in Maynas. Do other regions show similar degrees of spatiotemporal population differentiation? If yes/no, how might environmental configuration and human activity patterns play a role?

Reviewer #3: The article describes some great, new, country-specific (Peru) amplicon-based sequencing tools for P. vivax that have the potential to support control and elimination efforts in Peru. The methods, the data showcasing the utilities of the tools and the conclusions are mostly robust. I recommend a few revisions that may help to improve the clarity of various approaches. One of the major confusions is regarding whether one can/should use the 41 biallelic SNPs and when one needs to use the full panel of variants called. The abstract only refers to 41 biallelic barcode SNPs, but many analyses use a larger panel of markers. More clarification on what one can expect from the tools (41 or hundreds of SNPs?) would be helpful for the reader to better understand what they might expect to achieve in their own studies if they use these tools.

Minor revisions

1. The introduction has an entire paragraph describing the challenge of asymptomatic vivax in Peru (p3), which gives the implication that the tools can help to address this. However, the Discussion section doesn't revisit how the study tools address this particular challenge. Some comments on this would be helpful.

2. The Introduction also alludes to the asymptomatic infections as a P. vivax-specific challenge - is that strictly true?

3. On p4, AmpliSeq tools are described as having previously identified temporal challenges before and after NMCP intervention (along with other utilities) - are the appropriate references for this statement 22 and 23?

4. On p5, it would be helpful to better understand why the authors aimed for 49 SNPs? Is this purely because of the DAPC results or did they have a prior maximum number based on multiplex requirements with the other SNPs? Did they consider IBD requirements as per the specifications of Taylor and colleagues for 200 biallelic SNPs (https://pubmed.ncbi.nlm.nih.gov/31209105/)? Some further details would be helpful here.

5. On p6, the authors describe <25% missingness as the cut-off - across all variants if I recall correctly. What was the maximum observed missingness at the 41-SNP barcode in the 230 defined good-quality samples?

6. On p6, the authors refer to 230 high-quality samples - is this 100% (i.e. 230/230) of samples on which sequencing was attempted?

7. On p7 the authors describe using Fws to characterize polyclonality - I believe on the 41-SNP barcode but they might clarify this. Given that the Fws was designed for genome-wide data, how well would this measure extrapolate to smaller barcodes?

8. On p7, the authors refer to a genomic dataset comprising 1474 genomes - is this the MalariaGEN resource? It wasn't clear.

9. On p7, the authors refer to 753 loci, which are not mentioned in the abstract. Is this because they do not anticipate them to be consistently polymorphic? Some more details on these loci, such as their MAF profile or the allelic diversity of each amplicon would be helpful.

10. The legend for Figure 3 (and perhaps the text on p7) needs more explanation on what the regional groups (AFR, ESEA etc) refer to. These look like MalariaGEN definitions but there is no reference to the definitions.

11. The country clustering in Figure 3 is not very tight. Is the resolution similar with genomic data - perhaps a reference figure would help. Only Panama separates well, but I believe they have experienced a strong bottleneck as described in https://pubmed.ncbi.nlm.nih.gov/33315861/. Having said the above, I appreciate that DAPC will only show parts of the full data...

12. On p8, it's helpful to have read depth information but would be even more informative if we had more details on how many samples were multiplexed in each run as that can affect the yield and is an important cost consideration.

13. The Results did not describe any negative controls (human DNA or other Plasmodium Spp) - were these tested and, if so, how did they perform?

14. On p8, the authors describe a lack of temporal trends with He (in the 41 SNPs) despite a trend with pi on the full variants - is this potentially because of how the 41 SNPs were selected? What years of collection, for example, do the genomic sets reflect? Or is the lack of a trend with He likely simply due to lower information content with the 41 SNPs?

15. On p8, can the authors clarify whether Fst was run on the 41 SNPs or all variants?

16. Was IBD calculated on the 41 SNPs or all variants? If the 41 SNPs, a comment in regards the 2019 study by Taylor mentioned above would be helpful.

17. On p11, were the misclassified samples also heterozygote at any of the global barcode SNPs?

18. On p12, the authors refer to the utility of CT screening by qPCR - what CT was found to be useful in this study?

--------------------

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Sarah Auburn

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included

Attachment

Submitted filename: Reviewer_comments3.doc

pntd.0011879.s009.doc (31.5KB, doc)
PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0011879.r003

Decision Letter 1

Susan Madison-Antenucci, Andrés Aranda-Diaz

12 Jun 2024

Dear Dr. Kattenberg,

We are pleased to inform you that your manuscript 'Plasmodium vivax genomic surveillance in the Peruvian Amazon with Pv AmpliSeq assay' has been provisionally accepted for publication in PLOS Neglected Tropical Diseases.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases.

Best regards,

Andrés Aranda-Diaz, Ph.D.

Guest Editor

PLOS Neglected Tropical Diseases

Susan Madison-Antenucci

Section Editor

PLOS Neglected Tropical Diseases

***********************************************************

We have reviewed the revisions to your manuscript. The changes satisfactorily address the reviewers' concerns. Therefore, we are pleased to inform you that your manuscript has been accepted for publication.

PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0011879.r004

Acceptance letter

Susan Madison-Antenucci, Andrés Aranda-Diaz

26 Jun 2024

Dear Dr. Kattenberg,

We are delighted to inform you that your manuscript, "Plasmodium vivax genomic surveillance in the Peruvian Amazon with Pv AmpliSeq assay," has been formally accepted for publication in PLOS Neglected Tropical Diseases.

We have now passed your article onto the PLOS Production Department who will complete the rest of the publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Editorial, Viewpoint, Symposium, Review, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript will be published online unless you opted out of this process. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases.

Best regards,

Shaden Kamhawi

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

Paul Brindley

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Summary of samples and studies included in the Pv AmpliSeq assay.

    (XLSX)

    pntd.0011879.s001.xlsx (20.1KB, xlsx)
    S2 Table. Pv AmpliSeq v2 Peru design.

    Primer sequences and target regions in the AmpliSeq design.

    (XLSX)

    pntd.0011879.s002.xlsx (57.3KB, xlsx)
    S3 Table. Variants of interest list.

    (XLSX)

    pntd.0011879.s003.xlsx (20.9KB, xlsx)
    S4 Table. Allele frequencies of barcode positions and microhaplotypes.

    Allele frequencies were determined in a WGS dataset and are reported for worldwide, Latin American countries only, and Peru.

    (XLSX)

    pntd.0011879.s004.xlsx (36.3KB, xlsx)
    S5 Table. Variants contributing to the population differentiation in the DAPC.

    (XLSX)

    pntd.0011879.s005.xlsx (23.3KB, xlsx)
    S6 Table. Haplotypes and haplotype frequencies of the variants of interest observed in Peru and control samples.

    (XLSX)

    pntd.0011879.s006.xlsx (54.1KB, xlsx)
    S1 Text

    Fig A. Flowchart describing number and type of samples and control attempted with the Pv AmpliSeq assay in this study. Fig B. Global Plasmodium vivax population structure. First two principal components of PCA analysis of 1474 high-quality P. vivax genomes using LD-pruned SNPs across the core genome as previously described in Kattenberg et al 2024 (https://doi.org/10.1002/ece3.11103). The samples (dots) are colored according to the originating population (here region), following classifications from Adam et al., 2022 (https://doi.org/10.12688/WELLCOMEOPENRES.17795.1). Fig C. Distribution of depth of coverage for each amplicon in the Pv AmpliSeq v2 Peru assay. Fig D. Proportion of polyclonal infections detected in each district. Within-host infection complexity was used as a measure of complexity of infections, using within-sample F-statistic (Fws) ≥ 0.95 as proxy for a monoclonal infection. Fig E. Scatter plot of principal components 1 & 2 (A) and 3 & 4 (B) using all variants detected by the Pv AmpliSeq v2 Peru assay of Peru samples (n = 230) grouped by district. Fig F. Parasite relatedness. Network of individual relatedness at (A) intermediate levels of relatedness (50% IBD threshold) and (B) Very high levels of relatedness, indicating clonal infections (95% IBD threshold) colored by years. Fig G. Parasite relatedness and haplotypes of variants of interest. Network of individual relatedness at intermediate levels of relatedness (50% IBD threshold) colored by haplotypes for the different genes. Fig H. Distribution of pvama1 haplotypes by year and district.

    (DOCX)

    pntd.0011879.s007.docx (3.5MB, docx)
    S1 Data. Database of samples.

    List of samples and data included in the study.

    (XLSX)

    pntd.0011879.s008.xlsx (288.5KB, xlsx)
    Attachment

    Submitted filename: Reviewer_comments3.doc

    pntd.0011879.s009.doc (31.5KB, doc)
    Attachment

    Submitted filename: Reviewer_comments_and_responses_V2.docx

    pntd.0011879.s010.docx (53KB, docx)

    Data Availability Statement

    Raw data (FASTQ files) are available at the SRA under BioProject accession number PRJNA1055117 and individual sample accession numbers are listed in supplementary data file S1 Data. Data preparation scripts are available at https://github.com/Ekattenberg/Plasmodium-AmpliSeq-Pipeline. All other data are included in the manuscript and supporting information.


    Articles from PLOS Neglected Tropical Diseases are provided here courtesy of PLOS

    RESOURCES