Abstract
Intervention against falciparum malaria in high transmission regions remains challenging, with relaxation of control efforts typically followed by rapid resurgence. Resilience to intervention co-occurs with incomplete immunity, whereby children eventually become protected from severe disease but not infection and a large transmission reservoir results from high asymptomatic prevalence across all ages. Incomplete immunity relates to the vast antigenic variation of the parasite, with the major surface antigen of the blood stage of infection encoded by the multigene family known as var. Recent deep sampling of var sequences from individual isolates in northern Ghana showed that parasite population structure exhibited persistent features of high-transmission regions despite the considerable decrease in prevalence during transient intervention with indoor residual spraying (IRS). We ask whether despite such apparent limited impact, the transmission system had been brought close to a transition in both prevalence and resurgence ability. With a stochastic agent-based model, we investigate the existence of such a transition to pre-elimination with intervention intensity, and of molecular indicators informative of its approach. We show that resurgence ability decreases sharply and nonlinearly across a narrow region of intervention intensities in model simulations, and identify informative molecular indicators based on var gene sequences. Their application to the survey data indicates that the transmission system in northern Ghana was brought close to transition by IRS. These results suggest that sustaining and intensifying intervention would have pushed malaria dynamics to a slow-rebound regime with an increased probability of local parasite extinction.
One Sentence Summary:
Population genomics of hyper-diverse var genes inform resurgence dynamics in falciparum malaria.
INTRODUCTION
High-transmission endemic regions present an important challenge to controlling and eliminating falciparum malaria. Although malaria prevalence has declined considerably in many parts of the world in the last two decades, high-transmission regions, found primarily in sub-Saharan Africa, tend to be highly resilient to intervention efforts (1). Here, malaria incidence often rebounds to pre-intervention levels once control is relaxed, and sustained efforts are limited not only by the availability of resources but also by the spread of resistance to insecticides and drug treatments (2, 3). Children experience clinical episodes, eventually becoming immune to severe disease but not infection (4). This results in a large reservoir of asymptomatic infections in hosts of all ages, which contributes to sustained disease transmission (5, 6). A similar transmission reservoir is found in other vector-borne diseases in domestic and wildlife hosts that exhibit high prevalence of infection but incomplete immunity with no clinical symptoms (7, 8, 9, 10).
In these pathogens, a high level of asymptomatic infection at high transmission rates and associated incomplete immunity are enabled by high antigenic variation encoded via multigene families (11). The var multigene family in the malaria parasite Plasmodium falciparum provides an example; it encodes for the major variant surface antigen (VSA) during the blood stage of infection, the protein PfEMP1 (Plasmodium falciparum erythrocyte membrane protein 1) (12, 13, 14, 15). Anti-PfEMP1 immunity has been found to be crucial to prevent severe disease manifestations and to clear infection (16, 17, 18, 19). Each parasite carries 50–60 var genes across its chromosomes. They are expressed largely sequentially, producing and exporting different variants of this protein to the surface of red blood cells (12, 20). Specific immune memory of a given VSA allows the host immune system to clear parasitized red blood cells, shortening infection and reducing the likelihood of parasite transmission. In contrast, when a strain expresses many VSAs a host has not previously encountered, long infectious periods ensue, enhancing its chance of transmission to another host (19, 12). Under high-transmission settings, local parasite populations exhibit a vast pool of var gene variants, documented to reach in the thousands to tens of thousands (21, 22, 23, 24) and generated mostly through mitotic and meiotic recombination but also mutation, as well as from migration of the human host and the mosquito vectors (25, 26, 27, 28). Previous studies have shown that negative frequency-dependent selection (NFDS) mediated by the acquisition of specific immunity by hosts contributes to the limited overlap of the var genes of both individual repertoires (individual parasite genomes) and isolates (sets of individual parasite genomes co-infecting individual hosts), in a pattern that is both non-random and non-neutral (23, 29). Given the large number of possible var gene combinations, hosts are not likely to have seen all or many of the VSAs encoded by the different circulating genomes, and they therefore remain susceptible to reinfection throughout their lifetime despite previous repeated exposure (24, 30). Multi-genomic infections are common resulting from either successive mosquito bites (i.e., superinfection) or a single mosquito bite (i.e., co-transmission) (31, 32, 33, 34).
A recent longitudinal field study in Bongo District, located in northern Ghana, implemented deep sampling and sequencing of var genes from individual isolates to monitor the response of the transmission system to a three-round transient intervention of indoor residual spraying (IRS) with insecticides (21, 35). Sequencing specifically concerns the DBLα tag, a small conserved ~450bp region within var genes which encodes for the immunogenic Duffy-binding-like alpha domain of PfEMP1 (21, 22, 23, 24, 36, 37, 38). Bioinformatic analyses of a large database of exon 1 sequences of var genes showed a predominantly 1-to-1 DBLα-var relationship, such that each DBLα tag typically represents a unique var gene (39). We use DBLα and var interchangeably hereafter. This surveillance through molecular epidemiology revealed persistent estimates of high var diversity and low parasite strain overlap (low repertoire similarity), characteristic features of high-transmission regions, despite a considerable decrease in prevalence and parasite population size (21, 35). The transmission system rebounded rapidly after the IRS was discontinued, with this transient intervention appearing to have had only a limited impact (21). Was this indeed the case? We examine here whether deep sampling of sequences of var genes in a host population can help us evaluate the impact of intervention further, from which we specifically infer whether sustaining and intensifying control efforts would have taken the system through a pronounced transition and averted the rapid rebound of prevalence.
To address these questions, we first investigate the response of the epidemiological system to transmission-reducing interventions with an extended stochastic agent-based model (ABM) that incorporates strain diversity from a multigene family and the interplay between parasite evolution and malaria population dynamics (Materials and Methods) (29). We address the existence of a threshold behavior with intervention intensity, whereby the perturbed system would lose its capacity to rebound rapidly, and that of molecular indicators of such a transition based on deep sampling of var gene sequences. We then apply the lessons learned from the computational model to field data from northern Ghana. Results indicate that the structure of diversity revealed via molecular surveillance can inform intervention against malaria in high-transmission endemic settings. These findings should be relevant to other pathogens of humans, wildlife, and livestock with similar immune evasion strategies based on hyper-diverse antigenic variation (7, 8, 9, 10).
RESULTS
We investigate how different intervention schemes impact the population genomic structure under different transmission regimes (Materials and Methods, see Fig. 1A–D for the numerical simulation designs and scenarios considered). We start with sustained interventions of a decade to then consider shorter, more realistic, transient ones (Materials and Methods, Fig. 1C). Numerical simulations of the ABM demonstrate three qualitatively different rebound dynamics in prevalence (Fig. 2A, figs. S1A) across the series of sustained IRS interventions (Fig. 1C–D). Systems under low- or mid-coverage vector control rebound rapidly in prevalence back to levels similar to those pre-intervention, in what we refer to as a fast-rebound regime. We use “rebound” hereafter synonymously with “resurgence” from the malaria literature, both describing a return to a baseline. In contrast, systems under high-coverage interventions remain at low prevalence, entering what we describe as a slow-rebound regime. Lastly, systems under interventions with levels between these two extremes gradually settle into intermediate prevalence, exhibiting a “transition” regime.
Fig. 1. Diagram of the numerical simulation design mimicking different lengths of the transmission-reducing intervention (the application of indoor residual spraying or IRS) and the study design of field data sampled from Bongo District in northern Ghana.
(A) Each simulation follows either two or three stages depending on intervention duration: a “pre-IRS” period during which the transmission in the local population reaches a stable state, followed by an “IRS” intervention period of ten or two years which reduces transmission rate (in what we call sustained vs. transient IRS, respectively), and a “post-IRS” period when transmission rates go back to their original levels only applicable to the transient IRS. Three different spatial configurations are implemented: a closed system with no migration, a semi-open system in which two individual parasite populations are connected via migration events, and a regionally-open system in which migrant genomes are sourced from a regional pool of var genes (Materials and Methods). The value of the baseline migration rate is inferred from field data from northern Ghana (Materials and Methods). Individual infections are only tracked locally, and other major stochastic events besides transmission include mutation of genes, recombination within and between genomes, and acquisition and loss of specific immune memory in hosts. (B) 14 different scenarios are considered in the simulation, indicated by Roman Numerals I to XIV. High migration rate is an order higher than the inferred baseline rate. Medium size regional pool and large one correspond to the aggregated diversity of a few and more than 10 individual local parasite populations with the region respectively. (C) Transmission intensity or effective contact rate (Materials and Methods) varies as a function of time, from pre-, to during, to post-intervention for a seasonal and closed system, with the two panels illustrating the sustained vs. transient implementation for the different reduction levels. (D) Different levels of intervention coverage are considered that reduce transmission rate log-linearly along the gradient of transmission intensity. For illustration purposes, we show here levels for a seasonal and closed system under sustained interventions. The lowest and highest reductions of the transmission rate correspond respectively to ~20% and ~90%, with colors corresponding to those in the left panel of C the sustained interventions. The highest reduction for transient interventions can correspond to more than 90%, reaching 96% for certain scenarios. (E) The study design of the three-round transient IRS in Bongo District in northern Ghana.
Fig. 2. A seasonal and regionally-open system’s response to a series of sustained interventions (scenario VII in Fig. 1B).
(A) Rebound dynamics of prevalence across time. The dynamics can be categorized into three regimes described respectively as fast rebound, transition, and slow rebound, depending on the behavior of prevalence over time. As the name indicates, for the fast-rebound regime, prevalence returns quickly to similar levels as those pre-intervention, whereas for the slow-rebound regime, prevalence does not recover within a decade and remain substantially lower, and the transition regime exhibits an intermediate recovery. (B) The long-term rebound prevalence levels at the end of the decade (sampled at the end of wet/high-transmission season) are shown against the transmission rates corresponding to the different intervention levels (with colors corresponding to those in Fig. 1D). (C) Rebound dynamics of var gene diversity measured by the Shannon Diversity Index across time. (D) Long-term rebound dynamics of var gene diversity against transmission rates (with values shown at the end of the decade). Both prevalence and var gene diversity exhibit a non-linear pattern against transmission rates. The fast-rebound regime corresponds to a wide interval, whereas the transition and slow-rebound regimes cover a narrow interval along the gradient of transmission rates. For the rebound dynamics of additional combinations of parameter values for closed, semi-open and regionally-open systems, see figs. S1 and S3.
The prevalence level reached after the rebound (a decade into sustained IRS, sampled at the end of the wet/high-transmission season for seasonal transmission) is highly nonlinear along the gradient of transmission intensity (Fig. 2B, figs. S1B). The fast-rebound regime occurs for a broad range within the gradient of transmission intensity, whereas the transition and slow-rebound regimes correspond instead to narrow ranges. Thus, the perturbed system experiences a sharp transition along the gradient of transmission intensities, from a rapid recovery down to a lingering low prevalence. Concomitantly, var diversity itself, whether measured as richness or the Shannon Diversity Index, also exhibits the three general behaviors, from the maintenance of high values, through the establishment of intermediate ones, to the eventual persistence of low levels, across the series of sustained IRS interventions (Fig. 2C, figs. S2A, figs. S3A). Post-intervention var diversity levels (a decade into sustained IRS) are also highly nonlinear with intervention transmission intensity (Fig. 2D, figs. S2B, figs. S3B).
The nonlinearity of the rebound in prevalence and corresponding var diversity levels along the gradient of transmission intensity holds, regardless of whether the system is under constant or seasonal transmission, and whether it is closed, semi-open, or regionally-open, over wide ranges of parameters (Fig. 2, figs. S1–3).
Although to examine long-term rebound dynamics we sustain the vector reduction for up to a decade in the numerical simulations, we find that the perturbed system in the slow-rebound regime can be prone to extinction at a faster time scale. Replicate runs can go extinct as early as in the fourth year into intervention (Fig. 3A), a period that is more comparable to what can be realistically implemented in empirical settings.
Fig. 3. The probability of extinction in the slow-rebound regime, short-term vs. long-term rebound prevalence levels, and the population-wise pairwise type sharing or PTS distribution before and early into sustained interventions for a seasonal and regionally-open system (scenario VII in Fig. 1B).
(A) Probability of parasite extinction calculated from replicate runs for intervention intensities representative of those in the slow-rebound regime. Extinction occurs as early as in the fourth year into sustained interventions. The two intervention levels correspond to those in Fig. 1D. (B) The short-term rebound prevalence levels at the end of the 2nd year of sustained intervention (sampled at the end of wet/high-transmission season) are shown against the transmission rates corresponding to the different intervention levels. For additional scenarios simulated, see figs. S5A. (C) Short-term rebound prevalence levels (in the second year of intervention, sampled at the end of wet/high-transmission season) are not good predictors of those in the longer term (at the end of the decade, sampled at the end of wet/high-transmission season). For additional scenarios simulated, see figs. S5B. (D) The pairwise type sharing or PTS distribution during sustained intervention without measurement error. With increased intervention intensities, and as the system approaches the transition regime, the PTS distribution exhibits the emergence of a second mode at complete overlap due to clonal expansion, and a shift toward lower values (of zero) at the lowest end due to reduction in meiotic recombination (or out-crossing) and hence the circulation of non-related strains. The emergence of the second mode is sensitive to the specification of parameters and disappears under measurement error (figs. S6–7). Thus, it is not a robust indicator, unsuitable for practical surveillance purposes, of approach to the transition regime.
Under low or mid-coverage vector control, transmission intensity is maintained at a somewhat high level (Fig. 1D). Immediately following intervention, the majority, or a high fraction, of infections can still get transmitted, resulting in a high surviving diversity (Fig. 2A, 2C, figs. S1A, S2A, S3A). Hosts’ immune profiles are heterogeneous because they were exposed to different subsets of the large circulating diversity before intervention. The high surviving diversity, coupled with hosts’ heterogeneous immune profiles, allows for a heterogeneous distribution of infection duration (figs. S4). Infections with longer duration are selected due to a reduction in transmission intensity. Thus, the distribution of infection duration immediately following intervention shifts to longer values compared to those before intervention (figs. S4). The selection for infections of longer duration effectively compensates for the reduction in transmission, allowing prevalence to start rebounding. Further into intervention, the loss of immunity due to the reduction in transmission and exposure increases individual susceptibility and consequently infection duration. Prevalence rebounds further, eventually reaching comparable levels to those pre-intervention. In contrast, under high-coverage vector control, immediately following intervention, transmission intensity and circulating diversity both drop to very low levels with a low fraction of infections getting transmitted (Fig. 1D, 2A, 2C, figs. S1A, S2A, S3A). Few infections can survive longer than the average waiting time between consecutive transmission events. The reduction in transmission intensity cannot be effectively compensated for by the shift in infection duration distribution towards longer values. Thus, the transmission system remains at low prevalence and diversity.
Having documented a threshold behavior of the transmission system in its response to intervention, we address next the existence of indicators revealing its proximity to the transition regime. Because prevalence is a commonly tracked quantity in malaria surveillance, we first examine whether it can function as one such indicator. The short-term rebound prevalence levels following simulated sustained IRS (two years into intervention) show a much more linear behavior with intervention levels than those after longer sustained control (Fig. 3B, figs. S5A). The relationship between these two types of prevalence levels is non-linear and the degree of nonlinearity varies across different spatial configurations and parameter values (Materials and Methods, Fig. 3C, figs. S5B). Overall, short-term prevalence levels are only weakly correlated with, and are by themselves only poorly predictive of, long-term rebound prevalence levels. Moreover, high-quality prevalence data is often unavailable in empirical settings, and efforts to collect these data can be hindered by low testing rates and imperfect detective power for asymptomatic infections (40). Thus, prevalence by itself does not provide sufficient information on either the proximity of the perturbed system to the transition regime or the loss of resilience of the epidemiological system.
As complementary information, we therefore address indicators related to the structure of diversity from the perspective of var genes and isolates. As we seek to apply such indicators to field data, we also consider whether they are robust to measurement errors common in sampling schemes (Materials and Methods). We further focus on responsive indicators that rely on sampling immediately following the start of intervention, specifically two years into it (at the end of wet/high-transmission season for seasonal systems).
We first compared whether the distributions of pairwise type sharing (PTS) differ between before and early into intervention (38). PTS, a similarity index analogous to the Sørensen or Jaccard Index (41, 42), describes the fraction of shared DBLα types between any two isolates and hence quantifies the repertoire similarity of parasites circulating in host population. If each isolate consisted of a monoclonal infection by a repertoire of one var gene, sample PTS would be equivalent to homozygosity, and the comparison of PTS within and between populations would be equivalent to Wright’s fixation indices. PTS can therefore be viewed as a simple extension of fixation indices to the case where multigene family members of unknown allelic status have been collected (38, 43). A second mode at complete overlap emerges in the PTS distribution when the perturbed system approaches the transition regime (Fig. 3D). Concomitant with the appearance of the high overlap mode, the low end of the PTS distribution shifts towards zero or extremely low values (Fig. 3D, figs. S6–7) due to reduced outcrossing. Outcrossing creates relatedness and increases similarity between parental and offspring strains. Because the frequency of outcrossing events scales with transmission intensity, intervention results in a decrease in outcrossing rates and an increase in clonal transmission of non-related strains. Therefore, the PTS distribution is pulled into the two opposite extremes of complete and no, or extremely low, overlap. The emergence of this second mode at complete overlap is however sensitive to parameter specification and measurement error. It specifically becomes less apparent when the circulating diversity is the highest among all the simulated scenarios (Materials and Methods), effectively disappearing when we do not completely sample all the var genes constituting infections (figs. S6–7). This happens because this incipient appearance of clonality involves too few parasites in the population, a result of intervening in a high-transmission endemic system where the original diversity is high. Any subset of strains from this original diversity which survived intervention, will likely be dissimilar from each other. Unless transmission remains sufficiently low with importation of var gene diversity curtailed so that clonal transmission follows a bottleneck in strain diversity, the second mode in the PTS distribution will be insignificant. In addition, this second mode of complete overlap becomes more diffused under measurement error: two identical strains will have a high but uncertain PTS value within some range, depending on which subset of their var genes are sequenced and typed. Thus, the appearance of the second mode cannot be reliably used for empirical surveillance.
Instead of the population-wise PTS, we then examine the age group differences at the low-end of the PTS distributions. A previous study has shown age-dependent patterns of PTS in empirical data, with infections in children (1–10 years old) exhibiting more related var repertoires compared to adults (>20 years old), consistent with the effect of immune selection (negative frequency-dependent selection) (24). We therefore compare these distributions in children and adults from the simulation output across the series of sustained transmission-reducing interventions. We find a reduced difference as the system moves from the fast-rebound regime toward the transition one. Specifically, the PTS distribution within children manifests a mode around zero or extremely low overlap, which was originally present in the PTS distribution of adults. In other words, the PTS distribution between the two age groups becomes more similar, overlapping at the lowest mode when the system approaches and enters the transition regime (Fig. 4A, figs. S8–21A). This behavior is robust to sampling schemes simulating the collection of field data (Materials and Methods).
Fig. 4. Molecular indicators of the proximity of the perturbed local system to the transition regime for a seasonal regionally-open system with a medium-size pool and a baseline migration rate (scenario VII in Fig. 1B).
(A) As the system approaches the transition regime, the difference between PTS distributions across age groups is significantly reduced and even disappears. In particular, the two distributions almost completely overlap at the lowest mode around 0. The x-axis range for inset figures is 0–0.1. (B) MOI distribution starts to center around 1 and 2 with the majority of infections being either mono-clonal or multi-genomic with two genetically distinct parasites. These two molecular indicators are robust under implemented sampling schemes and sampling limitations representative of those encountered in the collection of field data (figs. S8). Molecular indicators for regionally-open systems with other setups or closed and semi-open systems can be found in figs. S9–21.
A second candidate for an indicator quantity is the distribution of the multiplicity of infection (MOI), defined as the number of genetically distinct parasite strains co-infecting a single human host (33, 44, 45). MOI, also referred to as complexity of infection (COI), is one of the most frequently reported proxies for parasite transmission (46, 47). Although a variety of statistical approaches have been developed that rely on various sources of genotypic data, the estimation of MOI remains challenging for high-transmission endemic settings where individuals typically carry multiple co-occurring infections (i.e., MOI > 1) (48). The low to non-existent overlap of var repertoires enables estimation of MOI on the basis of the number of var genes sequenced from an individual’s isolate (24, 35). This so-called “varcoding” method has been shown to provide accurate estimation in numerically simulated infections, and to perform better when transmission is high than a previous method based on a collection of neutral single nucleotide polymorphism loci (SNPs) (48). Having extended this approach to a Bayesian framework to account for measurement error in the sequencing (21), we apply the approach here to derive MOI estimates at the population level across the series of IRS interventions. We find that the MOI distribution becomes increasingly centered at low values around one and two, as the system approaches the transition regime (Fig. 4B, figs. S8–21B). Specifically, whereas for the fast-rebound regime the majority of infections remain multi-genomic (MOI > 1), the fraction of multi-genomic infections decreases significantly when the intervened system approaches the transition regime. For the transition and slow-rebound regime, the majority of infections become mono-genomic (MOI = 1). This indicator is also robust under sampling schemes simulating the collection of field data (Materials and Methods).
We apply next the above indicators to an interrupted time-series study involving three age-stratified cross-sectional surveys undertaken at the end of the wet/high-transmission season from Bongo District in northern Ghana, respectively before, during, and immediately after the transient IRS intervention (Fig. 1E) (21, 35). Following IRS, the difference in the PTS distribution across the two age groups is reduced, almost completely overlapping at the lowest mode (Fig. 5A), and the MOI distribution shifts towards a high proportion of mono-genomic infections or multi-genomic ones with only two genetically distinct parasites (Fig. 5B). These patterns suggest that intervention had pushed the empirical system near the transition regime. Similar patterns are found for the survey immediately after the IRS intervention (termed as the post-IRS phase, Fig. 1E), suggesting the impact of IRS persisted through the wet/high-transmission season after it was discontinued.
Fig. 5. The two molecular indicators applied to the field data from northern Ghana before, during, and immediately after the three-round transient IRS intervention, indicate proximity to the transition region where rebound prevalence decreases sharply with intervention intensity.
(A) The PTS distribution between the two age groups becomes more similar, especially for its lowest mode, which is almost completely overlapping. The x-axis range for inset figures is 0– 0.075. (B) The MOI distribution exhibits its main mode at one, with the majority of infections becoming either mono-clonal or multi-genomic with two genetically distinct parasites. Those patterns persisted during the year right after when the IRS was discontinued (the post-IRS panel), although to a slightly lesser degree.
Because the three-round IRS intervention was only implemented over a two-year window, we return to the computational model to examine a series of transient interventions of this length across the different spatial configurations (Fig. 1C–D, Materials and Methods). Overall, the simulated systems are highly resilient to transient interventions, especially when embedded within a regional pool. After interventions are discontinued, regionally-open systems always rebound rapidly in prevalence back to pre-IRS levels, regardless of IRS coverage and configuration of their regional pool (Fig. 6A, figs. S22). The majority of closed and semi-open systems rebound rapidly as well. However, in a few instances, prevalence remains confined to intermediate or low prevalence levels requiring more than a decade to fully rebound (Fig. 6A, figs. S22). In these, IRS coverage had been at the highest end and sufficiently strong to bring prevalence levels and parasite population sizes close to zero (Fig. 6A, figs. S22), with a low var diversity during IRS (Fig. 6C).
Fig. 6. The rebound dynamics in prevalence across time, under a series of transient interventions (2 years); the diversity of migrant genomes and the degree of novelty they introduce to the local parasite population vs. rebound dynamics to transient intervention for a seasonal regionally-open system; the parasite genetic structure of closed systems in the fast rebound, transition, and slow rebound regimes to transient intervention.
(A): For closed and semi-open systems, the dynamics can be categorized into three regimes described respectively as fast rebound, transition, and slow rebound, depending on the behavior of prevalence over time. For regionally-open systems, the dynamics are always those of fast rebound kind. Such resilience is enabled primarily by the effective importation of novelty post interventions. The top panel from left to right: seasonal and closed systems (scenario I in Fig. 1B), seasonal and semi-open systems with a baseline migration rate (scenario III in Fig. 1B), seasonal and regionally-open systems with a medium-size pool and a baseline migration rate (scenario VII in Fig. 1B). The bottom panel from left to right: non-seasonal systems, corresponding to scenario II, V, XI in Fig. 1B. Simulation runs with other parameter values are shown in figs. S22. (B): The rebound dynamics of regionally-open systems with a medium-size pool to transient (2 years) and highest-coverage interventions. These systems receive migration from either a dynamic regional pool with regular substitution of common genes in that pool, hence effective importation of novelty via migration events, or a static regional pool which does not update its gene composition. Fluctuating lines correspond to seasonal transmission whereas the relatively non-fluctuating lines represent constant transmission. Dashed lines correspond to dynamic regional pools whereas solid lines correspond to static ones. The left panel assumes a baseline migration rate whereas the right panel assumes a migration rate that is an order higher than the baseline one (Materials and Methods). Rapid rebound occurs when the pool is dynamic, and the importation of novelty is effective. (C): The frequency of genes circulating in the host population varies as a function of time, from pre-, to during, to post-intervention. Each color in the stacked area plots refers to a different gene. Shown here are two instances of seasonal and closed systems under transient interventions (scenario I in Fig. 1B). The left panel corresponds to a perturbed system which demonstrates a fast rebound dynamics (circulating genes are plentiful and typically exhibit low frequency), whereas the right panel corresponds to a system which demonstrates a slow-rebound dynamics (circulating genes are low in numbers with each at high frequency). (D): When the system rebounds from a population bottleneck induced by transient interventions (2 years), in the fast-rebound regime, clonal expansions are limited and circulating strains are maintained to be dissimilar from each other, measured by PTS distributions. The representative four quantiles of these distributions are at low values, indicating that strains sampled at the same times and those between present and past (diagonal versus off-diagonal values, respectively), are dissimilar. In contrast, circulating strains are highly similar in the slow-rebound regime. Clonal expansion dominates.
We further investigate why the regionally-open systems are highly resilient to transient intervention. Because antigenic diversity facilitates immune evasion and further infection, we address the question from the perspective of change in var diversity and associated strain structure by examining how the diversity of migrant genomes and the degree of novelty they introduce to the local parasite population impact the resilience of a regionally-open system. We specifically implement the following two scenarios: a high migration rate per se, and one that is additionally accompanied by a highly diverse gene composition of migrant genomes. We consider for this purpose a static regional pool of medium size (lower than the highest value considered for high-transmission endemic regions) that does not update its genes, versus a dynamic one of the same size that does so continuously. Migrant genomes drawn from this static regional pool necessarily import the same set of var genes repeatedly to the local parasite population, and therefore hosts typically develop more complete immunity against their products and the system rebounds in prevalence, although not rapidly. In contrast, migrant genomes drawn from the dynamic regional pool import new var genes to which hosts are susceptible, and the system rebounds in prevalence rapidly (Fig. 6B). The transient intervention considered here is among the highest-coverage ones, reducing prevalence and parasite population size close to zero, and var diversity to very low values. We do not repeat this analysis for regional pools of the largest size because they necessarily entail migrant genomes introducing a high degree of novelty to the local parasite population, regardless of the rate of update of their gene composition. Regionally-open systems are highly resilient to transient intervention because migrant genomes effectively introduce novelty and help the recovery in var diversity once control is discontinued.
In contrast, closed and semi-open systems can lose their resilience under transient high-coverage intervention. We examine how the disruption of parasite genetic structure typical of high transmission underlies this loss of resilience. For closed and semi-open systems that exhibit a fast rebound, their parasite populations remain large and retain var gene diversity characteristic of high-transmission regions during transient intervention (Fig. 6C–D). After control is discontinued, surviving parasite populations expand, and the degree of clonal transmission is low. Circulating genes are plentiful and typically exhibit low frequency (Fig. 6C), and strains are dissimilar from one another as reflected in PTS distributions. These distributions capture the degree of overlap of isolates that are circulating within the same time period as well as those in the past and present (diagonal and off-diagonal values, respectively, in Fig. 6D). Their four quantiles systematically remain low. Hosts’ exposure to dissimilar strains facilitates immune evasion and further infection, which in turn promotes the generation and maintenance of var gene diversity.
For closed and semi-open systems that exhibit a transition or slow rebound, their parasite populations approach zero with low var diversity during transient intervention (Fig. 6C). After control is discontinued, small surviving parasite populations expand, resulting in a high degree of clonal transmission. Circulating genes are low in numbers with each at high frequency (Fig. 6C) and strains are similar to one another (Fig. 6D). Hosts’ exposure to highly similar strains throughout time compromises immune evasion and further infection, which in turn reduces the generation and maintenance of var gene diversity. In our simulations, such systems require more than a decade to recover their prevalence fully.
DISCUSSION
We have described a non-linear and sharp response to the intensity of a sustained transmission-reducing intervention, with an agent-based model that explicitly tracks specific immune memory to strain variation encoded by a hyper-variable multigene family. The perturbed transmission system experiences a threshold behavior, losing its resilience when pushed into a narrow transition region toward a regime where prevalence rebounds only slowly with an associated extinction risk of the parasite. By examining the relationship between the loss of var diversity and the disruption in the strain structure of limiting overlap, we have identified molecular indicators providing information on the approach of the perturbed transmission system to the transition between a fast and slow rebound to high prevalence. These indicators rely on sampling following intervention, effectively complementing the monitoring of the transmission system through epidemiological quantities over time. They can inform timely adjustment of intervention efforts, to push the system into the transition regime and beyond, into the slow-rebound behavior enhancing local parasite extinction, to do so more rapidly than by observing the temporal trajectory of prevalence itself.
The response behavior we have described involves a clear coupling between the rebound dynamics in prevalence and those of var gene diversity. In the fast-rebound regime, the heterogeneity of hosts’ immunity profiles and large surviving strain diversity underlie a heterogeneous distribution of infection duration. Immunity loss increases hosts’ susceptibility to infection and contributes to further lengthening infection duration. Selection for infections with longer duration in response to reduced transmission, effectively compensates for this reduction. In contrast, in the slow-rebound regime, few infections can survive for sufficiently long times to withstand the waiting time between consecutive bite events, resulting in a low surviving diversity and population size of the parasite. Selection for infections with longer duration can no longer effectively compensate for the reduction in transmission.
We illustrated the application of the identified indicators to field data from Bongo District in northern Ghana (21). Their patterns suggest that the transmission system had been brought close to transition by the two-year transient intervention, and that therefore sustaining and intensifying these efforts would have been worthwhile.
Similar patterns for the two molecular indicators of approaching the transition phase persisted through the wet/high-transmission season after the three-round transient IRS was discontinued. However, prevalence and the population genetics of the var multigene family rebounded rapidly, already returning to their pre-IRS characteristics when surveyed 32-month after IRS was discontinued (21). During the transient intervention, despite a considerable reduction in parasite prevalence (by ~40–50% compared to the pre-IRS level), estimates of parasite population size remained large and var population genetics showed persistent characteristics of high transmission regions (21). These characteristics would provide a large enough founding population to effectively maintain a low degree of clonal transmission after IRS was discontinued. The local parasite population was likely embedded in a large regional pool, which enables the maintenance and rapid recovery of var diversity. Resulting immune evasion and associated transmission would have further promoted the generation and maintenance of diversity, enabling a positive feedback between transmission and diversity which has been previously described in two other malaria models (49, 50). The relationship between the response of the transmission system described here in a stochastic individual-based model and the tipping point behavior identified in a deterministic compartmental population model of malaria (50) should be investigated.
Although we relied initially on relatively long interventions (i.e., a decade), we also examined more transient ones, which underscored the fundamental importance of spatial context through the importation of var gene diversity. Although regionally-open systems not surprisingly always exhibit a fast-rebound dynamics in prevalence regardless of intervention coverage, semi-open and closed systems can experience the full range of possible temporal trajectories. The positive feedback described above should remain highly efficient for regionally-open systems, or closed and semi-open systems with fast resurgence once transient intervention is discontinued. In this case, surviving parasite populations can rapidly expand, yet the degree of clonal transmission remains low. Circulating var diversity remains high and strains are dissimilar from one another despite a population bottleneck. In contrast, the positive feedback is weak for closed and semi-open systems which enter the transition and slow-rebound regimes. After control is discontinued, clonal expansion dominates, resulting in the circulation of highly similar strains. Thus, as for intervention in other transmission settings, coordinated regional efforts and information on sources and sinks are clearly important. Ways to infer spatial context and the importation of var gene diversity from genomic data should be investigated.
Our computational model did not consider generalized (VSA-transcending) immunity or functional divergence between genes (4, 51, 52, 53, 54, 55, 56, 57, 58, 59, 59, 60, 61). Future work will examine the performance of the identified indicators under further aspects of immunity and incorporate different var gene groups. We also used genetic variation as a proxy for antigenic variation (38) because a genotype to phenotype map is not available for var genes. Future work should examine ways to derive aspects of the map from a variety of perspectives, including bioinformatics, the fitting of ABMs, and protein structure prediction based on deep learning. For regionally-open systems, our simulation considers a local parasite population embedded within a regional pool providing the source of migrant genomes and does not represent metapopulation dynamics explicitly. An explicit simulation of multiple parasite populations in our ABM is computationally expensive. An open population with continuous migration from a regional pool is an alternative typical setting of many assembly models in community ecology.
The World Health Organization has recently underscored the importance of surveillance for malaria intervention (62, 63). Molecular epidemiology provides approaches to complement traditional surveillance, with methods best suited for moderate- to low-transmission intensities (64, 65, 66). Deep sampling of sequences of var genes in host population provides a basis to complement these efforts for high transmission, with a focus on the high diversity that characterizes the var multigene family in these regions. Our results establish a link between the population dynamics of the disease and observations of parasite population genomics from the perspective of hyper-variable antigen-encoding genes. They indicate that the structure of diversity revealed via molecular surveillance can inform intervention against malaria in high-transmission endemic settings. These findings should enhance our understanding of transmission dynamics where large standing pathogen diversity represents a major challenge to control efforts.
MATERIALS AND METHODS
Stochastic agent-based model
We extend a previous computational model (29) whose key assumptions and processes are described in supplementary Materials and Methods. Model parameters and symbols are listed and explained in Data file S1. The model is an agent-based (individual-based), discrete-event, and continuous-time stochastic system in which all known possible future events are stored in a single event queue along with their putative times, which may be fixed or drawn from a probability distribution with a certain rate. We choose values for the rates of these events based on the literature of malaria epidemiology, related field studies, and in vitro or in vivo values (see supplementary Materials and Methods and Data file S1 for further details). When an event happens, it can cause the addition or removal of future events in the queue, or the modification of their rates, resulting in a recalculation of putative times. This approach is implemented with the next-reaction method (67), which optimizes the Gillespie first-reaction method (68).
Individual infections and the immune history of individual human hosts are tracked, and evolutionary mechanisms including mitotic/ectopic recombination and mutation are modeled explicitly. We simulate a local host population of a given size which can assume different spatial configurations (see section “Seasonality and spatial configuration of the local transmission system” below). At the beginning of each simulation, a small number of hosts are randomly selected and infected with distinct parasite genomes from the random assembly of var genes from a “regional” pool to initialize local transmission.
Each parasite genome is represented as a specific combination of 45 (non-upsA) var genes, and each var gene is considered a linear combination of two epitopes (alleles) based on the empirical description of two hypervariable regions in the var tag region of the DBLα domain (54). Mosquito vectors are not explicitly represented as agents in the model. Instead, we consider an effective contact rate (hereafter, the transmission rate, which under some assumptions is effectively equivalent to vectorial capacity) which determines the times of local transmission events. Donor and recipient hosts are selected at random, and transmission occurs if the former carries active blood-stage infections, and the latter has not reached the carrying capacity of its liver-stage. We model ectopic recombination among genes within the same genome during the asexual stage, and meiotic recombination between strains during sexual replication inside the mosquito vector. The expression of genes in the repertoire is sequential and the infection ends when the whole repertoire is depleted. Hosts acquire transitory immune memory toward the product of expressed epitopes, and such memory precludes expression of such epitopes in future infections. The total duration of infection of a particular repertoire is therefore proportional to the number of unseen epitopes by the infected host across all individual var genes of a given repertoire. See supplementary Materials and Methods for further details of the agent-based model (ABM).
Extensions of the original computational model (29) implemented for this work concern the regionally-open spatial configuration, ectopic recombination events, and within-host dynamics. In particular, the regional pool for the regionally-open system now updates its genes at a rate reflecting the substitution rate of genes in individual local parasite populations within the region. Ectopic recombination events can now generate truly novel alleles instead of simply shuffling the alleles of the two parental genes. The current formulation of the model further allows ectopic recombination events to be associated with a load such that there is a certain probability of generating non-functional offspring genes which replace functional parental genes. These non-functional genes will not be expressed and hence reduce infection duration of repertoires within infected hosts.
Seasonality and spatial configuration of the local transmission system
We simulate both constant and seasonal transmission dynamics. We implemented seasonality by multiplying a scaling constant by a temporal vector of 360 days, containing the daily number of mosquitoes over a full year. In other words, the temporal vector represents the mosquito abundance, and the scaling constant encapsulates all other parameters involved in vectorial capacity. The product of both gives the effective contact rate. To obtain this temporal vector, (79) used a deterministic model for mosquito population dynamics from (80). The model was originally developed for Anopheles gambiae and consists of a set of ODEs describing the dynamics of 4 mosquito stages: eggs, larvae, pupae, and adults (80). Seasonality is implemented via density dependence at the egg and larva stages as a function of rainfall (availability of breeding sites). We adopt either constant effective contact rates or seasonal ones, which result in EIR or FOI values typical of high-transmission endemic regions (75) (see section “Repertoire transmission” and “Within-host dynamics” in supplementary Materials and Methods).
We simulate transmission dynamics for closed, semi-open, and regionally-open systems. For closed systems, after the initial seeding of local transmission from a pool of var genes, migration is discontinued. Thus, var diversity is generated intrinsically by the dynamics of the ABM for the local population. For semi-open systems, two individual populations coupled via migration are explicitly simulated. They represent empirical transmission systems which are mainly coupled to one or a few neighboring sites. Regionally-open systems represent a local transmission system that is connected with a few to many individual neighboring parasite populations within a broader regional scale. Instead of simulating those individual parasite populations explicitly, we let the focal population receive migration from a regional pool of var genes of a certain size. This regional pool acts as a proxy for regional parasite diversity, i.e., diversity of the aggregate of individual local parasite populations in the region. Because each parasite genome is a repertoire of a given number of var genes, migrant genomes are assembled from random sampling of var genes from the regional pool. Estimation of key parameters concerning the regional pool is described in the following sections.
The temporal course of a simulation and the series of IRS interventions
Each simulation follows either two or three stages (Fig. 1A, 1C): (i) a pre-IRS period during which the local parasite community is assembled and the transmission system reaches a (semi-) stationary state before the intervention; (ii) an IRS period of either 10 or 2 years during which transmission is decreased (referred to as sustained and transient IRS, respectively); (iii) a post-IRS period when transmission rates recover to pre-IRS levels (only applicable to the transient IRS simulation, since the sustained one extends for the full decade).
We simulate a series of IRS intervention efforts, which reduce transmission rates to levels that are log-linear along the gradient of transmission intensity (Fig. 1D). The lowest and highest coverage of the IRS intervention efforts correspond to a reduction of 20% and more than 90%, respectively.
The IRS intervention efforts are assumed to be regional for open systems. In empirical settings, regional intervention efforts imply that similar efforts are applied to both the local parasite population of interest and the neighboring parasite populations which exchange genomes with the local population via either short-distance or long-range dispersal of mosquitoes or human hosts. All these parasite populations then experience a similar level of reduction in transmission and consequently in their prevalence. As a result, during IRS, the importation of migrant genomes from these neighboring locations also decreases. To implement regional interventions for regionally-open systems, we let migration rates be proportional to the local transmission rate and local prevalence level, both being surrogates of transmission rates and prevalence levels of neighboring populations.
We assume the size of the regional pool remains unaffected during and post intervention because the aggregate of var genes across individual parasite populations should remain high, even when intervention is regional and of high-coverage. We do so because this work uses the simulated “regionally-open systems” to represent high-transmission endemic regions in empirical settings, where immigration is known to represent an important challenge to control and elimination efforts. Here, regional interventions rarely eliminate the majority of individual parasite populations.
Estimation of the baseline migration rate for the implementation of open systems
We rely on empirical data on var genes from Bongo District, a high-transmission region in northern Ghana to infer the rate of gene exchange between populations, and to obtain a reasonable estimate of migration rates to implement the open transmission systems. The fast-evolving var genes under immune selection provide a higher resolution than neutral loci for recent migration events. Following (81), we calculate Jost’s D (82), a measure of population divergence for highly diverse genetic markers, to quantify var gene differentiation within and between two proximal catchment areas (i.e., Vea/Gowrie and Soe) in Bongo District for which molecular sequences were previously obtained at the end of the wet/high-transmission season prior to the IRS intervention (21, 24). Furthermore, under the finite-island model with infinite alleles, the divergence between two populations, D, is proportional to the local innovation (primarily ectopic recombination) rate, and inversely proportional to the migration rate between the two populations. These relationships are intuitive because local innovation events generate new genes unique to the specific local population and hence promote genetic differentiation and isolation between populations, whereas migration events promote genetic exchange and hence reduce divergence between populations. We estimate m, the percentage of contacts leading to infection due to migration relative to local transmission events, by dividing the local innovation rate, i.e., the ectopic recombination rate of the var genes, by D. This approach was initially proposed for single-locus genes with multiple alleles (82). Here we apply it to the var multigene family, effectively treating it as if it were single-locus, and different variants of var as if they were different alleles of this locus. We consider a range of ectopic recombination rates representative of those in nature, although values of the ectopic recombination rate have been measured only in vitro (26). We consider the daily rate of ectopic recombination, and the percentage of migration rate per day relative to daily local transmission events. The estimated baseline migration rate is around 0.0026, which translates to a percentage of migrant contacts relative to local ones of around 0.26%. We simulate transmission dynamics for both semi-open and regionally-open systems with migration rates equal to, or an order of magnitude higher than, this estimated baseline.
Estimation of the baseline rate at which genes are substituted in the regional pool
We estimate a value for the rate at which genes in the regional pool are substituted. We refer to this rate as the substitution rate hereafter. As the regional pool represents an aggregate of diversity circulating in individual local populations within the region, this rate can be approximated by the substitution rate of genes in individual local populations. Specifically, we can view a regional pool as the aggregate of diversity of a certain number of equally sized individual parasite populations. Note that we consider regional pools consisting of diversity in the form of the effective number of equifrequent genes and we do not specify a frequency distribution of genes in these regional pools. The collection of this effective number of equifrequent genes, in the ideal scenario, would exploit hosts’ immune space equivalently relative to the collection of genes at their original “true” frequency. We use the inverse Simpson diversity index (or, one over the expected homozygosity) (82), , to determine the effective number of genes in individual local parasite populations of our simulated closed and semi-open systems, which is at the order of one to two thousand. Moreover, connected individual local parasite populations of these simulated semi-open systems share a significant fraction of their genes when ranked by frequency, around 50% or more. Therefore, we let the number of genes in the regional pool span from several thousand to over tens of thousands, to mimic from a few to about ten to twenty individual local parasite populations. We specifically consider two setups for the regional pool representing roughly the two extremes, referred to as “medium” versus “large” pool scenarios respectively.
The substitution rate of common genes in a local population depends on the local innovation rate, i.e., the rate at which new variants are generated from ectopic recombination and mutation, and the fixation probability, or the invasion probability, of these new variants (84, 85). Fixation of a new gene variant in this context refers to a single copy of the variant present at a given time point having descendants in the population after a very large amount of time has passed, which typically entails that the frequency of the new variant reaches a certain threshold and becomes sufficiently common so that it is much less susceptible to genetic drift. When the system is at, or close to, a stationary state, the fixation of a new gene variant would imply the loss or replacement of an older common gene variant, hence we term this process and its rate as substitution of genes and substitution rate respectively. The rate of ectopic recombination and mutation has been measured in vitro (26). The magnitude of the fixation or invasion probability can be derived on the basis of population genetic models and the simulation output from our ABM. From the two together, we can obtain an expectation for the substitution rate as follows.
We write the invasion probability of new variants, or genes, or alleles in the system. At the stationary state, malaria transmission can be described by birth-death processes according to a Moran model with selection. The invasion probability of a low frequency variant is determined by its fitness advantage relative to that of others (84).
where denotes the number of copies of a new gene, , the parasite population size, and , respectively the fitness of the low-frequency gene versus that of other genes circulating in the population. Conventionally, the fitness of other genes is normalized to be 1, and that of the new gene is denoted as . We assume that there is a large enough number of alleles so that any mutation or ectopic recombination would lead to a different allele, so that the probability of back mutation to the original allele would be low enough to be negligible. Hence, new variants, generated via mutation or ectopic recombination events, always start with a single copy and n = 1. We consider parasite populations in high-transmission endemic regions, hence . The above equation simplifies to:
As described in (85), in the context of malaria transmission, the fitness of a gene is essentially given by its reproductive number, i.e., the number of offspring genes it produces, . The reproductive number depends on the transmission rate (), the transmissibility of the given gene (), and the typical infection duration of parasites that carry the given gene.
We consider the infection duration of a strain constituted by “average” genes versus the infection duration of a strain constituted by 1 new gene and g-1 average genes where g is the genome size (i.e., the number of var genes per individual parasite or repertoire). An average gene is a conceptual entity, and the proportion of susceptible hosts to an average gene is the average of the proportion of susceptible hosts to all circulating genes. Because transmission success of a new gene depends on the total infection duration of the strain that carries it, its invasion probability depends on the overall ensemble of genes. We write here the expected invasion probability for a new gene across the different genome backgrounds it may arise in, by considering that this background consists of average genes. Since the proportion of susceptible hosts to any new gene is 1, we have the following:
Note that the expression of derived here is slightly different from, but more rigorous than, the one in (85). The two expressions are close under certain conditions, for example, when g is large.
The values for typical of high-transmission endemic regions are 0.3–0.4 for the low end and around 0.65–0.8 for the high end, based on our ABM simulation outputs. We can use these ranges of to calculate the invasion probability of a new gene in an individual local parasite population, which together with the innovation rate (primarily through ectopic recombination) and the number of individual local parasite populations (a few for “medium” pools and more than 10 for “large” pools) gives the rate at which common genes are substituted in a regional pool with its specific configuration.
For details on the parameterization of the ABM, see supplementary Materials and Methods (the section “Selection of values for other parameters”) and Data file S1.
Sampling schemes and measurement error of empirical data
Empirical surveys are carried out at a specific sampling depth and under sparse sampling schemes. When sampling individual hosts and their var types in our simulations, we follow the sampling depth implemented in the empirical surveys from Bongo District, namely around 15% of the total human population size (21). We further set the sampling period to be once a year for both constant and seasonal transmission (with the former occurring at the end of the year and the latter occurring at the end of the wet/high-transmission season), to emulate the field sampling of hosts primarily at that time of the year, although a few additional surveys were also conducted at the end of the dry (low transmission) season as a baseline for comparison.
The collection of empirical data includes a measurement error, specifically concerning the distribution of the number of non-upsA DBLα sequenced and typed per monoclonal infection, accounting for certain potential sampling errors or imperfect detection of var genes (48, 21). We incorporate this measurement error when sampling from the simulation output by sub-sampling the number of var genes per strain based on this empirical distribution.
Moreover, only individuals with microscopy-positive P. falciparum infections (i.e., isolate) were included for sequencing of var genes in the empirical surveys from Bongo District. A subset of the longitudinal surveys from Bongo District includes in addition the submicroscopic infections detected by the more sensitive method of PCR (the 18S rRNA PCR), which resulted in a considerably higher fraction (35). Using surveys for which both microscopy and PCR detection were used, we estimated a conversion factor between the proportion of hosts that are microscopy-positive and those that are PCR-positive of 57%. To address the robustness of our results on molecular indicators to such missing data, we repeated the analysis for 57% of all infected hosts originally sampled at the 15% depth.
Molecular indicators
A 96% DNA sequence identity cutoff was used to define different var gene types (38). Sequence reads from either the same or different isolates with ≤ 4% sequence differences were grouped into a single consensus or a single type (21). We consider the following quantities computed on the basis of var gene types and evaluate how these relate to the rebound dynamics of prevalence across the series of simulations for the sustained interventions.
Pairwise type sharing (PTS)
The pairwise type sharing (PTS) index describes the degree of shared DBLα types between any two isolates, and is analogous to the Jaccard and Sørensen indices in Ecology (38, 41, 42). We use a directional version of this statistic (29) defined as and where is the total number of types shared by isolate and isolate , and and are the number of unique types in isolate and respectively. This directionality in the formula means that the sharing of a given number of types is relative to the length of the isolate under consideration. A PTS score ranges between 0 and 1, where a score of 0 signifies no shared DBLα types, and a score of 1 indicates complete overlap, between the two isolates. We consider several representative quantiles of the PTS distribution (0.01, 0.31, 0.6, and 0.9) to examine the similarity of isolates that are circulating in the population at the same time or across different times. For comparison of PTS distributions across age groups, we limit the calculation to within the same MOI groups and then aggregate across all MOI values to obtain the final distribution. In other words, within each age group, we compute the PTS between individuals of MOI equal to 1, 2, and so on, and then pool the resulting PTS values together to obtain the final PTS distribution. We do so to minimize the effect of different MOI values on PTS.
Multiplicity of infection (MOI)
Because Plasmodium parasites reproduce asexually as haploid stages within human hosts, signatures of polymorphic genotypes are evidence of multiclonal infection. While any polymorphic marker should thus be suitable in theory for estimating MOI, the typically high number of multiclonal infections in high-transmission endemic regions limits our ability to accurately do so in practice. This is mainly because the diversity of the polymorphic markers can be limited, with two parasites sharing the same marker but carrying distinct sets of antigen-encoding genes. The var multigene family provides one solution. Because of the large number of different genes in local populations and the effect of immune selection (negative frequency-dependent selection), repertoires of var genes in individual genomes exhibit very limited overlap (23, 29). As a result, a constant repertoire size or number of non-upsA DBLα types in a parasite genome can be used to convert the number of types sequenced in an isolate to an estimate of MOI (24, 35). In its simplest form of considering a typical repertoire length per parasite, the approach neglects the measurement error introduced by targeted PCR and amplicon sequencing of var genes in an infection. Recently, the method was extended to a Bayesian formulation that does consider this error and provides a posterior distribution of different MOI values for each sampled individual (21). We have previously documented the major steps of the Bayesian approach, compared two ways of obtaining a MOI distribution at the population level (by either pooling the maximum a posteriori MOI estimates or calculating a mixture distribution) (21), and examined the impact of different priors on the final MOI distribution at the population level. Here, we obtain the estimated MOI distribution at the population level from pooling the maximum a posteriori MOI estimates and from using a uniform prior for individuals.
Due to the commonness of multi-genomic infections in high-transmission regions, host prevalence levels are not a good indicator of parasite population size. Summing MOI estimates from this improved “varcoding” approach over sampled individuals with infections, enables the calculation of census population size for the parasite or the total number of infections of relevance to transmission events (21).
Supplementary Material
Acknowledgments:
We wish to thank the participants, communities, and the Ghana Health Service in Bongo District, Ghana for their willingness to participate in the study of empirical data. We would like to thank the field teams in Bongo for their technical assistance in the field, as well as the laboratory personnel at the Navrongo Health Research Centre for their expertise and for undertaking the sample collections and parasitological assessments.
Funding:
The joint NIH-NSF-NIFA Ecology and Evolution of Infectious Disease award R01-TW009670 (KPD, MP)
The joint NIH-NSF-NIFA Ecology and Evolution of Infectious Disease award R01-AI149779 (KPD, MP)
Footnotes
Competing interests: Authors declare that they have no competing interests.
Data and materials availability:
The sequences utilized in this study are publicly available in GenBank under BioProject Number: PRJNA 396962. All additional data associated with this study are available in the main text, the supplementary information, or upon reasonable request to the authors. For information on the simulation code and analysis scripts, please see https://github.com/qzhan321/Intervention.
References and Notes
- 1.World Malaria Report 2022. World Health Organization, Geneva, Switzerland: (2022). [Google Scholar]
- 2.Cohen J. M., Smith D. L., Cotter C., Ward A., Yamey G., Sabot O. J., Moonen B., Malaria resurgence: a systematic review and assessment of its causes. Malar. J. 11, 122 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gogue C., Wagman J., Tynuv K., et al. , An observational analysis of the impact of indoor residual spraying in Northern, Upper East, and Upper West Regions of Ghana: 2014 through 2017. Malar J, 19(1), 242 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Doolan D. L., Dobaño C., Baird J. K., Acquired Immunity to Malaria. Clin. Microbiol. Rev. 22, 13–36 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Andolina C., Rek J. C., Briggs J., Okoth J., Musiime A., Ramjith J., et al. , Sources of persistent malaria transmission in a setting with effective malaria control in eastern Uganda: a longitudinal, observational cohort study. The Lancet. Infectious diseases, 21, 1568–1578 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lindblade K.A., Steinhardt L., Samuels A., Kachur S. P., Slutsker L., The silent threat: asymptomatic parasitemia and malaria transmission. Expert Rev Anti Infect Ther., 11(6), 623–639 (2013). [DOI] [PubMed] [Google Scholar]
- 7.Kennedy P. G. E., Rodgers J., Clinical and Neuropathogenetic Aspects of Human African Trypanosomiasis. Front. Immunol., 10(39) (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Gabutti G., Stefanati A., Kuhdari P., Epidemiology of Neisseria meningitidis infections: case distribution by age and relevance of carriage. J. Prev. Med. Hyg., 56(3) (2015). [PMC free article] [PubMed] [Google Scholar]
- 9.Guglielmone A. A., Epidemiology of babesiosis and anaplasmosis in South and Central America. Vet. Parasitol., 57, 109–119 (1995). [DOI] [PubMed] [Google Scholar]
- 10.Kocan K. M., de la Fuente J., Guglielmone A. A., Meléndez R. D., Antigens and alternatives for control of Anaplasma marginale infection in cattle. Clin. Microbiol. Rev., 16(4), 698–712 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Deitsch K. W., Lukehart S. A., Stringer J. R., Common strategies for antigenic variation by bacterial, fungal and protozoan pathogens. Nat. Rev. Microbiol. 7, 493–503 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhang X., Deitsch K. W.. The mystery of persistent, asymptomatic Plasmodium falciparum infections. Curr Opin Microbiol., 70 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Baruch D. I., Pasloske B. L., Singh H. B., Bi X., Ma X. C., Feldman M., Taraschi T. F., Howard R. J., Cloning the P. falciparum gene encoding PfEMP1, a malarial variant antigen and adherence receptor on the surface of parasitized human erythrocytes. Cell, 82(1), 77–87 (1995). [DOI] [PubMed] [Google Scholar]
- 14.Smith J. D., Chitnis C. E., Craig A. G., Roberts D. J., Hudson-Taylor D. E., Peterson D. S., Pinches R., Newbold C. I., Miller L. H., Switches in Expression of Plasmodium falciparum var Genes Correlate with Changes in Antigenic and Cytoadherent Phenotypes of Infected Erythrocytes. Cell, 82(1),101–110 (1995). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Su X. Z., Heatwole V. M., Wertheimer S. P., Guinet F., Herrfeldt J. A., Peterson D. S., Ravetch J. A., Wellems T. E., The large diverse gene family var encodes proteins involved in cytoadherence and antigenic variation of Plasmodium falciparum-infected erythrocytes. Cell, 82(1), 89–100 (1995). [DOI] [PubMed] [Google Scholar]
- 16.Chan J., Howell K. B., Reiling L., Ataide R., Mackintosh C. L., Fowkes F. J. I., et al. , Targets of antibodies against Plasmodium falciparum-infected erythrocytes in malaria immunity. Clin. Invest., 122(9), 3227–3238 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Chan J., Boyle M. J., Moore K. A., Reiling L., Lin Z., Hasang W., et al. , Antibody Targets on the Surface of Plasmodium falciparum-Infected Erythrocytes That Are Associated With Immunity to Severe Malaria in Young Children. The Journal of infectious diseases, 219(5), 819–828 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Travassos M. A., Niangaly A., Bailey J. A., Ouattara A., Coulibaly D., Lyke K. E., et al. , Children with cerebral malaria or severe malarial anaemia lack immunity to distinct variant surface antigen subsets. Scientific reports, 8(1), 6281 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chew M., Ye W., Omelianczyk R. I., Pasaje C. F., Hoo R., Chen Q., Niles J. C., Chen J., Preiser P., Selective expression of variant surface antigens enables Plasmodium falciparum to evade immune clearance in vivo. Nat. Commun., 13, 4067 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Deitsch K. W., Dzikowski R., Variant Gene Expression and Antigenic Variation by Malaria Parasites. Annu. Rev. Microbiol., 71(1), 625–641 (2017). [DOI] [PubMed] [Google Scholar]
- 21.Tiedje K. E., Zhan Q., Ruybal-Pésantez S., Tonkin-Hill G., He Q., Tan M. H., Argyropoulos D. C., Deed S. L., Ghansah A., Bangre O., Oduro A. R., Koram K. A., Pascual M., Day K. P., Measuring changes in Plasmodium falciparum census population size in response to sequential malaria control interventions. eLife, 12 (2023). [Google Scholar]
- 22.Ruybal-Pesántez S., Tiedje K. E., Tonkin-Hill G., Rask T. S., Kamya M. R., Greenhouse B., Dorsey G., Duffy M. F., Day K. P., Population genomics of virulence genes of Plasmodium falciparum in clinical isolates from Uganda. Sci. Rep., 7(11810) (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Day K. P., Artzy-Randrup Y., Tiedje K. E., Rougeron V., Chen D. S., Rask T. S., Rorick M. M., Migot-Nabias F., Deloron P., Luty A. J. F., Pascual M., Evidence of strain structure in Plasmodium falciparum var gene repertoires in children from Gabon, West Africa. Proc. Natl. Acad. Sci. U.S.A., 114(20), 4103–4111 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ruybal-Pesántez S., Tiedje K. E., Pilosof S., Tonkin-Hill G., He Q., Rask T. S., et al. , Age-specific patterns of DBLα var diversity can explain why residents of high malaria transmission areas remain susceptible to Plasmodium falciparum blood stage infection throughout life. International Journal for Parasitology, 52(11), 721–731 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Frank M., Kirkman L., Costantini D., Sanyal S., Lavazec C., Templeton T. J., Deitsch K. W., Frequent recombination events generate diversity within the multi-copy variant antigen gene families of Plasmodium falciparum. International journal for parasitology, 38(10), 1099–1109 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Claessens A., Hamilton W. L., Kekre M., Otto T. D., Faizullabhoy A., Rayner J. C., Kwiatkowski D., Generation of Antigenic Diversity in Plasmodium falciparum by Structured Rearrangement of Var Genes During Mitosis. PLoS. Genet., 10(12) (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Freitas-Junior L. H., Bottius E., Pirrit L. A., Deitsch K. W., Scheidig C., Guinet F., Nehrbass U., Wellems T. E., Scherf A., Frequent ectopic recombination of virulence factor genes in telomeric chromosome clusters of P. falciparum. Nature, 407(6807), 1018–1022 (2000). [DOI] [PubMed] [Google Scholar]
- 28.Bopp S. E. R., Manary M. J., Bright A. T., Johnston G. L., Dharia N. V., Luna F. L., et al. Mitotic Evolution of Plasmodium falciparum Shows a Stable Core Genome but Recombination in Antigen Families. PLoS Genet, 9(2) (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.He Q., Pilosof S., Tiedje K. E., Ruybal-Pesántez S., Artzy-Randrup Y., Baskerville E. B., Day K. P., Pascual M., Networks of genetic similarity reveal non-neutral processes shape strain structure in Plasmodium falciparum. Nat. Commun., 9, 1817 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tran T. M., Li S., Doumbo S., Doumtabe D., Huang C., Dia S., et al. , An intensive longitudinal cohort study of Malian children and adults reveals no evidence of acquired immunity to Plasmodium falciparum infection. Clinical infectious diseases: an official publication of the Infectious Diseases Society of America, 57(1), 40–47 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Portugal S., Drakesmith H., Mota M. M., Superinfection in malaria: Plasmodium shows its iron will. EMBO reports, 12(12), 1233–1242 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Nkhoma S. C., Trevino S. G., Gorena K. M., Nair S., Khoswe S., Jett C., Garcia R., Daniel B., Dia A., Terlouw D. J., Ward S. A., Anderson T. J. C., Cheeseman I. H., Co-transmission of Related Malaria Parasite Lineages Shapes Within-Host Parasite Diversity. Cell host & microbe, 27(1), 93–103 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Anderson T. J. C., Haubold B., Williams J. T., Estrada-Franco J. G., Richardson L., Mollinedo R., et al. , Microsatellite Markers Reveal a Spectrum of Population Structures in the Malaria Parasite Plasmodium falciparum. Molecular Biology and Evolution, 17(10), 1467–1482 (2000). [DOI] [PubMed] [Google Scholar]
- 34.Volkman S., Neafsey D., Schaffner S., et al. , Harnessing genomics and genome biology to understand malaria biology. Nat Rev Genet, 13, 315–328 (2012). [DOI] [PubMed] [Google Scholar]
- 35.Tiedje K. E., Oduro A. R., Bangre O., Amenga-Etego L., Dadzie S. K., Appawu M. A., et al. , Indoor residual spraying with a non-pyrethroid insecticide reduces the reservoir of Plasmodium falciparum in a high-transmission area in northern Ghana. PLOS Glob. Public Health, 2(5) (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Tessema S. K., Monk S. L., Schultz M. B., et al. , Phylogeography of var gene repertoires reveals fine-scale geospatial clustering of Plasmodium falciparum populations in a highly endemic area. Mol Ecol., 24(2):484–497 (2015). [DOI] [PubMed] [Google Scholar]
- 37.Chen D. S., Barry A. E., Leliwa-Sytek A., Smith T-A., Peterson I., Brown S. M., et al. , A Molecular Epidemiological Study of var Gene Diversity to Characterize the Reservoir of Plasmodium falciparum in Humans in Africa. PLoS ONE 6(2) (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Barry A. E., Leliwa-Sytek A., Tavul L., Imrie H., Migot-Nabias F., Brown S. M., McVean G. A. V., Day K. P., Population genomics of the immune evasion (var) genes of Plasmodium falciparum. PLoS pathogens, 3(3), 34 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Tan M. H., Shim H., Chan Y., Day K. P., Unravelling var complexity: Relationship between DBLα types and var genes in Plasmodium falciparum. Front. Parasitol., 1 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bousema T., Okell L., Felger I., Drakeley C., Asymptomatic malaria infections: detectability, transmissibility and public health relevance. Nature reviews. Microbiology, 12(12), 833–840 (2014). [DOI] [PubMed] [Google Scholar]
- 41.Sørensen T., A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons. Kongelige Danske Videnskabernes Selskab, 5(4): 1–34 (1948). [Google Scholar]
- 42.Dice L. R., “Measures of the Amount of Ecologic Association Between Species”. Ecology, 26(3), 297–302 (1945). [Google Scholar]
- 43.Wright S., “Genetical structure of populations”. Nature, 166(4215): 247–9, (1950). [DOI] [PubMed] [Google Scholar]
- 44.Paul R. E., Packer M. J., Walmsley M., Lagog M., Ranford-Cartwright L. C., Paru R., Day K. P., Mating patterns in malaria parasite populations of Papua New Guinea. Science, 269(5231), 1709–1711 (1995). [DOI] [PubMed] [Google Scholar]
- 45.Sumner K. M., Freedman E., Abel L., Obala A., Pence B. W., Wesolowski A., Meshnick S. R., Prudhomme-O’Meara W., Taylor S. M., Genotyping cognate Plasmodium falciparum in humans and mosquitoes to estimate onward transmission of asymptomatic infections. Nat. Commun., 12, 909 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sondo P., Derra K., Rouamba T., Nakanabo Diallo S., Taconet P., Kazienga A., Ilboudo H., Tahita M. C., Valéa I., Sorgho H., Lefèvre T., Tinto H., Determinants of Plasmodium falciparum multiplicity of infection and genetic diversity in Burkina Faso. Parasites Vectors, 13, 427 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Arnot D., Clone multiplicity of Plasmodium falciparum infections in individuals exposed to variable levels of disease transmission. Transactions of the Royal Society of Tropical Medicine and Hygiene, 92(6), 580–585 (1998). [DOI] [PubMed] [Google Scholar]
- 48.Labbé F., He Q., Zhan Q., Tiedje K. E., Argyropoulos D. C., Tan M. H., Ghansah A., Day K. P., Pascual M., Neutral vs. non-neutral genetic footprints of Plasmodium falciparum multiclonal infections. PLoS Comput. Biol., 19(1) (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Holding T., Valletta J. J., Recker M., Multiscale Immune Selection and the Transmission-Diversity Feedback in Antigenically Diverse Pathogen Systems. The American naturalist, 192(6), 189–201 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.de Roos A. M., He Q., Pascual M., An immune memory-structured SIS epidemiological model for hyperdiverse pathogens. Proc Natl Acad Sci USA, 120(45) (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Geffery G. M., Epidemiological significance of repeated infections with homologous and heterologous strains and species of Plasmodium. Bulletin of the World Health Organization, 35(6), 873–882 (1966). [PMC free article] [PubMed] [Google Scholar]
- 52.Boyd M. F., Kitchen S. F., On the heterologous value of acquired immunity to Plasmodium falciparum. National Malaria Society (U.S.), 4, 301–306 (1945). [PubMed] [Google Scholar]
- 53.Ghumra A., Semblat J., Ataide R., Kifude C., Adams Y., Claessens A., et al. , Induction of Strain-Transcending Antibodies Against Group A PfEMP1 Surface Antigens from Virulent Malaria Parasites. PLoS. Pathog., 8(4) (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Larremore D. B., Clauset A., Buckee C. O., A network approach to analyzing highly recombinant malaria parasite genes. PLoS computational biology, 9(10) (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Buckee C. O., Recker M., Evolution of the multi-domain structures of virulence genes in the human malaria parasite, Plasmodium falciparum. PLoS computational biology, 8(4) (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Bull P. C., Buckee C. O., Kyes S., Kortok M. M., Thathy V., Guyah B., Stoute J. A., Newbold C. I., Marsh K., Plasmodium falciparum antigenic variation. Mapping mosaic var gene sequences onto a network of shared, highly polymorphic sequence blocks. Molecular microbiology, 68(6), 1519–1534 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Claessens A., Adams Y., Ghumra A., Lindergard G., Buchan C. C., Andisi C., et al. , A subset of group A-like var genes encodes the malaria parasite ligands for binding to human brain endothelial cells, Proc. Natl. Acad. Sci. U.S.A., 109(26), 1772–1781 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Rottmann M., Lavstsen T., Mugasa J. P., Kaestli M., Jensen A. T. R., Müller D., Theander T., Beck H., Differential Expression of var Gene Groups Is Associated with Morbidity Caused by Plasmodium falciparum Infection in Tanzanian Children. Infection and immunity, 74(7), 3904–3911 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Kaestli M., Cockburn I. A., Cortés A., Baea K., Rowe J. A., Beck H., Virulence of malaria is associated with differential expression of Plasmodium falciparum var gene subgroups in a case-control study. The Journal of infectious diseases, 193(11), 1567–1574 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Rask T. S., Hansen D. A., Theander T. G., Pedersen A. G., Lavstsen T., Plasmodium falciparum Erythrocyte Membrane Protein 1 Diversity in Seven Genomes – Divide and Conquer. PLoS Comput. Biol., 6(9) (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Lavstsen T., Salanti A., Jensen A. T. et al. Sub-grouping of Plasmodium falciparum 3D7 var genes based on sequence analysis of coding and non-coding regions. Malar J 2(27) (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Kraemer S. M., Smith J. D.. Evidence for the importance of genetic structuring to the structural and functional specialization of the Plasmodium falciparum var gene family. Mol Microbiol. 50(5), 1527–1538 (2003). [DOI] [PubMed] [Google Scholar]
- 63.Lourenço C., Tatem A. J., Atkinson P. M., et al. Strengthening surveillance systems for malaria elimination: a global landscaping of system performance, 2015–2017. Malar J, 18(315) (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Fountain A., Ye Y., Roca-Feltrer A., et al. , Surveillance as a Core Intervention to Strengthen Malaria Control Programs in Moderate to High Transmission Settings. Am J Trop Med Hyg., 108(2_Suppl):8–13 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Taylor A. R., Jacob P. E., Neafsey D. E., Buckee C. O., Estimating Relatedness Between Malaria Parasites. Genetics, 212(4):1337–1351 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Chang H-H., Worby C. J., Yeka A., Nankabirwa J., Kamya M. R., Staedke S. G., et al. THE REAL McCOIL: A method for the concurrent estimation of the complexity of infection and SNP allele frequency for malaria parasites. PLoS Comput Biol, 13(1) (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wong W., Volkman S., Daniels R., Schaffner S., Sy M., Ndiaye Y. D., Badiane A. S., et al. RH: a genetic metric for measuring intrahost Plasmodium falciparum relatedness and distinguishing cotransmission from superinfection, PNAS Nexus, 1(4) (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Gibson M. A., Bruck J., Efficient Exact Stochastic Simulation of Chemical Systems with Many Species and Many Channels. The Journal of Physical Chemistry A, 104(9), 1876–1889 (2000). [Google Scholar]
- 69.Gillespie D. T., A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. Journal of Computational Physics, 22(4), 403–434 (1976). [Google Scholar]
- 70.Ghansah A., Tiedje K. E., Argyropoulos D. C., Onwona C. O., Deed S. L., Labbé F., Oduro A. R., Koram K. A., Pascual M., Day K. P., Comparison of molecular surveillance methods to assess changes in the population genetics of Plasmodium falciparum in high transmission. Front. Parasitol., 2 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Sondo P., Derra K., Lefevre T., Diallo-Nakanabo S., Tarnagda Z., Zampa O., Kazienga A., Valea I., Sorgho H., Ouedraogo J., Tinga Robert Guiguemde, Halidou Tinto, Genetically diverse Plasmodium falciparum infections, within-host competition and symptomatic malaria in humans. Sci. Rep., 9, 127 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Nkhoma S. C., Ahmed A. O. A., Porier D., Rashid S., Bradford R., Molestina R. E., Stedman T. T., Dynamics of parasite growth in genetically diverse Plasmodium falciparum isolates. Molecular and Biochemical Parasitology, 254, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Camponovo F., Lee T. E., Russell J. R., Burgert L., Gerardin J., Penny M. A., Mechanistic within-host models of the asexual Plasmodium falciparum infection: a review and analytical assessment. Malar. J., 20(309) (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Churcher T. S., Sinden R. E., Edwards N. J., Poulton I. D., Rampling T. W., Brock P. M., et al. , Probability of Transmission of Malaria from Mosquito to Human Is Regulated by Mosquito Parasite Density in Naïve and Vaccinated Hosts. PLoS. Pathog., 13(1) (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Smith D. L., Drakeley C. J., Chiyaka C., Hay S. I., A quantitative analysis of transmission efficiency versus intensity for malaria. Nat. Commun., 1(108) (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Drummond A., Silberg J. J., Meyer M. M., Wilke C. O., Arnold F. H., On the conservative nature of intragenic recombination. Proc. Natl. Acad. Sci. U.S.A., 102(15), 5380–5385 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Hermsen C. C., Telgt D. S. C., Linders E. H.P., van de Locht L. A. T. F., Eling W. M. C., Mensink E. J. B. M., Sauerwein R. W., Detection of Plasmodium falciparum malaria parasites in vivo by real-time quantitative PCR. Molecular and Biochemical Parasitology, 118(2), 247–251 (2001). [DOI] [PubMed] [Google Scholar]
- 78.Escalante A. A., Cepeda A. S., Pacheco M. A., Why Plasmodium vivax and Plasmodium falciparum are so different? A tale of two clades and their species diversities. Malar. J., 21(139) (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Pilosof S., He Q., Tiedje K. E., Ruybal-Pesántez S., Day K. P., Pascual M., Competition for hosts modulates vast antigenic diversity to generate persistent strain structure in Plasmodium falciparum. PLoS. Biol., 17(6) (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.White M. T., Griffin J. T., Churcher T. S., Ferguson N. M., Basáñez M., Ghani A. C., Modelling the impact of vector control interventions on Anopheles gambiae population dynamics. Parasites Vectors, 4(153) (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.He Q., Pilosof S., Tiedje K. E., Day K. P., Pascual M., Frequency-Dependent Competition Between Strains Imparts Persistence to Perturbations in a Model of Plasmodium falciparum Malaria Transmission. Front. Ecol. Evol., 9, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Jost L., G(ST) and its relatives do not measure differentiation. Molecular ecology, 17(18), 4015–4026 (2008). [DOI] [PubMed] [Google Scholar]
- 83.Simpson E. H., “Measurement of diversity”. Nature, 163(4148), 688 (1949). [Google Scholar]
- 84.Maruyama T., A Markov process of gene frequency change in a geographically structured population. Genetics, 76(2), 367–377 (1974). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Otto S. P., Whitlock M. C., The Probability of Fixation in Populations of Changing Size, Genetics, 146(2), 723–733 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.He Q., Pascual M., An antigenic diversification threshold for falciparum malaria transmission at high endemicity. PLoS. Comput. Biol., 17(2) (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Collins W. E., Skinner J. C., Jeffery G. M., Studies on the persistence of malarial antibody response. American journal of epidemiology, 87(3), 592–598 (1968). [DOI] [PubMed] [Google Scholar]
- 88.Collins W. E., Jeffery G. M., Skinner J. C., Fluorescent Antibody Studies in Human Malaria. II. Development and Persistence of Antibodies to Plasmodium falciparum. The American journal of tropical medicine and hygiene, 13, 256–260 (1964). [DOI] [PubMed] [Google Scholar]
- 89.Childs L. M., Baskerville E. B., Cobey S., Trade-offs in antibody repertoires to complex antigens. Philosophical Transactions of the Royal Society B: Biological Sciences, 370 (1676) (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The sequences utilized in this study are publicly available in GenBank under BioProject Number: PRJNA 396962. All additional data associated with this study are available in the main text, the supplementary information, or upon reasonable request to the authors. For information on the simulation code and analysis scripts, please see https://github.com/qzhan321/Intervention.






