ABSTRACT
The human malaria parasite Plasmodium falciparum replicates within circulating red blood cells, where it is subjected to conditions that frequently cause DNA damage. The repair of DNA double-stranded breaks (DSBs) is thought to rely almost exclusively on homologous recombination (HR), due to a lack of efficient nonhomologous end joining. However, given that the parasite is haploid during this stage of its life cycle, the mechanisms involved in maintaining genome stability are poorly understood. Of particular interest are the subtelomeric regions of the chromosomes, which contain the majority of the multicopy variant antigen-encoding genes responsible for virulence and disease severity. Here, we show that parasites utilize a competitive balance between de novo telomere addition, also called “telomere healing,” and HR to stabilize chromosome ends. Products of both repair pathways were observed in response to DSBs that occurred spontaneously during routine in vitro culture or resulted from experimentally induced DSBs, demonstrating that both pathways are active in repairing DSBs within subtelomeric regions and that the pathway utilized was determined by the DNA sequences immediately surrounding the break. In combination, these two repair pathways enable parasites to efficiently maintain chromosome stability while also contributing to the generation of genetic diversity.
KEYWORDS: Plasmodium falciparum, chromosome stability, de novo telomere addition, gene conversion, homologous recombination, telomere healing
IMPORTANCE
Malaria is a major global health threat, causing approximately 430,000 deaths annually. This mosquito-transmitted disease is caused by Plasmodium parasites, with infection with the species Plasmodium falciparum being the most lethal. Mechanisms underlying DNA repair and maintenance of genome integrity in P. falciparum are not well understood and represent a gap in our understanding of how parasites survive the hostile environment of their vertebrate and insect hosts. Our work examines DNA repair in real time by using single-molecule real-time (SMRT) sequencing focused on the subtelomeric regions of the genome that harbor the multicopy gene families important for virulence and the maintenance of infection. We show that parasites utilize two competing molecular mechanisms to repair double-strand breaks, homologous recombination and de novo telomere addition, with the pathway used being determined by the surrounding DNA sequence. In combination, these two pathways balance the need to maintain genome stability with the selective advantage of generating antigenic diversity.
INTRODUCTION
Malaria, a mosquito-transmitted infectious disease, continues to pose a major challenge for health care systems in many parts of the developing world (1). The disease is caused by various species of the Plasmodium genus of protozoan parasites, members of the Apicomplexa family of obligate parasites that also includes Toxoplasma, Babesia, Cryptosporidium, and Theileria species. The pathogenic stage of malaria occurs when parasites invade and replicate within the red blood cells (RBCs) of their vertebrate hosts, causing cycles of fever and chills as well as severe anemia. Plasmodium falciparum, the species responsible for the most virulent form of the disease, also causes infected RBCs to become cytoadhesive, resulting in sequestration within the postcapillary vasculature, leading to circulatory disruption and many of the severe symptoms of the disease, including cerebral malaria and complications during pregnancy (2). While multiplying within the RBCs, the parasites are haploid, replicating through a process called schizogony in which multiple rounds of nuclear division lead to a multinucleated cell containing 16 to 32 individual merozoites. These are released upon lysis of the host cell, enabling them to invade new RBCs and reinitiate the cycle. Throughout this process, parasites are confronted with numerous conditions that can cause DNA damage, including the products of metabolism and hemoglobin digestion, components of the host immune response, and oxidative damage resulting from exposure to antimalarial drugs (3–5). The success of these parasites therefore depends on their ability to efficiently repair DNA damage and maintain genome integrity.
The P. falciparum genome is arranged into 14 linear chromosomes constituting a total of approximately 23 Mb (6). The contents of each individual chromosome are organized similarly, with single-copy genes responsible for replication and progression through the cell cycle within the central regions and members of the large, multicopy, clonally variant gene families found in lengthy arrays within the subtelomeric domains. Flanking the variant genes are conserved telomere-associated repeat elements (TAREs), which typically extend for 10 to 30 kb, followed by the actual chromosome end, which consists of typical telomeric heptad repeats maintained by telomerase (Fig. 1A) (7). The subtelomeric domains of P. falciparum are of particular interest for understanding parasite biology, since the clonally variant gene families that reside there are the primary virulence determinants and their varied expression results in antigenic variation, the process that enables parasites to perpetuate long-term, chronic infections. These regions contain hundreds of genes from several gene families, such as var, stevor, rifin, Pfmc-2TM, FIKK and ACS, which are maintained in a unique chromatin structure that is marked by the histone modifications H3K9me3 and H3K36me3 (8–10). Within the nucleus, these regions cluster at the nuclear periphery, an arrangement that is thought to facilitate recombination between members of the multicopy gene families that reside on different chromosomes, thereby generating diversity (11, 12). Given the importance of these chromosomal regions for pathogenesis, understanding how they are replicated, maintained, and repaired is of considerable interest.
DNA double-strand breaks (DSBs) are an especially severe form of DNA damage that must be repaired for a cell to remain viable and to replicate. Eukaryotic cells typically rely on two distinct pathways to repair such breaks, homologous recombination (HR) and nonhomologous end joining (NHEJ). While NHEJ can ligate the broken ends of the DNA back together to efficiently repair a break, it often results in small deletions at the site of ligation and can therefore be mutagenic (13). HR is a more accurate method of repair since it copies information from homologous sequences elsewhere in the genome, typically the other allele in a diploid cell, thereby avoiding any insertion or deletion at the site of the break. HR, however, requires a homologous region of high sequence identity (typically greater than 98%) to serve as the template for repair (14). The asexual stages of P. falciparum represent a unique organism for the study of DSB repair, since they are haploid and thus generally lack homologous sequences to serve as the templates for HR, and yet they are missing the canonical NHEJ pathway. An alternative NHEJ pathway based on microhomology has been described, although it appears to be quite inefficient (15, 16). Therefore, how malaria parasites respond to DSBs and efficiently maintain genome integrity is unclear. Characterization of important enzymes involved in HR demonstrated that this pathway is functionally conserved (17–19). The subtelomeric regions, given their size, extensive gene content, semirepetitive nature, and unique chromatin structure, represent a particularly interesting region of the genome for studying DSB repair.
In addition to HR and NHEJ, eukaryotic organisms can also stabilize DSBs occurring at the chromosome ends through the action of telomerase, the enzyme that maintains the telomeric repeats at this specialized chromosomal location. This repair pathway, called de novo telomere addition, or “telomere healing,” involves the recruitment of the telomerase complex directly to the site of the break (20, 21). The DNA strand is resected until a region of high TG content is encountered, which is thought to “seed” telomerase and enable it to incorporate telomeric repeats at the site of a DNA break and resection (Fig. 1B). This results in the creation of a functional telomere, thereby stabilizing the chromosome and maintaining genome integrity, albeit with a deletion of the region of the chromosome between the DSB and the original telomere and loss of the intervening genetic information. Telomere healing is thought to compete with HR and NHEJ for repair of DSBs, with the pathway of repair depending on the chromosomal environment in which the break occurs (20). Telomere healing has been described for a number of eukaryotic organisms, and the addition of telomeric repeats to the ends of broken chromosomes has been described for P. falciparum (22–24), indicating that this mechanism of repair is conserved. However, the details of telomere healing in malaria parasites and how this pathway integrates with HR and alternative NHEJ have not been studied extensively.
Given the general chromosome structure of P. falciparum (Fig. 1A), telomere healing could repair DSBs that occur anywhere within the extensive subtelomeric regions between the chromosome ends and the internal, highly conserved regions of the genome. Genes within these subtelomeric regions are necessary for host-pathogen interactions but are not required for viability in cultured parasites; thus, deletions of subtelomeric regions are tolerated and can be efficiently recovered and analyzed. The semirepetitive nature of these regions suggests that DSBs could also potentially be repaired by HR, allowing the study of both pathways as they repair breaks within this chromosomal environment. Detailed sequence examination of subtelomeric domains has been difficult in the past due to the repetitive nature of the sequences, making assemblies of these regions from short sequence reads problematic. However, newer technologies, including those that utilize single-molecule real-time (SMRT) sequencing, enable the confident assembly of complete subtelomeric regions, allowing us to examine in detail DSB repair in these regions of the genome. We applied this technology to study the repair of spontaneous DSBs that occur within subtelomeric regions during in vitro culture as well as breaks induced randomly through exposure to ionizing radiation.
Our analysis indicated that both telomere healing and gene conversion through HR can repair DSBs within subtelomeric domains. Telomere healing was the most common type of repair we observed, and which pathway was utilized was strictly determined by the sequence surrounding the break point. Considering the repetitive nature of these chromosomal regions, subtelomeric deletions resulting from telomere healing events could be followed by subsequent gene conversion through HR, thus reestablishing the typical subtelomeric structure and maintaining the overall parasite’s chromosome organization. We conclude that both repair pathways therefore work in tandem to maintain genome integrity and preserve the complement of clonally variant genes found within the genomes of P. falciparum isolates.
RESULTS
Extensive telomere healing is observed in cultured parasites.
While telomere healing has been described for P. falciparum (22–24), how commonly it occurs within the 28 subtelomeric domains of the parasite’s genome has not been closely examined. Telomere healing has been most extensively studied in Saccharomyces cerevisiae, in which telomerase displays a strong preference for TG-rich sequences to initiate synthesis of repeats (25, 26). If P. falciparum telomerase has a similar preference for TG-rich sequences to initiate synthesis of repeats, then one is likely to observe telomeric repeats fused to the coding regions of truncated genes at the new chromosome ends, as coding regions are significantly enriched in Cs and Gs compared to noncoding regions of the genome.
As a first step to determine if telomere healing is common in cultured P. falciparum and to obtain a baseline for our experimental analysis, we analyzed the sequence of all 28 chromosome ends provided in the reference genome sequence of 3D7 (Plasmodb.org). Five chromosome ends display the hallmarks of telomere healing, including the absence of TAREs and telomeric repeat sequences fused directly into or next to protein coding regions. In four of these, on chromosomes 5 and 6 and both ends of chromosome 14, the telomeric repeats are found within the coding region of a truncated var gene, creating a pseudogene. In the fifth example on chromosome 11, the telomeric repeats are found immediately downstream of an intact var gene (Fig. 2A).
Next, we investigated if telomere healing events occurred in parasites cultivated in the laboratory; we generated several subclones from a stock of 3D7 parasites that had been grown in culture for several years in our laboratory and chose one for whole-genome sequencing. To determine the sequence of the chromosome ends and to avoid difficulties in assembling the repetitive structure of subtelomeric domains from the short sequence reads typically derived from most methods of high-throughput sequencing, we instead utilized SMRT sequencing, which can yield single contiguous reads of 10 to 20 kb (27, 28). Using this method, we verified the five telomere healing events recognized in the reference genome sequence and also identified two additional novel DNA sequences consistent with telomere healing (Fig. 2B; see also Fig. S1, S3, and S4 in the supplemental material). The first, on chromosome 2, resulted in the deletion of approximately 110 kb, including the entire subtelomeric domain and leading to the insertion of telomeric repeats within the lsap2 gene (PF3D7_0202100). The second occurred on chromosome 3 and similarly deleted all of the subtelomeric domain (~85 kb), inserting telomeric repeats just downstream of a gene encoding a putative alpha/beta-hydrolase (PF3D7_0301300). These two events provided us with a “before and after” picture of telomere healing and further indicated that this is a common mechanism of DSB repair within the subtelomeric regions of P. falciparum.
Inducement of DSBs and repair in cultured parasites by X-ray irradiation.
To more directly observe DSB repair within subtelomeric domains, we chose to induce random DSBs by exposing parasite cultures to X-ray. X-rays are known to cause DSBs without sequence bias, and such breaks must be repaired for parasite viability. Thus, by selecting for viable parasites after near-lethal exposure to X-rays, we hoped to detect and analyze additional examples of DSBs within subtelomeric regions. The subclone of 3D7 previously used for SMRT sequencing was exposed to increasing amounts of X-ray radiation to determine the level of exposure that would lead to significant widespread DNA damage yet allow for parasites to repair, recover, and propagate. Exposure to 100 Gy resulted in significant lethality; however, viable parasites grew from irradiated cultures within 10 to 12 days after exposure. Parasites were exposed to 100 Gy three times consecutively, allowing the parasites to recover normal growth after each irradiation. Subclones were then isolated by limiting dilution, and one clone was subjected to whole-genome sequencing using the SMRT methodology. Analysis of the genome sequence identified two additional examples of telomere healing, one on chromosome 1 that led to a deletion of ~90 kb and resulted in the insertion of telomeric repeats into a rifin gene (Pf3D7_0101900) and a second on chromosome 2 that led to deletion of ~100 kb and resulted in the insertion of telomeric repeats just downstream of the hypothetical gene Pf3D7_0221000 (Fig. 3A; Fig. S2, S5, and S6). This clone of 3D7 now carries significant truncations within 9 of its 28 subtelomeric regions.
In addition to the two new examples of telomere healing identified for this clone, we also detected three examples of recombination events that likely resulted from HR. One of the subtelomeric regions of chromosome 13 is a hybrid sequence in which the original subtelomeric region has been deleted and replaced by ~25,500 bp, including the telomeric repeats, from one of the subtelomeric regions of chromosome 9. The breakpoint of the recombination event occurred within the coding region of a var gene, creating a new var gene that is a chimera of PF3D7_0900100 and PF3D7_1300100 (Fig. 3B). A more complex product of recombination was identified near one end of chromosome 12. The first ~8,100 bp, including the telomeric repeats, was derived from one of the subtelomeric domains of chromosome 9 and, given that both ends of chromosome 9 remain unchanged, this appears to be the result of a gene conversion event. This fragment is fused to 13,886 bp of sequence identical to a region within one of the subtelomeric domains of chromosome 1. This sequence is within the portion of chromosome 1 that is now deleted in this clone, indicating that it was transposed into the subtelomeric region of chromosome 12 prior to its deletion from chromosome 1 (Fig. 3B). This could have resulted from either gene conversion or reciprocal recombination. These data indicate that DSBs that occur within subtelomeric domains can be repaired either by HR or telomeric healing and that the repair process can generate new var genes.
Sequence preference for insertion of telomeric repeats by telomerase.
A model for telomere healing in higher eukaryotic cells has been derived from extensive experiments conducted primarily in yeast. When a DSB forms within a subtelomeric region, the DNA is initially resected by one of two exonucleolytic pathways (Exo1 or Dna2/Sgs1), revealing a region of 3′ single-stranded DNA (29). The resection continues until a sequence is encountered that can anneal to the template region of telomerase RNA. This allows telomerase to begin synthesizing the telomeric repeats directly at the end of the chromosome, continuing for multiple rounds of DNA synthesis and resulting in a functional telomere that can stabilize the chromosome end (Fig. 4A). As predicted by this model, telomere healing is initiated at sites that display sequences complementary to the template region of telomerase RNA. In yeast, these sites almost always include GT, TG, or CG dinucleotides at the site where telomerase initiates synthesis of telomere repeats (25, 26). In addition, the flanking region also plays a role in determining where the new telomere is synthesized, presumably by influencing how efficiently telomerase is recruited to the chromosome end (30). In S. cerevisiae, stretches of TG repeats between 22 and 250 bp effectively recruit telomerase for healing, but stretches either longer or shorter are repaired much less efficiently (31–34). Proximal enhancer sequences that can function as binding sites for proteins that associate with the telomerase complex can also increase the efficiency of telomere healing (30). Sites of telomere healing in P. falciparum were previously identified within genes that are known to be commonly disrupted in cultured parasites, presumably because such deletion events provide a growth advantage in vitro (22, 24); the ability of P. falciparum telomerase obtained from nuclear extracts to extend from specific sequences in vitro was also investigated (23). The identification of 9 independent telomere healing events in our sequence data, including events that we induced with radiation exposure, allowed us to investigate in more detail the properties at the site of a DSB that contribute to telomere healing in P. falciparum and whether they differ from sites that are instead repaired by HR.
The sequence of the telomerase RNA template region for several Plasmodium species was predicted bioinformatically by Chakrabarti et al. (35). This template sequence is known to be species specific and, unlike the equivalent sequence in yeast, the P. falciparum sequence does not consist exclusively of As and Cs and instead includes two additional bases (UG) within the sequence (5′-ACCCUGAACCC-3′). Interestingly, given that the two major telomeric repeat sequences are 5′-TTCAGGG-3′ and 5′-TTTAGGG-3′, the G within the template sequence appears to specify either C or T. The sites where telomeric repeats were added to the chromosome ends for all nine telomere healing events identified in our sequence datasets are shown in Fig. 4B. As can be easily discerned, the sites where repair was initiated in all nine sequences displayed the ability to anneal to the telomerase template region at precisely the same position; thus, in all cases the first bases added to the new telomere were GGGTT. These data indicate that telomere healing in P. falciparum likely involves the same mechanism described for yeast, yet the sequence of the telomerase RNA template explains why the site of telomere repeat addition diverges from the GT, TG, or CG preference described for S. cerevisiae. Examination of the sequences upstream of the newly added telomere repeats did not identify any discernible motifs or compositional bias, indicating either that proximal enhancers like those identified in yeast do not exist in P. falciparum or that recruitment of the telomerase complex is not influenced by the primary sequence.
Choice of repair pathway: HR versus telomere healing.
In most extensively studied organisms, DSB repair in most regions of the genome results from competition between the two primary repair pathways, HR and NHEJ. Within subtelomeric regions of the genome, telomere healing can serve as a third potential repair pathway. The pathway that is ultimately chosen depends on multiple factors and is often species specific (20, 21). In yeast, the likelihood of telomere healing occurring at a specific break appears to depend on how efficiently the telomerase complex is recruited to the site of the break, which in turn depends on the DNA sequence immediately upstream of where the repair event occurs (20). Telomerase can then initiate the addition of telomeric repeats when 3 to 6 bp of DNA anneals to the telomerase RNA template (Fig. 4) (25). Telomerase has been shown to be recruited by either Cdc13 or the Ku70/80 complex (26, 36–38), and a proximal Cdc13 binding sequence was shown to greatly increase the likelihood of a healing event at a particular chromosomal position (30). Malaria parasites lack the above-mentioned DNA repair proteins; thus, what determines whether HR or telomere healing occurs at the site of a subtelomeric break is not known.
Previous work has shown that efficient HR within a nonsubtelomeric region of the genome requires near-complete sequence identity between the sequence surrounding the break and the template used for repair (15). If this property also applies to HR within subtelomeric regions, the use of HR might be limited only to breaks that occur within stretches of sequence that are duplicated with near-complete sequence identity elsewhere in the genome. To investigate this possibility, we identified the sequences immediately surrounding the break points of the three products of HR that we obtained from our irradiated clone and performed BLAST searches to determine if these regions are duplicated elsewhere in the genome. Indeed, in all three cases there were regions of perfect sequence identity surrounding the site of the recombination event. These stretches of 100% identity extended for 296, 127, and 27 bp within the sequences of the recombining chromosomes (Fig. 5). In contrast, BLAST searches using stretches of sequence at the nine sites of telomere healing indicated that these regions are unique within the genome, with no identifiable stretches of sequence identity at the site of telomere addition. These data are consistent with a model in which HR is the dominant pathway of DSB repair throughout the genome, including within the subtelomeric regions. However, the strict requirement for extensive sequence identity prevents this pathway from repairing breaks that occur within unique sequences, thus allowing telomere healing to occur when sequences similar to the telomerase template RNA are exposed during resection of the DNA strand.
DISCUSSION
Telomere healing appears to be a conserved mechanism for stabilizing chromosomes in the event of a DSB that occurs within a subtelomeric domain. The basic machinery involved in telomere healing is likely shared between malaria parasites and model eukaryotes. In yeast, either the Exo1 or DNA2/Sgs1 pathways mediate DNA resection from the site of the break. Orthologues to both of these enzymes appear to be encoded within the P. falciparum genome (PF3D7_0725000 and PF3D7_1010200, respectively). In addition, telomerase reverse transcriptase (PfTERT) has also been identified and displays the expected motifs, though with many insertions of stretches of basic amino acid sequences, as is often seen in Plasmodium proteins (39). However, our work revealed some unique aspects to telomere healing in P. falciparum. The strong preference for stretches of TG repeats at the site of telomere addition that is observed in yeast was not seen in P. falciparum, nor was any proximal enhancer sequence detected. The different sequence preference likely results from differences in the template region of telomerase RNA, which in yeast consists exclusively of AC base pairs, while in P. falciparum this sequence also includes a GU dinucleotide. The apparent lack of any preference in the sequence immediately flanking the site of repair might indicate more significant evolutionary divergence in how telomerase is recruited. Organism-specific proteins at telomeres have been characterized in model organisms and in African trypanosomes, and they indicate there can be significant evolutionary divergence (40). Proteins known to play important roles in telomerase function and telomere stabilization in other organisms, such as TRF, Cdc13, Rad 52, and POT1, could not be identified in the P. falciparum genome using standard bioinformatics approaches, and thus the recruitment, retention, and function of telomerase in P. falciparum is likely to have unique elements, such as the recently characterized protein PfTRZ. This protein was found to be a functional homologue to the transcription factor TFIIIA, yet it is associated with parasite telomeres and has a role in telomere maintenance (41).
The subtelomeric domains of the chromosomes of P. falciparum are of significant research interest due to the large multicopy gene families that reside within these regions. The unique structure of the chromatin found here has been shown to play a role in regulating clonally variant expression, thereby facilitating the process of antigenic variation and immune system avoidance (42). The positions of these large, semiredundant gene families within the subtelomeric domains of most or all of the chromosomes mean that these genomic regions share significant blocks of sequence identity. In addition, the clustering of these regions at the nuclear periphery makes them prime substrates for HR in the event of a DSB (11). This unique genomic organization, combined with the absence of canonical NHEJ, provides a simple mechanism that drives the generation of chimeric genes and thus the vast diversity of these gene families that is observed in the field. Indeed, a chimeric var gene was readily generated over the course of the experiments described here.
While repair of DSBs within subtelomeric domains by HR is the likely source of diversity within the multicopy gene families, HR appears to not be efficient when a DSB occurs within a sequence that diverges more than ~2% in identity from any possible template for repair (15). Indeed, the three examples of HR identified in this study all displayed long stretches of complete sequence identity between the two chromosomal regions involved in the recombination events (Fig. 5). This finding has been confirmed by other studies that observed similar, albeit somewhat shorter, regions of sequence identity at sites of recombination within var genes (43–45). Given the extensive sequence diversity within var, rifin, stevor, and Pfmc-2TM genes, the chance that a randomly occurring DSB will occur precisely at a position with sufficient sequence identity to another position in the genome to serve as a template for HR is low. In the absence of efficient NHEJ, such DSBs would generally be lethal. However, telomere healing provides an alternative pathway for stabilizing DSBs that occur within subtelomeric domains, thus enabling parasites to survive DNA damage within these regions and maintain genome integrity. Chromosomes that have undergone telomere healing could later undergo HR when a subsequent DSB occurs within a sequence that shares identity with a region of a full-length subtelomeric domain. The resulting gene conversion event would reestablish typical chromosomal structure, including a full complement of clonally variant gene copies as well as TAREs (Fig. 6). When such events occur within the coding regions of the variant antigen-encoding genes, new genes are created, as demonstrated by the HR event that occurred within the subtelomeric region of chromosome 13 described here (Fig. 3B) and as has been observed in other studies (43–45). Of note, these recombination events retain reading frame and general gene structure. Evidence for telomere loss and potential healing events in field isolates indicates that this mechanism of chromosome stabilization occurs in naturally circulating parasites (46–48). This provides a model for mitotic diversification of these important gene families using both telomere healing to stabilize chromosome ends and HR when breaks occur within areas of sequence identity, thereby creating new chimeric genes and restoring complete subtelomeric regions. Telomere healing therefore provides a complementary method to HR for preserving the structure of chromosome ends. Together, these two pathways of DSB repair function to maintain genome integrity and chromosome stability in the absence of robust NHEJ and also drive the generation of diversity within the clonally variant multicopy gene families of P. falciparum.
MATERIALS AND METHODS
Parasite culture.
P. falciparum parasites were cultivated at a 5% hematocrit in RPMI 1640 medium (Corning Life Sciences, Tewksbury, MA), 0.5% AlbuMAX II (Invitrogen), 0.25% sodium bicarbonate, and 0.1 mg/ml gentamicin. Cultures were maintained at 37°C in an atmosphere of 5% oxygen, 5% carbon dioxide, and 90% nitrogen. Individual clonal lines were obtained by limiting dilution (49).
Parasite irradiation.
Irradiation was administered using a Rad Source 2000 irradiator set at 160 kV/25 mA. To generate random DSBs, 3D7 parasites seeded at 0.5% were exposed to 100-Gy X-ray irradiation three times consecutively, being allowed to recover to normal growth between each irradiation exposure. The degree of subtelomeric damage was preliminarily assessed after each round of irradiation by assaying for var gene deletion by quantitative PCR using genomic DNA (gDNA) as the template and the var-specific PCR primer set described previously by Salanti and colleagues (50). The subclone chosen for whole-genome sequencing displayed loss of three subtelomeric var clusters in this assay.
Genomic DNA isolation.
One hundred milliliters of cultured parasites at 5 to 8% parasitemia was harvested for isolation of gDNA. DNA was isolated and purified using phenol-chloroform extraction followed by ethanol precipitation, as previously described (51).
SMRTbell library preparation.
Sequencing libraries were produced using the PacBio 20-kb library preparation protocol for high-molecular-weight gDNA obtained from clonal parasite lines, as previously described (28). We used the SMRTbell template prep kit 1.0 (Pacific Biosciences) following the standard 20-kb template preparation using the BluePippin size Selection system protocol (Pacific Biosciences). Briefly, parasite DNA was sheared twice for 1 min at 5,300 rpm in an Eppendorf 5424 centrifuge using a g-Tube (Covaris) followed by damage repair, end repair, and ligation of SMARTbell adapters. Unligated DNA was digested with exonucleases, and the libraries were size selected using a BluePippin pulsed-field gel electrophoresis instrument (Sage Science) to isolate fragments greater than 15 kb. Library concentration was measured with the Qubit fluorometer dsDNA BR assay kit (Life Technologies, Inc.), and fragment length distributions were generated using the 2200 TapeStation (Agilent). Sequencing primer and P6 polymerase were annealed to the libraries according to the manufacturer’s protocols (Pacific Biosciences) and performed with P6-C5 chemistry and v3 SMRT cells on an RSII instrument at Weill Cornell Medicine.
Genome sequencing and analysis.
Pacific Biosciences RSII-based single-molecule sequencing was used to prepare long-read datasets suitable for accurate assembly of the subtelomeric regions of the parasite’s genome. Six RSII SMRTCells were used for the nonirradiated clone library, while the irradiated clone library was sequenced with eight SMRTCells. Filtered sequence data were assembled using HGAP 2.0 with Quiver polishing. This Celera Assembler-based de novo assembly approach produced genome sizes of 23.2 Mb and 23.3 Mb for the nonirradiated and irradiated clones, respectively. The largest contig was 3.2 Mb, and assembled contig N50 values were 1.5 Mb for both lines. The nonirradiated line generated 29 polished contigs, while the irradiated line produced 24 polished contigs. The average coverage for each assembly was 130× and 100× for the nonirradiated and irradiated lines, respectively, and the average consensus concordance for both assemblies was greater than 0.9975; therefore, only 2.5 bases in 1,000 would be assigned incorrectly.
In order to identify recombination break points, telomere additions, or telomere deletions, each assembly was first aligned to a 3D7 genomic sequence from PlasmoDB using Mauve (28). From the Mauve alignment, chromosomes were assigned to each polished contig in both assemblies. Thirty kilobases from the right and left ends of each chromosome from the irradiated and nonirradiated polished assemblies were then aligned to the PlasmoDB reference genome by using BLAST (52). Recombination break points and telomere additions were determined via manual inspection of the tabulated BLAST output. Each break point was also confirmed using the NCBI Blastn online tool (https://blast.ncbi.nlm.nih.gov/Blast.cgi).
To validate structural variants (SV) seen in comparisons between the PlasmoDB 3D7 reference and our assemblies, reads from the datasets used for assemblies of irradiated and nonirradiated clones were aligned to the Plasmo DB reference using BWA-MEM (version 0.7.13-r1126; arXiv:1303.3997) with default parameters. The resulting alignments were processed with SAMtools (version 2.6.32-279.el6.x86_64) (53) and visualized with IGV (version 2.3.92) for manual inspection of loci with predicted SVs. Analyses of the sequences upstream of the loci at which telomere repeats were added were performed using the MEME software suite and did not identify discernible motifs shared between any of the sequences.
ACKNOWLEDGMENTS
The Department of Microbiology and Immunology at Weill Medical College of Cornell University acknowledges the support of the William Randolph Hearst Foundation. This work was supported by the National Institutes of Health (AI 52390 to K.W.D., AI 99327 to K.W.D. and L.A.K., and AI76635 to L.A.K.). K.W.D. is a Stavros S. Niarchos Scholar and a recipient of a William Randolph Hearst Endowed Faculty Fellowship. S.F.C. is supported by a diversity supplement from the National Institutes of Health. L.A.K. is a William Randolph Hearst Foundation Clinical Scholar in Microbiology and Infectious Diseases. Funding for the open access charge was provided by the National Institutes of Health.
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Footnotes
Citation Calhoun SF, Reed J, Alexander N, Mason CE, Deitsch KW, Kirkman LA. 2017. Chromosome end repair and genome stability in Plasmodium falciparum. mBio 8:e00547-17. https://doi.org/10.1128/mBio.00547-17.
REFERENCES
- 1.Cousins S. 2016. Targets to reduce malaria unlikely to be met, says WHO. BMJ 355:i6668. doi: 10.1136/bmj.i6668. [DOI] [PubMed] [Google Scholar]
- 2.Miller LH, Baruch DI, Marsh K, Doumbo OK. 2002. The pathogenic basis of malaria. Nature 415:673–679. doi: 10.1038/415673a. [DOI] [PubMed] [Google Scholar]
- 3.Golenser J, Kamyl M, Tsafack A, Marva E, Cohen A, Kitrossky N, Chevion M. 1992. Correlation between destruction of malarial parasites by polymorphonuclear leucocytes and oxidative stress. Free Radic Res Commun 17:249–262. doi: 10.3109/10715769209079517. [DOI] [PubMed] [Google Scholar]
- 4.Schnappinger D, Ehrt S, Voskuil MI, Liu Y, Mangan JA, Monahan IM, Dolganov G, Efron B, Butcher PD, Nathan C, Schoolnik GK. 2003. Transcriptional adaptation of Mycobacterium tuberculosis within macrophages: insights into the phagosomal environment. J Exp Med 198:693–704. doi: 10.1084/jem.20030846. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gopalakrishnan AM, Kumar N. 2015. Antimalarial action of artesunate involves DNA damage mediated by reactive oxygen species. Antimicrob Agents Chemother 59:317–325. doi: 10.1128/AAC.03663-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gardner MJ, Hall N, Fung E, White O, Berriman M, Hyman RW, Carlton JM, Pain A, Nelson KE, Bowman S, Paulsen IT, James K, Eisen JA, Rutherford K, Salzberg SL, Craig A, Kyes S, Chan MS, Nene V, Shallom SJ, Suh B, Peterson J, Angiuoli S, Pertea M, Allen J, Selengut J, Haft D, Mather MW, Vaidya AB, Martin DM, Fairlamb AH, Fraunholz MJ, Roos DS, Ralph SA, McFadden GI, Cummings LM, Subramanian GM, Mungall C, Venter JC, Carucci DJ, Hoffman SL, Newbold C, Davis RW, Fraser CM, Barrell B. 2002. Genome sequence of the human malaria parasite Plasmodium falciparum. Nature 419:498–511. doi: 10.1038/nature01097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Figueiredo LM, Pirrit LA, Scherf A, Pirritt LA. 2000. Genomic organisation and chromatin structure of Plasmodium falciparum chromosome ends. Mol Biochem Parasitol 106:169–174. doi: 10.1016/S0166-6851(99)00199-1. [DOI] [PubMed] [Google Scholar]
- 8.Chookajorn T, Dzikowski R, Frank M, Li F, Jiwani AZ, Hartl DL, Deitsch KW. 2007. Epigenetic memory at malaria virulence genes. Proc Natl Acad Sci U S A 104:899–902. doi: 10.1073/pnas.0609084103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lopez-Rubio JJ, Gontijo AM, Nunes MC, Issar N, Hernandez Rivas RR, Scherf A. 2007. 5′ Flanking region of var genes nucleate histone modification patterns linked to phenotypic inheritance of virulence traits in malaria parasites. Mol Microbiol 66:1296–1305. doi: 10.1111/j.1365-2958.2007.06009.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Jiang L, Mu J, Zhang Q, Ni T, Srinivasan P, Rayavara K, Yang W, Turner L, Lavstsen T, Theander TG, Peng W, Wei G, Jing Q, Wakabayashi Y, Bansal A, Luo Y, Ribeiro JM, Scherf A, Aravind L, Zhu J, Zhao K, Miller LH. 2013. PfSETvs methylation of histone H3K36 represses virulence genes in Plasmodium falciparum. Nature 499:223–227. doi: 10.1038/nature12361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Freitas-Junior LH, Bottius E, Pirrit LA, Deitsch KW, Scheidig C, Guinet F, Nehrbass U, Wellems TE, Scherf A. 2000. Frequent ectopic recombination of virulence factor genes in telomeric chromosome clusters of P. falciparum. Nature 407:1018–1022. doi: 10.1038/35039531. [DOI] [PubMed] [Google Scholar]
- 12.Bhattacharyya MK, Norris DE, Kumar N. 2004. Molecular players of homologous recombination in protozoan parasites: implications for generating antigenic variation. Infect Genet Evol 4:91–98. doi: 10.1016/j.meegid.2004.01.008. [DOI] [PubMed] [Google Scholar]
- 13.Mladenov E, Iliakis G. 2011. Induction and repair of DNA double strand breaks: the increasing spectrum of non-homologous end joining pathways. Mutat Res 711:61–72. doi: 10.1016/j.mrfmmm.2011.02.005. [DOI] [PubMed] [Google Scholar]
- 14.LaRocque JR, Jasin M. 2010. Mechanisms of recombination between diverged sequences in wild-type and BLM-deficient mouse and human cells. Mol Cell Biol 30:1887–1897. doi: 10.1128/MCB.01553-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kirkman LA, Lawrence EA, Deitsch KW. 2014. Malaria parasites utilize both homologous recombination and alternative end joining pathways to maintain genome integrity. Nucleic Acids Res 42:370–379. doi: 10.1093/nar/gkt881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Singer M, Marshall J, Heiss K, Mair GR, Grimm D, Mueller AK, Frischknecht F. 2015. Zinc finger nuclease-based double-strand breaks attenuate malaria parasites and reveal rare microhomology-mediated end joining. Genome Biol 16:249. doi: 10.1186/s13059-015-0811-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bhattacharyya MK, Kumar N. 2003. Identification and molecular characterisation of DNA damaging agent induced expression of Plasmodium falciparum recombination protein PfRad51. Int J Parasitol 33:1385–1392. doi: 10.1016/S0020-7519(03)00212-1. [DOI] [PubMed] [Google Scholar]
- 18.Roy N, Bhattacharyya S, Chakrabarty S, Laskar S, Babu SM, Bhattacharyya MK. 2014. Dominant negative mutant of Plasmodium Rad51 causes reduced parasite burden in host by abrogating DNA double-strand break repair. Mol Microbiol 94:353–366. doi: 10.1111/mmi.12762. [DOI] [PubMed] [Google Scholar]
- 19.Badugu SB, Nabi SA, Vaidyam P, Laskar S, Bhattacharyya S, Bhattacharyya MK. 2015. Identification of Plasmodium falciparum DNA repair protein Mre11 with an evolutionarily conserved nuclease function. PLoS One 10:e0125358. doi: 10.1371/journal.pone.0125358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Pennaneach V, Putnam CD, Kolodner RD. 2006. Chromosome healing by de novo telomere addition in Saccharomyces cerevisiae. Mol Microbiol 59:1357–1368. doi: 10.1111/j.1365-2958.2006.05026.x. [DOI] [PubMed] [Google Scholar]
- 21.Ribeyre C, Shore D. 2013. Regulation of telomere addition at DNA double-strand breaks. Chromosoma 122:159–173. doi: 10.1007/s00412-013-0404-2. [DOI] [PubMed] [Google Scholar]
- 22.Mattei D, Scherf A. 1994. Subtelomeric chromosome instability in Plasmodium falciparum: short telomere-like sequence motifs found frequently at healed chromosome breakpoints. Mutat Res 324:115–120. doi: 10.1016/0165-7992(94)90055-8. [DOI] [PubMed] [Google Scholar]
- 23.Bottius E, Bakhsis N, Scherf A. 1998. Plasmodium falciparum telomerase: de novo telomere addition to telomeric and nontelomeric sequences and role in chromosome healing. Mol Cell Biol 18:919–925. doi: 10.1128/MCB.18.2.919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Scherf A, Mattei D. 1992. Cloning and characterization of chromosome breakpoints of Plasmodium falciparum: breakage and new telomere formation occurs frequently and randomly in subtelomeric genes. Nucleic Acids Res 20:1491–1496. doi: 10.1093/nar/20.7.1491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Putnam CD, Pennaneach V, Kolodner RD. 2004. Chromosome healing through terminal deletions generated by de novo telomere additions in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 101:13262–13267. doi: 10.1073/pnas.0405443101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Stellwagen AE, Haimberger ZW, Veatch JR, Gottschling DE. 2003. Ku interacts with telomerase RNA to promote telomere addition at native and broken chromosome ends. Genes Dev 17:2384–2395. doi: 10.1101/gad.1125903. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Eid J, Fehr A, Gray J, Luong K, Lyle J, Otto G, Peluso P, Rank D, Baybayan P, Bettman B, Bibillo A, Bjornson K, Chaudhuri B, Christians F, Cicero R, Clark S, Dalal R, Dewinter A, Dixon J, Foquet M, Gaertner A, Hardenbol P, Heiner C, Hester K, Holden D, Kearns G, Kong X, Kuse R, Lacroix Y, Lin S, Lundquist P, Ma C, Marks P, Maxham M, Murphy D, Park I, Pham T, Phillips M, Roy J, Sebra R, Shen G, Sorenson J, Tomaney A, Travers K, Trulson M, Vieceli J, Wegener J, Wu D, Yang A, Zaccarin D, Zhao P, Zhong F, Korlach J, Turner S. 2009. Real-time DNA sequencing from single polymerase molecules. Science 323:133–138. doi: 10.1126/science.1162986. [DOI] [PubMed] [Google Scholar]
- 28.Vembar SS, Seetin M, Lambert C, Nattestad M, Schatz MC, Baybayan P, Scherf A, Smith ML. 2016. Complete telomere-to-telomere de novo assembly of the Plasmodium falciparum genome through long-read (>11 kb), single molecule, real-time sequencing. DNA Res 23:339–351. doi: 10.1093/dnares/dsw022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bonetti D, Martina M, Clerici M, Lucchini G, Longhese MP. 2009. Multiple pathways regulate 3′ overhang generation at S. cerevisiae telomeres. Mol Cell 35:70–81. doi: 10.1016/j.molcel.2009.05.015. [DOI] [PubMed] [Google Scholar]
- 30.Obodo UC, Epum EA, Platts MH, Seloff J, Dahlson NA, Velkovsky SM, Paul SR, Friedman KL. 2016. Endogenous hot Spots of de novo telomere addition in the yeast genome contain proximal enhancers that bind Cdc13. Mol Cell Biol 36:1750–1763. doi: 10.1128/MCB.00095-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hirano Y, Sugimoto K. 2007. Cdc13 telomere capping decreases Mec1 association but does not affect Tel1 association with DNA ends. Mol Biol Cell 18:2026–2036. doi: 10.1091/mbc.E06-12-1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hirano Y, Fukunaga K, Sugimoto K. 2009. Rif1 and rif2 inhibit localization of tel1 to DNA ends. Mol Cell 33:312–322. doi: 10.1016/j.molcel.2008.12.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zhang W, Durocher D. 2010. De novo telomere formation is suppressed by the Mec1-dependent inhibition of Cdc13 accumulation at DNA breaks. Genes Dev 24:502–515. doi: 10.1101/gad.1869110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Negrini S, Ribaud V, Bianchi A, Shore D. 2007. DNA breaks are masked by multiple Rap1 binding in yeast: implications for telomere capping and telomerase regulation. Genes Dev 21:292–302. doi: 10.1101/gad.400907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chakrabarti K, Pearson M, Grate L, Sterne-Weiler T, Deans J, Donohue JP, Ares M Jr. 2007. Structural RNAs of known and unknown function identified in malaria parasites by comparative genomics and RNA analysis. RNA 13:1923–1939. doi: 10.1261/rna.751807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Pennock E, Buckley K, Lundblad V. 2001. Cdc13 delivers separate complexes to the telomere for end protection and replication. Cell 104:387–396. doi: 10.1016/S0092-8674(01)00226-4. [DOI] [PubMed] [Google Scholar]
- 37.Fisher TS, Taggart AK, Zakian VA. 2004. Cell cycle-dependent regulation of yeast telomerase by Ku. Nat Struct Mol Biol 11:1198–1205. doi: 10.1038/nsmb854. [DOI] [PubMed] [Google Scholar]
- 38.DuBois ML, Haimberger ZW, McIntosh MW, Gottschling DE. 2002. A quantitative assay for telomere protection in Saccharomyces cerevisiae. Genetics 161:995–1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Figueiredo L, Scherf A. 2005. Plasmodium telomeres and telomerase: the usual actors in an unusual scenario. Chromosome Res 13:517–524. doi: 10.1007/s10577-005-0996-3. [DOI] [PubMed] [Google Scholar]
- 40.Lira CB, Giardini MA, Neto JL, Conte FF, Cano MI. 2007. Telomere biology of trypanosomatids: beginning to answer some questions. Trends Parasitol 23:357–362. doi: 10.1016/j.pt.2007.06.005. [DOI] [PubMed] [Google Scholar]
- 41.Bertschi NL, Toenhake CG, Zou A, Niederwieser I, Henderson R, Moes S, Jenoe P, Parkinson J, Bartfai R, Voss TS. 2017. Malaria parasites possess a telomere repeat-binding protein that shares ancestry with transcription factor IIIA. Nat Microbiol 2:17033. doi: 10.1038/nmicrobiol.2017.33. [DOI] [PubMed] [Google Scholar]
- 42.Cortés A, Deitsch KW. 2017. Malaria epigenetics. Cold Spring Harb Perspect Med 7:a025528. doi: 10.1101/cshperspect.a025528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Sander AF, Lavstsen T, Rask TS, Lisby M, Salanti A, Fordyce SL, Jespersen JS, Carter R, Deitsch KW, Theander TG, Pedersen AG, Arnot DE. 2014. DNA secondary structures are associated with recombination in major Plasmodium falciparum variable surface antigen gene families. Nucleic Acids Res 42:2270–2281. doi: 10.1093/nar/gkt1174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Claessens A, Hamilton WL, Kekre M, Otto TD, Faizullabhoy A, Rayner JC, Kwiatkowski D. 2014. Generation of antigenic diversity in Plasmodium falciparum by structured rearrangement of var genes during mitosis. PLoS Genet 10:e1004812. doi: 10.1371/journal.pgen.1004812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bopp SE, Manary MJ, Bright AT, Johnston GL, Dharia NV, Luna FL, McCormack S, Plouffe D, McNamara CW, Walker JR, Fidock DA, Denchi EL, Winzeler EA. 2013. Mitotic evolution of Plasmodium falciparum shows a stable core genome but recombination in antigen families. PLoS Genet 9:e1003293. doi: 10.1371/journal.pgen.1003293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Dharia NV, Plouffe D, Bopp SE, González-Páez GE, Lucas C, Salas C, Soberon V, Bursulaya B, Kochel TJ, Bacon DJ, Winzeler EA. 2010. Genome scanning of Amazonian Plasmodium falciparum shows subtelomeric instability and clindamycin-resistant parasites. Genome Res 20:1534–1544. doi: 10.1101/gr.105163.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Murillo SC, Akinyi OS, Abdallah JF, Pava Z, Dorado E, Incardona S, Huber CS, Macedo de Oliveira A, Bell D, Udhayakumar V, Barnwell JW. 2015. Deletion of Plasmodium falciparum histidine-rich protein 2 (pfhrp2) and histidine-rich protein 3 (pfhrp3) genes in Colombian parasites. PLoS One 10:e0131576. doi: 10.1371/journal.pone.0131576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Biggs BA, Kemp DJ, Brown GV. 1989. Subtelomeric chromosome deletions in field isolates of Plasmodium falciparum and their relationship to loss of cytoadherence in vitro. Proc Natl Acad Sci U S A 86:2428–2432. doi: 10.1073/pnas.86.7.2428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Kirkman LA, Su XZ, Wellems TE. 1996. Plasmodium falciparum: isolation of large numbers of parasite clones from infected blood samples. Exp Parasitol 83:147–149. doi: 10.1006/expr.1996.0058. [DOI] [PubMed] [Google Scholar]
- 50.Salanti A, Staalsoe T, Lavstsen T, Jensen ATR, Sowa MPK, Arnot DE, Hviid L, Theander TG. 2003. Selective upregulation of a single distinctly structured var gene in chondroitin sulphate A-adhering Plasmodium falciparum involved in pregnancy-associated malaria. Mol Microbiol 49:179–191. doi: 10.1046/j.1365-2958.2003.03570.x. [DOI] [PubMed] [Google Scholar]
- 51.Swamy L, Amulic B, Deitsch KW. 2011. Plasmodium falciparum var. gene silencing is determined by cis DNA elements that form stable and heritable interactions. Eukaryot Cell 10:530–539. doi: 10.1128/EC.00329-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 53.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup . 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.