Skip to main content
eLife logoLink to eLife
. 2021 Apr 13;10:e58791. doi: 10.7554/eLife.58791

Converting endogenous genes of the malaria mosquito into simple non-autonomous gene drives for population replacement

Astrid Hoermann 1, Sofia Tapanelli 1, Paolo Capriotti 1, Giuseppe Del Corsano 1, Ellen KG Masters 1, Tibebu Habtewold 1, George K Christophides 1, Nikolai Windbichler 1,
Editors: Philipp W Messer2, Diethard Tautz3
PMCID: PMC8043746  PMID: 33845943

Abstract

Gene drives for mosquito population replacement are promising tools for malaria control. However, there is currently no clear pathway for safely testing such tools in endemic countries. The lack of well-characterized promoters for infection-relevant tissues and regulatory hurdles are further obstacles for their design and use. Here we explore how minimal genetic modifications of endogenous mosquito genes can convert them directly into non-autonomous gene drives without disrupting their expression. We co-opted the native regulatory sequences of three midgut-specific loci of the malaria vector Anopheles gambiae to host a prototypical antimalarial molecule and guide-RNAs encoded within artificial introns that support efficient gene drive. We assess the propensity of these modifications to interfere with the development of Plasmodium falciparum and their effect on fitness. Because of their inherent simplicity and passive mode of drive such traits could form part of an acceptable testing pathway of gene drives for malaria eradication.

Research organism: Other

Introduction

After more than a decade of sustained success in the fight against malaria, data from 2015 onwards suggests that no significant progress in reducing global malaria cases has been achieved (World malaria report, 2019). The rise of insecticide resistance in mosquito vectors and drug resistance in parasites highlights the urgent need to develop new tools if malaria eradication is to remain a viable goal.

Synthetic gene drives spreading by super-Mendelian inheritance within vector populations have been proposed as an area-wide genetic strategy for the control of malaria (Raban et al., 2020). They mimic the mechanism of proliferation of a class of naturally occurring selfish genes found in protists, mitochondria, and chloroplasts called homing endonucleases and could be deployed to modify the genetic makeup of disease vector populations (Burt, 2003; Windbichler et al., 2011). Proof-of-principle laboratory experiments of CRISPR/Cas9-based gene drives for population suppression, which aims at the elimination of the target population (Kyrou et al., 2018; Hammond et al., 2016), as well as population replacement (Gantz et al., 2015; Pham et al., 2019; Adolfi et al., 2020), which aims to spread an anti-malarial trait within a vector population, suggest that these strategies could be deployed to reduce malaria transmission in the field.

The success of population replacement in particular hinges on the availability of molecules that can efficiently block Plasmodium development within the mosquito vector, as well as ways to express such elements. Few pre-characterized promoter elements driving tissue-specific expression in infection-relevant tissues currently exist in malaria vectors. Examples include the regulatory elements of the zinc carboxypeptidase A1 (CP), peritrophin1 (Aper1), or the vitellogenin (Vg) genes that have been reported to drive transgene expression (Nolan et al., 2011; Ito et al., 2002; Abraham et al., 2005; Nirmala et al., 2006; Volohonsky et al., 2015). A variety of endogenous and exogenous effector molecules have been expressed in transgenic mosquitoes to interfere with the development of the parasite including anti-malarial regulators such as Cecropin A (Kim et al., 2004), fibrinogen domain-containing immunolectin 9 (FBN9) (Simões et al., 2017), TEP1 (Volohonsky et al., 2017), the transcription factor Rel2 (Dong et al., 2011), as well as the protein kinase Akt (Corby-Harris et al., 2010; Arik et al., 2015) and the phosphatase and tensin homolog (PTEN) (Hauck et al., 2013) involved in insulin signalling. Synthetic peptides Vida3 (Ito et al., 2002) and SM1, engineered as a quadruplet (Meredith et al., 2011), the bee venom phospholipases A2 (PLA2) (Abraham et al., 2005; Moreira et al., 2002), as well as single-chain antibodies (scFvs, m1C3, m4B7, and m2A10) targeting the ookinete protein chitinase 1 and the circumsporozoite protein (CSP) (Isaacs et al., 2011; Isaacs et al., 2012 ) have been used. Recently, microRNA sponges have been suggested as modulators of mosquito immunity (Dong et al., 2020a). The performance of these effector molecules has been evaluated using laboratory strains of the human parasite Plasmodium falciparum or the rodent parasite Plasmodium berghei, but the efficacy against circulating polymorphic P. falciparum strains of human malaria parasites is currently unknown.

A further consideration is the population dynamics and persistence of transmission-blocking gene drives (Beaghton et al., 2017; Noble et al., 2017), which would suggest a trade-off between the targeting of a conserved sequence with a resulting fitness cost versus the targeting of a neutral genomic site that shows poor conservation and engenders the evolution of resistance to the drive. A number of ways have been suggested to work around this, for example, the linking of rescue-copies of essential genes to drive elements (Adolfi et al., 2020; Esvelt et al., 2014; Champer et al., 2020a). Cleave and rescue elements are a related approach that does not rely on homing (Oberhofer et al., 2019; Oberhofer et al., 2020; Champer et al., 2020b).

We have recently modelled an alternative approach to achieve population replacement that was specifically designed to address these questions (Nash et al., 2019). The approach, we have termed integral gene drive (IGD), imitates the way naturally occurring homing endonuclease genes propagate, that is, by copying themselves into highly conserved sites within genes. They do not disrupt their target genes due to their association with introns or inteins (Hafez and Hausner, 2012; Barzel et al., 2011). We analysed the theoretical behaviour of IGDs featuring the complete molecular separation of drive and effector functions into minimal modifications of endogenous host genes at two or more different genomic loci (Nash et al., 2019). This approach takes advantage of the promoter and also of its surrounding regulatory regions of the modified endogenous locus. To accommodate the guide-module required for subsequent gene drive and a marker-module required for monitoring transgenesis, we suggested to insert an intron into the effector gene.

In the present study we sought to scope the feasibility of this approach in the African malaria mosquito A. gambiae. Our aim was to generate minimal genetic modifications of mosquito genes in situ, which would turn particular alleles into non-autonomous gene drives whilst also expressing a putative antimalarial effector molecule. We sought to assess the efficacy of gene drive, the ability to express anti-Plasmodium molecules by linkage to genes active in infection-relevant tissues, and how these modifications would affect the expression of such host genes, as well as overall mosquito fitness. In order to establish this experimental proof-of-principle for IGD, we chose Scorpine as a prototypical effector molecule. This known anti-microbial peptide (AMP) had previously been shown to have a transmission blocking effect on P. falciparum in the context of paratransgenesis with several different microorganisms, and recently in transgenic mosquitoes in combination with other effectors (Fang et al., 2011; Bongio and Lampe, 2015; Wang et al., 2012; Shane et al., 2018; Dong et al., 2020b). For the same reason we focused here on female mid-gut specific genes since they allow targeting the Plasmodium falciparum parasite during the ookinete to oocyst transition stage (Sinden, 2004).

Results

Direct modification of three A. gambiae midgut loci

We chose CP (AGAP009593) and Aper1 (AGAP006795) for experimental validation of the IGD strategy because their promoters had been used previously for conventional transgene overexpression (Ito et al., 2002; Abraham et al., 2005). CP is expressed in the guts of pupae and sugar-fed adults and becomes 10-fold upregulated by blood-feeding as well as by feeding on protein-free meals, probably triggered by the gut distention (Edwards et al., 1997). Its mRNA levels peak 3 hr post blood-feeding (pbf), with spatial expression limited to the posterior midgut, and return to normal levels 24 hr pbf. Aper1 is only detected in the adult gut, but not in larvae, pupae, or adult carcass, and its mRNA expression profile is independent of age and blood-feeding (Shen and Jacobs-Lorena, 1998). The protein is stored in secretory vesicles and released into the midgut lumen soon after blood meal, where it is cross-linked into the peritrophic matrix via its two tandem domains that bind chitin (Devenport et al., 2004). The peritrophic matrix retracts from the midgut wall 48 hr pbf and is excreted after 72 hr pbf (Dinglasan et al., 2009). Using the VectorBase expression explorer (Maccallum et al., 2011) we identified further genes with significant expression in the female midgut and upregulation after the blood meal. Amongst them we chose alkaline phosphatase 2 (AP2, AGAP006400), which has a secretion signal and a GPI-anchor and is detected in the detergent resistant membrane (DRM) proteome of non-blood-fed adult Anopheles gambiae midguts (Parish et al., 2011).

We next identified guide RNA (gRNA) target sites around the start and stop codons of these genes for the insertion of the construct sequences (Figure 1A). The gRNAs were chosen based on a compromise between proximity of the cut site to the start or stop codons, activity and off-target scores, as well as the lack of common target site polymorphisms in public population genetic data sets of the mosquito (Table S1). We generated transformation constructs consisting of homology arms for these three loci designed to facilitate integration of the Scorpine coding sequence within each gene. In the case of CP and AP2, the Scorpine coding sequence was inserted at the start codon and linked to the coding sequence of the host gene via the 2A autocleavage peptide (de Felipe and Ryan, 2004). In contrast, we anchored the effector to the peritrophic matrix via a C-terminal fusion to Aper1. In the AP2 strain, Scorpine is expressed as a GFP-fusion with the aim to enhance stability of the protein (Bokman and Ward, 1981; Janczak et al., 2015). In each case, the effector coding sequences harboured an artificial intron encoding the gRNA (Figure 1A). We have recently characterized, in S12 cells and transgenic Drosophila strains, a number of artificial introns optimized equally for splicing and gRNA expression using an intronic RNA polymerase III promoter (A. Nash, unpublished) and we applied these designs here. Each artificial intron harbours a fluorescent marker driven by the 3xP3 promoter, flanked by loxP-sites, and the corresponding guide RNA under the control of the ubiquitous and constitutive Anopheles gambiae U6 promoter (Figure 1A).

Figure 1. Homology-directed modification of Anopheles gambiae midgut loci by CRISPR/Cas9.

Figure 1.

(A) Schematic showing the insertion of the effector construct at the carboxypeptidase (CP), alkaline phosphatase 2 (AP2), and peritrophin 1 (Aper 1) loci. The donor plasmid supplies the effector coding sequence (yellow), which accommodates an artificial intron. The intron harbours a fluorescent marker (either GFP or CFP, green) under the control of the 3xP3 promoter flanked by loxP sites (black dots) and a U6 driven guide-RNA module (red), required for both transgenesis and subsequently for gene drive. The plasmid features regions of homology that drive N- or C-terminal insertion at the start (CP, AP2) or stop codon (Aper1) as well as a 3xP3::DsRed plasmid-backbone marker. The gRNA target sequence (red) including the PAM motif (bold) and the target strand are indicated. (B) Summary of embryo microinjections and the identification of transgenic individuals by fluorescent screening. (C) Adult transgenic mosquitos with fluorescent expression of GFP or CFP in the eyes under the control of the 3xP3 promoter as well as a pupa showing DsRed fluorescence, indicating plasmid-backbone integration (AP2’).

For transgenesis, we co-injected a helper plasmid carrying the Cas9 coding sequence under the control of the vasa promoter (Hammond et al., 2016) and the results of these experiments are summarized in Figure 1B and C. We established the three transgenic strains ScoGFP-CP, ScoGCFP-AP2, and Aper1-ScoGFP, in the latter case from a single G1 founder (Figure 1B). Integration of the vector backbone distinguished via the additional DsRed marker was only observed for the AP2 locus in several G1 individuals that were discarded (Figure 1C). Sequencing of all individuals within a G1 founder-cage for each strain confirmed that insertion into the genomic locus was precise at all three loci, and no aberrant integration events could be detected.

Establishing minimal genetic modifications

We predicted that efficient splicing of the artificial introns would occur only in the absence of the 3xP3-GFP or CFP fluorescent transformation marker genes, as they contain the bidirectional SV40 terminator. We thus expected that the integrations we had established would interfere with the function of the mosquito host genes. However, following sib-mating we found that for all three modified loci, homozygous individuals were viable and fertile and showed no striking fitness defects during rearing and maintenance. In order to establish minimal genetic modifications, that is, to remove the fluorescent transformation markers flanked by loxP sites, we next crossed all three transgenic strains to a vasa-Cre strain (Volohonsky et al., 2015). We hereby established the markerless strains Sco-CP, ScoG-AP2, and Aper1-Sco (Figure 2A), all of which were also found to be homozygous viable and fertile. Establishing and tracking markerless modifications in A. gambiae are challenging and have not been attempted previously. Specifically, we achieved to establish pure-breeding markerless strains either by genotyping of pupal cases of individuals lacking visible fluorescent markers in the case of Sco-CP (Figure 2B) or in the case of Aper1-Sco by crossing the transhemizygotes to a Cas9 expressing strain to induce homing and preferential inheritance of the markerless allele (Figure 2—figure supplement 1A). For generating ScoG-AP2, we used the homozygous ScoGCFP-AP2 as a dominant balancing marker for tracking inheritance of the unmarked allele (Figure 2—figure supplement 1B). PCR on genomic DNA (Figure 2C) and subsequent sequencing confirmed precise removal of the marker module and also suggested successful homing of the construct.

Figure 2. Generation of minimal genetic modifications.

(A) Schematic showing the inserted transgene constructs within the exon structure of the CP, AP2, and Aper1 loci prior and following the excision of the marker gene by Cre recombinase (top) and the observed changes in green or cyan fluorescence in L3 larvae (bottom). Half arrows indicate primers for the PCRs shown in panel C and white arrows indicate the eyes in the markerless individuals. 2A indicates the F2A self-cleaving peptide signal. (B) Crossing scheme used for the establishment of markerless strain Sco-CP by crossing to a Cre recombinase expressing strain. Non-fluorescent adults were allowed to hatch individually, and their pupal cases were used for genotyping. (C) PCR genotyping of genomic DNA of homozygous individuals of all strains with primer-pairs (shown in A) spanning the three loci. The entire locus could not be amplified in strain ScoGCFP-AP2 that contains both GFP and CFP, and hence separate 5’ and 3’ fragments were analysed by PCR.

Figure 2.

Figure 2—figure supplement 1. Crossing schemes used for the establishment of markerless strains.

Figure 2—figure supplement 1.

The strains Aper1-Sco (A) and ScoG-AP2 (B) were rendered homozygous by first crossing to a Cre recombinase and then a Cas9 expressing strain and by employing positive and negative selection via the fluorescent markers at each stage.

Expression analysis of integrated effector sequences and their effect on the host genes

For a comparative expression study, we extracted RNA from female midguts 3 hr after blood-feeding of the three strains ScoGFP-CP, ScoGCFP-AP2, and Aper1-ScoGFP and of the corresponding markerless strains Sco-CP, ScoG-AP2, and Aper1-Sco which carry minimal genetic modifications (Figure 3A). RT-PCRs over the splice junctions and subsequent sequencing confirmed that for all three markerless strains precise splicing occurs and that the removal of the artificial intron restores the open reading frame required for expression of both the effector protein and the host gene product (Figure 3B). However, we found evidence of full-length transcripts and successful splicing (possibly at reduced levels) even in the strains retaining the marker cassette which includes the bidirectional SV40 terminator (Figure 3—figure supplement 1A). This suggests that the modified host genes could be expressing the endogenous proteins to some degree even in these transgenic strains.

Figure 3. Splicing of the artificial intron and expression analysis.

(A) Schematic showing mRNA expression of the modified host genes of the three markerless strains assuming correct splicing of the artificial intron. Black arrows indicate RT-PCR primers used in B. (B) RT-PCR (left) and sequencing of the amplicons (right) indicate precise splicing of the three artificial introns. Midguts of homozygous strains Sco-CP, ScoG-AP2, and Aper1-Sco were dissected 3 hr after blood feeding and RNA was extracted for RT-PCR. (C) Relative mRNA expression of Scorpine in the transgenic strains with and without the marker cassette. (D) Expression of the host genes CP, AP2, and Aper1 in transgenic strains with and without the marker cassette relative to the wild type. RNA was extracted from 10 to 15 midguts 3 hr after the blood meal in the case of CP and Aper1, and non-blood-fed for AP2. qPCR with the primer pairs indicated as coloured half arrows in the schematic below was conducted on cDNA and expression was normalized to the S7 house-keeping reference gene. Data derive from three biological replicates with three technical replicates each. p-values were calculated on ΔCt values using the unpaired Student’s t-test. *p≤0.05, **p≤0.01, ***p≤0.001, ****p≤0.0001, and ns: not significant.

Figure 3.

Figure 3—figure supplement 1. Scorpine expression levels and mass spectrometry of co-opted host genes.

Figure 3—figure supplement 1.

(A) RT-PCR with primers spanning the splice site (indicated as half arrows in the schematic below) on 3 hr blood-fed guts. (B) qPCR for the integrated Scorpine sequence was performed on cDNA from both non-blood-fed guts and guts dissected 3 hr after blood-feed of all three markerless transgenic strains. The primer-pair binding to Scorpine is indicated as yellow half arrows in the inset. Error bars indicate standard deviation from three technical replicates normalized to the S7 reference gene. G3 wild-type controls (blood-fed and non-blood-fed) run in parallel were negative (Ct mean values above 35). (C) Peptides identified in mass-spectrometry that map to the endogenous host genes. Proteins were extracted from non-blood-fed guts of the homozygous transgenic lines Sco-CP and Aper1-Sco. Note that Sco-CP (but not Aper1-Sco) is expected to generate two protein products due to the presence of the 2A peptide. Green rectangles indicate high confidence peptides with amino acid sequences shown below.

We next analysed mRNA expression by qPCR with a primer pair targeting exon 1 of Scorpine. In this preliminary experiment we compared non-blood-fed guts with guts dissected 3 hr after the blood meal for the three markerless lines (Figure 3—figure supplement 1B). In the case of ScoG-AP2, the qPCR signal of Scorpine was near the detection limit in blood-fed guts and low in non-blood-fed guts. For Sco-CP, an upregulation after the blood meal was observed in accordance with published data for the CP promoter (Edwards et al., 1997). Aper1-Sco showed the highest expression levels under both conditions. We thus continued our analysis with the 3 hr pbf time point for CP and Aper1 strains and with non-blood-fed guts for the AP2 strains. Using the Scorpine primers, no significant difference in the level of mRNA expression was found when comparing the ScoGFP-CP and Aper1-ScoGFP with their respective markerless lines Sco-CP and Aper1-Sco (Figure 3C). A significant increase in expression was observed for strain ScoG-AP2 following removal of the fluorescent marker-module although overall expression levels were substantially lower when compared to the other two strains (Figure 3C). We further evaluated mRNA expression of the modified and unmodified host genes using primer pairs targeting the corresponding coding sequences of CP, AP2, and Aper1 (Figure 3D). Our analysis showed that host gene expression in strains ScoGFP-CP and Sco-CP is not significantly different when compared to the wild type. The expression of AP2 and Aper1 is significantly reduced by the presence of the transgene including the marker cassette. Removal of the marker-module significantly restored host gene expression to approximately half of the level observed in the wild type in strain Aper1-Sco but remained below 10% for strain ScoG-AP2 (Figure 3D).

We also performed mass-spectrometry (LC-MS/MS) on non-blood-fed guts to confirm that protein expression of the co-opted host gene is unaffected by effector integration or splicing of the artificial intron. We detected high-confidence peptides for both genes in the Sco-CP and Aper1-Sco strains (Figure 3—figure supplement 1B). For Aper1, the peptides map to two out of the three regions from published data, and for Sco-CP 4 out of the 14 published peptides were identified (Chaerkady et al., 2011).

Mosquito fitness analysis

We assessed the effects of these modifications on fecundity and larval hatching of all six transgenic strains and compared them to individuals of the wild-type (G3) colony and additionally to the vasa-Cre strain (Volohonsky et al., 2015), which derives from the Anopheles gambiae KIL background (Meredith et al., 2011) and was used to generate the markerless strains (Figure 4). While females of the strains including the marker-module showed no statistically significant difference compared to G3, the markerless strains Sco-CP, ScoG-AP2, and Aper1-Sco laid a significantly reduced number of eggs. The comparison suggests that here the modification of the host genes is not likely to be the main driver of the observed fitness effects. Since the removal of the fluorescent module by itself is unlikely to decrease fitness (or could even marginally improve it), inbreeding effects or the contribution of the KIL background (introduced via crossing to the Cre line) could partially explain the observed reduction in fecundity of the markerless strains. Indeed, the Cre strain also showed a significant decrease when compared to G3. ScoGCFP-AP2 and Aper1-Sco strains additionally showed a statistically significant reduction in the hatching rate compared to the G3 wild type (Figure 4C and D). Since all three modified host genes are female and midgut specific, and because fitness effects would be expected to be sex-specific, we also measured pupal sex ratio of the markerless strains but found no significant deviations from an expected 1:1 sex ratio for any of the transgenics (Figure 4—figure supplement 1). Nevertheless, the presence of negative fitness effects of the introduced traits under these or other conditions could not be ruled out by our experiments.

Figure 4. Fecundity and larval hatch rates of the transgenic strains.

Fecundity of single females of the homozygous strains ScoGFP-CP, ScoGCFP-AP2, and Aper1-ScoGFP(A) and the markerless strains Sco-CP, ScoG-AP2, and Aper1-Sco (B) compared to the G3 wild-type and the vasa-Cre strain (KIL background) used to remove the marker module. All data sets have a Gaussian distribution according to the Shapiro–Wilk test, except of ScoGFP-CP. p-values were calculated using the unpaired two-tailed Student’s t-test. Larval hatch rates of the transgenic strains with (C) and without (D) the marker, as well as the control strains, from the eggs above. None of the data sets showed a Gaussian distribution according to Shapiro–Wilk, and the p-values were calculated using the Kolmogorov–Smirnov test. Data from three biological replicates were pooled and the total number analysed is indicated on top. Mean with standard error (SEM) were plotted. *p≤0.05, **p≤0.01, ***p≤0.001, ****p≤0.0001, and ns: not significant.

Figure 4.

Figure 4—figure supplement 1. Pupal sex ratio.

Figure 4—figure supplement 1.

Pupal sex ratio of the homozygous markerless transgenic strains Sco-CP, Sco-AP2, and Aper1-Sco compared to the G3 wild-type control. No statistically significant deviation from an expected 1:1 sex ratio was detected (chi-squared test). Error bars represent standard deviation from three biological replicates and the number of counted individuals is indicated (n).

P. falciparum transmission blocking assays

We assessed the effect of these modifications on mosquito infection with the P. falciparum NF54 strain by feeding transgenic female mosquitoes on in vitro cultured gametocytes using a standard membrane feeding assay (SMFA) (Habtewold et al., 2019). Both the alteration and/or reduction of host gene expression as well as the expression of Scorpine could independently or jointly change the transmission characteristics of these strains. The number of oocysts in the midgut of wild-type and the homozygous transgenic strains was quantified 7 days pbf (Figure 5). We did not observe a significant effect on oocyst intensity or prevalence in strains ScoGCFP-AP2 and Aper1-ScoGFP (Figure 5A). This indicated that the observed reduction in host gene expression that occurs in both these strains had no substantial effect on transmission. Interestingly, strain ScoGFP-CP showed a significant reduction in infection prevalence (IP; Figure 5A), whereas strain Sco-CP, on the contrary, showed a significantly enhanced level of infection by both measures (Figure 5B). It had previously been shown that expression of two A. gambiae carboxypeptidase B genes is upregulated upon P. falciparum infection and that antibodies against one of them blocked parasite development (Lavazec et al., 2007). Similarly, antibodies against the A. stephensi carboxypeptidases A and B significantly reduced the oocyst number of P. berghei and P. falciparum, respectively (VenkatRao et al., 2017; Raz et al., 2013). A decrease in CP protein levels in strain ScoGFP-CP could possibly explain the observed effect. Strains ScoG-AP2 and Aper1-Sco both showed a significant reduction in P. falciparum oocysts (Figure 5B) in both IP and intensity.

Figure 5. Transmission blocking assay.

Figure 5.

Standard membrane feeding assay with P. falciparum using the homozygous transgenic strains with the marker-module (A) and the corresponding markerless strains (B). Infection intensity is measured by the number of oocysts in each gut and the mean (m, blue bar) and median (M, red bar) are shown on top, as well as the infection prevalence (IP). The statistical significance of the infection intensity (stars above the bar) and IP (below) were calculated with the Mann–Whitney test and the chi-squared test, respectively. N is the number of mosquitoes analysed and R is the number of replicates performed. (C) Analyses of data plotted in (B) via a generalized linear mixed model (GLMM). The variation in the fixed effect estimates for each replicate (squares) and all replicates (diamonds) are shown as forest plots (95% confidence interval, glmmADMB). The square size is proportional to the sum of midguts analysed in each replicate. *p≤0.05, **p≤0.01, ***p≤0.001, ****p≤0.0001, and ns: not significant.

Analysis of non-autonomous gene drive induced by intronic gRNAs

Finally, we sought to determine whether the modified alleles of the three mosquito genes functioned as gene drives in the germline when provided with a source of Cas9 in trans (Figure 6). For these experiments we relied predominantly on the strains carrying the fluorescent marker in order to assay homing by scoring fluorescence in the progeny. We assessed homing efficiency of each gRNA under the control of the U6 promoter within the intron, by crossing the ScoGFP-CP, ScoGCFP-AP2 and Aper1-ScoGFP strains to a vasa-Cas9 strain (E. Marois, unpublished). Transhemizygote individuals were subsequently crossed to the wild type and the rate of fluorescent progeny was recorded (Figure 6A). From this analysis, we excluded progeny that had also inherited Cas9 linked to the 3xP3-YFP marker (Figure 6—figure supplement 1). We calculated a homing rate of 95.95% and 98.43% for ScoGFP-CP and Aper1-ScoGFP, respectively, with slightly higher rates in females compared to males (Table S5). Strain ScoGCFP-AP2 reached a homing efficiency of 87.78%. Although this locus showed higher variability around the ATG insertion site in the Ag1000G data set (The Anopheles gambiae 1000 Genomes Consortium, 2017), no SNPs had been detected in 24 sequenced individuals from the G3 lab colony (Bioproject Accession PRJNA397539, 2017) suggesting that the lower homing rate was unlikely to be related to pre-existing resistant alleles.

Figure 6. Assessment of non-autonomous gene drive of the modified host genes.

(A) Homozygous individuals of strains ScoGFP-CP, ScoGCFP-AP2, and Aper1-ScoGFP or the markerless strain ScoG-AP2 were crossed to the vasa-Cas9 strain to assess the homing potential induced by the intronic guide RNAs. As a control, hemizygous individuals lacking Cas9 were crossed to WT. In each case homing was measured by the rate of fluorescent larvae recorded in the progeny with the exception of the markerless strain ScoG-AP2 where it was assessed via PCR genotyping of the progeny. Mean and standard deviation from three biological replicates are plotted and the number n is indicated at the base of the column. All comparisons to the control crosses were significant (p<0.0001, chi-squared test). The data is found in Supplementary file 1 - Supplementary Tables 4 and 5. (B) Amplicons from PCRs over the predicted cut site within the three loci performed on pooled progeny from transhemizygotes were subjected to next generation sequencing. G3 and Cas9 served as controls. Overall number of reads and the percentage of reads with modifications in the quantification window are shown. (C) Predicted classes of modifications represented within the set of modified alleles for each locus.

Figure 6.

Figure 6—figure supplement 1. Crossing schemes for assessing the homing rate.

Figure 6—figure supplement 1.

The non-homing vasa-Cas9-line carrying a 3xP3-YFP marker was crossed to the transgenics. If homing in the germline occurs, all progeny from the cross to WT will be heterozygotes. In the case of ScoGFP-CP (A) and Aper1-ScoGFP (B), only the larvae without the Cas9 transgene can be considered for the assessment due to an overlap of fluorescent markers in the GFP channel, whereas for ScoGCFP-AP2 (C) there was no overlap. ScoG-AP2 (D) was screened via PCR.
Figure 6—figure supplement 2. Analysis of rearrangements in ScoGCFP-AP2 following Cas9 cleavage.

Figure 6—figure supplement 2.

(A) PCR analysis of non-fluorescent offspring from the cross of ScoGCFP-AP2/Cas9 to WT with primers spanning the locus showed that the majority of individuals carries an insertion but of reduced size. (B) Sequencing result of the upper band identified in the PCR suggests the loss of the intron cassette as well as a recombination and sequence exchange between the GFP and CFP coding sequences.
Figure 6—figure supplement 3. Sanger sequencing of non-fluorescent individuals.

Figure 6—figure supplement 3.

(A) Schematic explaining the difference between the sets of chromosomes subjected to Sanger and amplicon sequencing. (B) Non-fluorescent individuals identified in the homing assay (shown in grey in the first pie chart) were subjected to PCR over the gRNA target side, the products were Sanger sequenced, and the results were categorized (shown in the final pie chart). The corresponding alignments are shown to the right, with the number of occurrences of each particular sequence indicated in the coloured square before the sequence. Only few non-fluorescent individuals were found for ScoGFP-CP and Aper1-ScoGFP and they were not genotyped further for the construct. ScoGCFP-AP2 had to be genotyped by multiplex PCR first, which either gave bands for the WT and the construct or for the WT only. Subsequently only individuals that solely gave the WT band were sequenced. SNPs are indicated with a grey background, deletions are represented by a dash, and the red dashed line indicates the cleavage site.

We next sought to compare this result to the homing of a corresponding minimal genetic modification of the same locus. Homozygous strain ScoG-AP2 was crossed to the vasa-Cas9 strain and transhemizygotes were crossed to wild-type individuals. For this experiment, where homing could not be tracked via a fluorescent marker, the resulting progeny was assessed via PCR genotyping with primers amplifying the Scorpine coding sequence. We recorded near complete homing of 94.98% with 503 out of 516 individuals inheriting the ScoG-AP2 allele. Although this is an improvement compared to the homing rate of ScoGCFP-AP2, the difference was not statistically significant (t-test, p=0.1670).

To better understand the reason for the lower homing rate of ScoGCFP-AP2, we analysed non-fluorescent individuals by PCR genotyping (Figure 6—figure supplement 2A). For 73 out of 93 individuals analysed by primers spanning the entire locus, we detected a 2 kb product in addition to the wild-type amplicon. Sequencing of these PCR products suggested the construct was present but carried rearrangements that were likely the result of recombination between the GFP and CFP coding sequences (Figure 6—figure supplement 2B). Of the remaining individuals, half showed wild-type configuration at the guide RNA target site (Figure 6—figure supplement 3B). For ScoGFP-CP, 92% of the sequenced non-fluorescent individuals carried point-mutations or indels (Figure 6—figure supplement 3B). In contrast, all nine non-fluorescent individuals obtained from the experiments with Aper1-ScoGFP retained the wild-type configuration (Figure 6—figure supplement 3B).

To obtain more quantitative data, we performed PCR amplicon sequencing using DNA from pooled offspring of the crosses of transhemizygotes with the wild type (Figure 6B). This experiment was designed to detect all possible modifications of the target sites, including those caused by zygotic/embryonic Cas9 activity either due to maternal or paternal deposition or, alternatively, due to somatic Cas9 activity in the 50% of the progeny that also inherit the Cas9 transgene. Embryonic Cas9 activity has previously been reported for the vasa promoter (Hammond et al., 2016). As the background we used the Cas9 strain and the G3 wild type and found that at all three loci the percentage of alleles called as variant was below 5% in these controls. In the progeny from crosses with female transhemizygotes the percentage of modified alleles was 73.7% for CP, 92.5% for AP2%, and 86.0% for Aper1, suggesting a significant effect of maternal deposition of Cas9 as previously described for the vasa promoter (Papathanos et al., 2009). In the offspring deriving from male transhemizygotes, we observed that the percentage of modified reads in the progeny was substantially lower than in the female cross but higher than expected if no Cas9 carryover or somatic Cas9 activity is assumed to occur. We observed 17.2% modified reads for CP (against an expected maximum of 7.3% if activity is assumed to be restricted to the germline), 37.2% for AP2 (expected 7.6%), and 11.2% for Aper1 (expected 1.7%), suggesting embryonic Cas9 activity occurs in all three strains. For all three loci small indels at the cut site and towards the 5’ end of the gRNA were found to be responsible for the majority of modifications. When we predicted how these would affect the host gene coding potential, we observed distinct outcomes for the three loci (Figure 6C). At the CP locus, cleavage occurs upstream of the start codon and the majority of modifications were predicted to only affect the 5’ UTR. In contrast, cleavage of AP2 occurs downstream from the ATG, and hence the modifications observed are predicted to result in frameshifts or the loss of the start codon. Interestingly, the reciprocal crosses showed varying patterns of repair suggesting the different repair characteristics in the male and female germline or, alternatively, when repair follows after either germline or zygotic cleavage. At the Aper1 locus the effector was fused to the 3’ end of the coding sequence and the cut site is located 19 bases upstream of the STOP codon. The observed in-frame mutations (22.2% and 31.4% in paternal and maternal crosses) would thus affect the terminal amino acids only, whereas frameshifts were predicted to lead either to a premature STOP or to the addition of 25 (frame +2) or 26 (frame +3) C-terminal amino acids.

Discussion

We modified three different genomic loci of Anopheles gambiae via precise CRISPR/Cas9 homology-directed integration and subsequently removed the fluorescent transformation marker cassettes using Cre recombinase mediated excision. A range of methods for the generation of markerless, genetically modified organisms have been developed in plants (Tuteja et al., 2012). The presence of such marker genes in genetically modified plants, and subsequently in food, feed, and the environment, are of concern and thus subject to special government regulations in many countries. With the view that gene drive strains that are currently being developed could eventually be deployed in the field and will have to undergo a stringent regulatory pathway and also that marker genes are known to induce negative fitness costs (Catteruccia, 2003; Scolari et al., 2011), the generation of minimal genetic modifications is paramount for moving gene drives towards application. Combinations of multiple gene drives and multiple fluorescent markers would not only compound such costs but also reduce the usefulness of such markers. Furthermore, the predictive power of marker genes in the field will be low as gene drives can decouple from any linked marker cassettes within one generation (Oberhofer et al., 2018). We show for the first time how markerless gene drive traits can be constructed in the malaria mosquito.

Unexpectedly, all strains including those carrying the entire fluorescent marker genes were found to be homozygous viable and fertile and showed no noticeable fitness cost during standard rearing conditions. We had assumed that removal of the marker would be necessary to avoid negative fitness consequences. RNA expression analysis indicated that host gene expression had indeed been reduced in strains ScoGCFP-AP2 and Aper1-ScoGFP carrying the full insert including the fluorescent marker. The lack of a severe fitness phenotype could be explained by a certain redundancy in the set of digestive enzymes, proteases, phosphatases, and peritrophins expressed in the mosquito midgut, at least under standard rearing conditions, including a high-quality source of human blood. Alternatively, even low mRNA levels in conjunction with correct splicing of the entire intron including the marker cassette (we found evidence for this occurring in all three genes) could have led to protein expression at a sufficient level, so as not to result in a measurable impact on fitness in our experiments. Following the removal of the marker gene, we found exact splicing of the artificial intron and a significantly restored expression level of the host genes carrying now only a minimal transgene insert. It remains to be seen whether the reduced levels of restored expression (in the case of AP2 and Aper1 loci) observed in this study would preclude the use of essential genes, for which expression levels are particularly critical, to host such IGDs. The intron we used derives from the Drosophila melanogaster homeobox transcription factor fushi tarazu (ftz), essential for early embryonic development, and we have separately found that a full gRNA transgene can be hosted within this intron (A. Nash, unpublished). Our data here show similarly that the ftz intron can accommodate the A. gambiae U6 promoter and the gRNAs for gene drive and remains functional in mosquitoes. The splicing was precise, in the context of Scorpine (AAG-intron-GG) or when inserted within Scorpine::GFP (AGG-intron-AG).

When we provided a source of Cas9 in the male or female germline, all gRNA transgenes were able to induce high rates of gene drive of the modified loci. In the case of the AP2 locus we found that the minimal, markerless locus ScoG-AP2 outperformed ScoGCFP-AP2 although the difference was not statistically significant. We observed aberrant homing events in the latter strain, which are likely the result of recombination between the GFP and CFP coding sequences. This highlights that simple genetic modifications may avoid undesired interactions found in more complex constructs. The interaction of multiple gene drives (sharing homologous sequences such as marker genes, promoters, or terminators) could thus lead to unintended outcomes in the field. We analysed the formation of resistance alleles at these loci, including due to zygotic activity of vasa-driven Cas9 and predicted distinct outcomes for the three target loci. While the host genes we targeted appeared not to be essential (with the above-mentioned caveat that sufficient expression may occur in our strains), clear negative effects may result from their genuine disruption, for example, by a frameshift. Further studies for example in population cages could thus reveal to what degree the observed classes of mutations (Figure 6C) would be actively selected against, which could favour faithful transmission of the drive (Nash et al., 2019).

While our amplicon-sequencing experiments could not distinguish paternal carryover of Cas9 from somatic Cas9 activity (in male individuals inheriting both transgenes) they clearly showed an increased level of non-homologous repair in progeny of transhemizygous females indicating maternal deposition of Cas9. It should be noted that non-autonomous effectors as described herein could also be combined with male-specific gene drives, suppressive gene drives, or autonomous rescue drives and the rate of the formation and the dynamic of selection of functional resistance alleles via these routes would be quite different in each case.

The regulatory regions of the target genes CP and Aper1 had previously been used for mosquito transgenesis (Ito et al., 2002; Abraham et al., 2005). In contrast, AP2 chosen due to differential expression in the female gut has not been described before. In our experiments, overall expression from the AP2 locus was found to be lower when compared to the CP and Aper1 loci, suggesting that AP2 would be less suitable as a host gene or as a transgene promoter for the expression of effectors requiring high levels of protein. Strong promoters might not always be ideal however, as the dosage-response curves of other types of effectors (e.g. transcriptional modulators) may vary significantly. Together, our data on all three genes suggest, that co-opting the regulatory elements of endogenous loci directly without prior labour-intensive characterization is a viable approach. Since there is a dearth of promoters for many infection-relevant tissues (e.g. salivary glands, hemocytes, fatbody) we propose that this characterization step could be avoided, and effector molecules directly incorporated into mosquito gene loci and mobilized by non-autonomous drive. Existing RNA expression data sets or enhancer-trap screens could provide a starting point for the identification of suitable genes (Reid et al., 2018).

Our analysis suggested however that modifications of endogenous genes could also have unforeseen consequences. In the case of the zinc carboxypeptidase A1 (CP) we found that modifications of the host gene are possibly directly altering infection outcomes. The reduced infection level observed for strain ScoGFP-CP which may express reduced levels of CP protein is consistent with reports on other carboxypeptidase genes (Lavazec et al., 2007; VenkatRao et al., 2017; Raz et al., 2013). In contrast, strain Sco-CP showed a positive effect on infection levels which would suggest that the CP locus may not be a suitable host gene for gene drives. We observed a reduction in P. falciparum transmission in strains ScoG-AP2 and Aper1-Sco. Scorpine was chosen for our study as a prototypical effector molecule as it had previously shown to reduce Plasmodium survival (Fang et al., 2011; Bongio and Lampe, 2015; Wang et al., 2012; Shane et al., 2018) when expressed via different routes in the mosquito midgut. We do not know how the midgut environment affects the stability of the lysine-rich Scorpine and detection of even the host proteins in these samples proved challenging. Modifications of the host genes AP2 and Aper1 did not appear to alter infection levels on their own. Overall, the moderate effects on transmission we observed in different directions in our pilot experiments call for further transmission studies with these strains in order to determine whether Scorpine has realistic potential as an antimalarial effector and to lend further support to the suitability of the three mosquito genes to host effector molecules. Both genetic polymorphisms linked to the modified loci in these inbred strains and the effect of the genetic backgrounds used could influence parasite infections and our transmission experiments were not designed to distinguish these effects in full. In addition, determining the effect of these modifications on later stages of parasite development using molecular assays is also necessary to obtain a full picture. Strains ScoGFP-CP, ScoGCFP-AP2, and Aper1-ScoGFP could also be used to isolate defined NHEJ and knockout alleles of the three host genes and characterize their effect on transmission and fitness. Such studies will likely be required to demonstrate the safety of this and related approaches utilizing host genes.

In summary, the strategy and the selected host genes we have presented here now allow for the insertion and direct comparisons of a range of different antimalarial effectors either by secretion into the gut or by anchoring these molecules in the peritrophic matrix. Such effectors traits would then by design, as we have shown here, be capable of efficient non-autonomous gene drive. The most significant knowledge gap, when it comes to transmission blocking modifications of transgenic mosquitoes, is to what degree any effects observed with lab strains of P. falciparum would be reproducible using circulating parasites isolated from patient blood or would hold up under actual field conditions which could wildly differ from standard laboratory experiments. The integral effector designs we have described could be valuable tools to attempt to answer this question. Since they are incapable of autonomous gene drive, they can be established and maintained, unlike suppression drives where this is impossible, as true-breeding strains. Importantly, the regulatory burden for importing or generating such non-autonomous effector strains would be expected to be much lower than strains capable of full gene drive and hence, simplify advanced stages of testing against polymorphic isolates of the P. falciparum parasite in the endemic setting without the need for strict containment or geographical isolation. When combined with other strains capable of autonomous Cas9 gene drive, whether designed for replacement or suppression, these strains could then, without the need for any further genetic modification, be deployed to contribute to reducing malaria transmission by mosquito populations in the field.

Materials and methods

Plasmids and primers

The two intermediate plasmids pI-Scorpine-GFP and pI-Scorpine were generated first to allow insertion of different gRNAs. We synthesized the Scorpine coding sequences, optimized for codon usage in Anopheles gambiae and retained the endogenous secretion signal from Scorpine. The GFP containing the intron, based on the Drosophila melanogaster fushi tarazu (ftz) intron, was amplified from pQUAST 3xP3 CFP (A. Nash, unpublished) and the U6 promoter was exchanged for the A. gambiae U6 spliceosomal RNA (AGAP013695) promoter from p165 (Hammond et al., 2016). The EGFP marker module within pI-Scorpine has a 3xP3 promoter and a SV40 terminator and was amplified from p163 (Hammond et al., 2016). The intron within Scorpine was inserted between the nucleotides AAG and G to promote splicing (Schwartz et al., 2008). We used the Drosophila splice predictor (Reese et al., 1997) to remove cryptic splice sites. A furin cleavage site and the F2A peptide (Galizi et al., 2014) were generated via annealing long oligos (53-F2A-long-F and 54-F2A-long-R). Some of the initial fragments were pre-fused via overlap extension PCR in order to minimize the number of parts to be fused for the final Gibson assembly.

The guide-RNAs for the three target loci were generated via the annealing of oligonucleotides 39-CP-gRNA59-F and 40-CP-gRNA59-R, 57-AP2-gRNA53-F and 58-AP2-gRNA53-R, as well as 75-Per1-gRNA48-F and 76-Per1-gRNA48-R. They were then inserted into the intermediate plasmids through the BbsI restriction site via Golden gate cloning. Subsequently, each effector-cassette hosting the corresponding gRNA was PCR-amplified from the resulting intermediate plasmids. For the donor plasmids pD-ScoG-AP2 and pD-Sco-CP, the cassette was amplified with primers 51-Scorpine-F and 6-F2A-R, and for pD-Aper1-Sco with primers 77-Arg-GSG-Arg-Scorp-F and 97-Scorpine-STOP-R.

The 5’ and 3’ homology arms of approximately 800 bp were amplified from A. gambiae G3 genomic DNA. All amino acid changes detected in protein sequences were confirmed via the variation data available in http://www.vectorbase.org, except the A23V amino acid change in the secretion signal of the CP CDS. Nevertheless, this variant was observed in the 24 individual G3 sequencing reads. The guide RNA target present in the Aper1 CDS 5’ of the STOP codon was destroyed by re-coding five wobble bases in the Aper1 5’ homology arm.

The donor plasmids pD-Sco-CP, pD-ScoG-AP2, and pD-Aper1-Sco (full plasmid sequences are provided in Supplementary file 2) were generated by assembling the homology arms, the cassette, and the backbone containing an additional 3xP3-DsRed marker module. All plasmids were propagated in Agilent Sure cells in order to avoid recombination. All gRNAs were evaluated in vitro with the Guide-it sgRNA In Vitro Transcription and Screening Kit (Takara), and were found to cleave efficiently. For primers and plasmids see Tables S2 and S3.

Microinjections and establishment of transgenic strains

Plasmids were isolated with the ZymoPURE II Plasmid Maxiprep kit (Zymo Research) and 300 ng/µL of the donor plasmid and the p155-helper-plasmid (Hammond et al., 2016) were microinjected into 30–45 min old eggs of A. gambiae G3 as described previously (Lobo et al., 2006). Hatching larvae were screened for transient expression of the fluorescent marker and the adults crossed to wild type. The G1 offspring were screened for the presence of the GFP (ScoGFP-CP, Aper1-ScoGFP) or CFP (ScoGCFP-AP2) markers and we selected against red fluorescence in the eyes, in order to exclude individuals with possible plasmid backbone integration events, which was only observed in the case of ScoGCFP-AP2. G1 transgenic founder adults were backcrossed to WT and following egg laying, they were sacrificed and genomic DNA isolated. We performed PCR over the 5’ and 3’ insertion points using one primer binding within the construct and another primer binding the flanking genome sequence beyond the homology arms. Genomic DNA was obtained from single adults via crushing them in 100 µL of 5% w/v Chelex100-resin beads (BioRad Inc) in water and 4 µL of 600 U/mL Proteinase K. After incubation at 55°C and 750 rpm for 2 hr, the Proteinase was heat inactivated at 99°C for 10 min, the samples were centrifuged at maximum speed for 3 min and the supernatant was transferred to a new tube. All genotyping was performed with RedTaq Polymerase (VWR). Since for ScoGCFP-AP2 we obtained more than 100 G1 transgenic founder individuals, 12 single crosses were prepared in cups and offspring pooled after the parents were genotyped by sequencing. The transgenic founders confirmed by sequencing were subsequently outcrossed to G3 WT for three generations.

Establishment of pure breeding transgenic strains

ScoGFP-CP and ScoGCFP-AP2 were rendered homozygous by setting up sibling crosses over two generations and letting the adults hatch single in cups and genotyping their left-over pupal cases by PCR over the entire locus to distinguish WT, heterozygotes, and homozygotes. ScoGFP-CP was genotyped with primers 99-CP-locus-F and 100-CP-locus-R, and ScoGCFP-AP2 with 101-AP-locus-F and 102-AP-locus-R. In the case of Aper1-ScoGFP, the transgene was crossed to the vasa-Cas9 strain and kept in this background for several generations to allow for homing to occur. The YFP marker is also visible in the green channel and was removed via screening for orange fluorescence (excitation 515–545 nm, emission 585–670 nm). Afterwards, crosses of four females to four males were prepared in cups and the parents were genotyped with 230-Per-short-F and 231-Per-short-R after egg laying. gDNA was isolated from pupae cases in 20 µL dilution buffer of the Phire Tissue Direct PCR Master Mix kit (Thermo Scientific).

Removal of the marker module using Cre recombinase

Markerless strains were generated by removal of the fluorescent marker cassette through crossing to strain C2S (Volohonsky et al., 2015) expressing Cre recombinase driven by the vasa promoter and carrying a 3xP3-DsRed marker (herein referred to simply as Cre). The Cre transgene is on the third chromosome and resides in a KIL background and contains an additional 3xP3-CFP marker (Meredith et al., 2011). In the case of CP, the crossing scheme was initiated with homozygous ScoGFP-CP individuals and offspring showing green fluorescence and red fluorescence for Cre were kept for setting up a siblings-cross (Figure 2B). CFP and GFP fluorescence were separated by employing a narrower CFP filter-set (ET436/20x, ET470/24m, T455LP, Chroma) and a YFP filter-set (ET480/20x, ET525/50m, T495LP, Chroma). The progeny from ScoGFP-CP/Cre were screened against the presence of both markers and 88 larvae still containing the Cre transgene were discarded. There were no more larvae with GFP present, and hence, the efficiency of the Cre-mediated excision was 100%. Thirty-two individuals were markerless and were genotyped by PCR on pupal cases for homozygosity. Seven individuals were homozygous, three hemizygous, one WT, and 18 PCRs had unclear results. A pure breeding strain was established immediately afterwards from three male and two female markerless founders.

In the case of Aper1, the crossing scheme was started with hemizygous Aper1-ScoGFP individuals and the offspring were screened for GFP and DsRed (Figure 2—figure supplement 1A). After crossing them to the Cas9 strain, the progeny was screened against CFP to remove the Cre transgene but were kept in this Cas9 background over several generations, in order to make them homozygous. After removal of the Cas9 transgene, 13 final founders were confirmed via genotyping and sequencing over the lox-out, as well as over the 5' and 3' insertion sites and no imprecise homing events were detected.

In the case of the AP2, we used homozygous ScoGCFP-AP2 individuals to initiate the cross to the Cre strain which also features the 3xP3-CFP marker (Figure 2—figure supplement 1B). Offspring exhibiting both blue and red fluorescence were crossed to the vasa-Cas9 strain and the progeny were screened for the presence of YFP and the absence of the CFP and DsRed markers, which would maintain the markerless transgene in a Cas9 background. One further generation was propagated without screening in order to let the cassette home. Subsequently, individuals were genotyped using pupal cases and homozygotes were crossed to homozygous ScoGCFP-AP2 individuals. The offspring were screened against the presence of the YFP to select against Cas9. Finally, ScoG-AP2/ScoGCFP-AP2 mosquitoes were crossed, and the progeny were screened against CFP. The majority of them was also genotyped after egg laying.

RT-PCR and qPCR

WT and transgenic mosquitos were dissected 3 hr after blood-feeding and tissue from 10 to 15 guts homogenized with glass beads for 30 s at 6,800 rpm in a Precellys 24 homogenizer (Bertin) in Trizol. RNA was extracted with the Direct-zol RNA Mini-prep kit (Zymo Research) and converted into cDNA with the iScript gDNA Clear cDNA Synthesis Kit (Bio-Rad). RT-PCR was performed with RedTaq Polymerase (VWR) with primers 51-Scorpine-F and 117-CP-ctrl-R on Sco-CP (1267 bp), 160-Sco-probe-F and 6-F2A-R on ScoG-AP2 (1089 bp), and 105-Per-locus-F and 162-Sco-probe-2-R on Aper1-Sco (533 bp). The products were subsequently evaluated via gel-electrophoresis and sequencing (Genewiz). RNA was isolated from three biological replicates and qPCRs were performed in triplicates for each with the Fast SYBR Green Master Mix (Thermo Scientific) on an Applied Biosystems 7500 Fast Real-Time PCR System. Expression was normalized to the S7 reference gene and analysed using the ΔΔCt method. The following primer pairs were used for the qPCRs: 270-q-CP-F1 and 271-q-CP-R3, 450-qAP-F3 and 451-qAP-R3, 274-qPer1-F and 275-qPer1-R, as well as 213-q-Sco1-F1 and 523-qSco1-all-R (Table S2).

Mass spectrometry

Following dissection, 25 non-blood-fed guts of strains Sco-CP or Aper1-Sco were put in 50 µL CERI reagent from the NE-PER Nuclear and Cytoplasmic Extraction kit (Thermo Fisher Scientific), homogenized with a motorized pestle and cytoplasmic proteins were extracted according to the manufacturer's protocol. The supernatant was filtered through a 100 kDa or 50 kDa Amicon Ultra-0.5 Centrifugal Filter Unit for Sco-CP and Aper1-Sco, respectively. Trypsin-digest and LC-MS/MS were performed at the Advanced Mass Spectrometry Facility at the University of Birmingham.

Fitness assays

For fecundity and fertility assays, single blood-fed females were transferred to cups containing water, lined with filter paper. Females that failed to lay eggs or produce larvae were dissected and excluded from the analysis if no sperm was detected in their spermatheca. Eggs and L1 larvae from at least 10 technical replicates were counted and the data from three biological replicates were pooled. All biological replicates passed the Shapiro–Wilk normality test, except of ScoGFP-CP. To determine the pupal sex ratio, between 100 and 140 L1 larvae per tray were reared to the pupal stage and sexed. Data from three biological replicates was pooled and analysed for deviations from the expected 1:1 sex ratio via the chi-squared test.

Plasmodium falciparum standard membrane feeding assay

Infections of mosquitoes using the streamlined standard membrane feeding assay were performed as described previously (Habtewold et al., 2019). Briefly, mosquitoes were fed for 15 min at room temperature using an artificial membrane feeder with a volume of 300–500 µL of mature Plasmodium falciparum (NF54) gametocyte cultures (2–6% gametocytaemia). Afterwards, mosquitoes were maintained at 26°C with 70–80% relative humidity. For 48 hr after the infective meal, mosquitoes were deprived of light and starved without fructose, in order to eliminate the unfed mosquitoes. Midguts were dissected after 7 days and oocysts were counted to calculate mean and median (including all zero values). Statistical significance was calculated using the non-parametric Mann–Whitney test for oocyst load (infection intensity) and the chi-squared test for oocyst presence (IP) with GraphPad Prism v7.0. The generalized linear mixed model (GLMM) was used to determine statistical significance of oocyst infection intensity for each independent biological replicate. GLMM analyses were performed in R (version 1.2.5019) using the Wald Z-test on a zero-inflated negative binomial regression (glmmADMB). The different strains were considered as covariates and the replicates as a random component. Fixed effect estimates are the regression coefficients.

Assessment of the homing rate

Homozygous individuals of strains ScoGFP-CP, ScoGCFP-AP2, and Aper1-ScoGFP were crossed to the vasa-Cas9 strain carrying the 3xP3-YFP marker (chromosome 2) and the offspring were screened for the presence of orange fluorescence (Figure 6—figure supplement 1, Table S5). Subsequently, the transhemizygotes were sexed and crossed to G3 wild type. Offspring deriving from the ScoGCFP-AP2 cross were screened for the presence of CFP, whereas progeny deriving from ScoGFP-CP and Aper1-ScoGFP were selected based on the absence of orange fluorescence and the presence of the GFP marker.

As controls, hemizygous individuals of each strain were crossed to WT, and the rate of fluorescent larvae was calculated. For the calculation of the homing rate e, we used the individuals negative for the effector (Eneg), the number n, and the Mendelian distribution of 50% as a baseline, as follows:

e = (n*0.5 – Eneg) / (n*0.5) *100.

In order to evaluate the homing rate of the markerless AP2-line, transhemizygote offspring were screened by PCR for the presence of the construct. gDNA was isolated with Chelex beads (BioRad) or the dilution buffer of the Phire Tissue Direct PCR Kit (Thermo Scientific). For the first biological replicate, PCRs were performed with primers 101-AP-locus-F and 102-AP-locus-R, and in case of doubt another PCR with primers 160-Sco-probe-F and 161-Sco-probe-R was performed. For the second and third biological replicates, a multiplex PCR with primers 101-AP-locus-F, 102-AP-locus-R, and 273-qSco1-R2 was conducted.

Non-fluorescent individuals were subjected to PCR with primers 99-CP-locus-F and 100-CP-locus-R, 101-AP-locus-F and 102-AP-locus-R, and 230-Per1-short-F and 231-Per1-short-R, respectively. CRISP-ID (Dehairs et al., 2016) was used for the deconvolution of the Sanger-sequencing chromatograms.

Amplicon sequencing

Genomic DNA was extracted from all offspring of the respective homing crosses at larval stage L2 with the Monarch Genomic DNA Purification Kit (NEB). The vasa-Cas9 strain and G3 wild type were included as controls. PCRs over the insertion sides using Q5 High-Fidelity DNA Polymerase (NEB) were performed with primers 99-CP-locus-F and 524-CP-ampli-R (443 bp amplicon), A3-AP-PrimerC-F and 221-q-AP-R (488 bp amplicon), as well as 230-Per-short-F and 231-Per-short-R (303 bp amplicon). An extension time of 5 s was chosen to exclude transgene amplification. Annealing temperature and cycle number were set to 66°C and 25 for CP and Per, and 60°C and 29 for AP, respectively. Amplicons were purified with QIAquick PCR Purification Kit (Qiagen) and submitted to Amplicon-EZ NGS (Genewiz). Raw NGS data (SRA accession PRJNA701314) were analysed with CRISPResso2 (Clement et al., 2019) and the minimum average read quality score (phred33) was set to 30.

Acknowledgements

We thank Alexander Nash, Roberto Galizi, and Andrew Hammond for template-constructs, George Avraam and Olivia Bates for help with line maintenance, Claudia Wyer for technical assistance, Louise Marston and Carla Siniscalchi for injection training, Eric Marois for sharing the vasa-Cre-line and vasa-Cas9-line, and Jinglei Yu from the Advanced Mass Spectrometry Facility at the University of Birmingham. This work was funded by the Bill and Melinda Gates Foundation grant OPP1158151 to NW and GKC.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Nikolai Windbichler, Email: nikolai.windbichler@imperial.ac.uk.

Philipp W Messer, Cornell University, United States.

Diethard Tautz, Max-Planck Institute for Evolutionary Biology, Germany.

Funding Information

This paper was supported by the following grant:

  • Bill and Melinda Gates Foundation OPP1158151 to George K Christophides, Nikolai Windbichler.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Investigation, Visualization, Writing - original draft, Writing - review and editing.

Data curation, Formal analysis.

Resources, Data curation.

Resources, Data curation, Investigation.

Data curation.

Conceptualization, Data curation, Supervision, Funding acquisition, Project administration, Writing - review and editing.

Conceptualization, Supervision, Funding acquisition, Visualization, Writing - original draft, Project administration, Writing - review and editing.

Conceptualization, Resources, Supervision, Funding acquisition, Investigation, Visualization, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Supplementary file 1. Supplementary Tables S1–S7.

Table S1. Guide RNA and target site characteristics. The start codon of the target gene is indicated within the gRNA sequence (bold) and the protospacer adjacent motif (PAM) separated via a hyphen. All predicted off-target cleavage sites were found to be located in non-coding (NC) regions and the number of mismatches (MM) is indicated. The number of SNPs within the 24 individuals of the G3 strain and within the Ag1000G is indicated. The SNPs observed for CP in the Ag1000G did not pass the quality control. Table S2. Primers used in this study. Table S3. Plasmids used in this study. Table S4. Transmission rate of control-crosses without Cas9. Table S5. Transmission rates and homing rates. Epos and Eneg refer to individuals with or without the effector construct, respectively. The homing rate e was calculated as follows: e = (n*0.5–Eneg)/(n*0.5)*100. Table S6. Modified sequences identified in the Amplicon sequencing. Table S7. Raw data of the transmission blocking assay.

elife-58791-supp1.xlsx (39.9KB, xlsx)
Supplementary file 2. GenBank-DNA-files of donor plasmids pD-Sco-CP, pD-ScoG-AP2, and pD-Aper1-Sco.
elife-58791-supp2.zip (17.2KB, zip)
Transparent reporting form

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files. Amplicon-sequencing raw data have been deposited in SRA under accession code PRJNA701314.

The following dataset was generated:

Hoermann A. 2021. Amplicon-sequencing over the Anopheles gambiae CP, AP2 and Aper1 loci after non-autonomous gene drive. NCBI Sequence Read Archive. PRJNA701314

References

  1. Abraham EG, Donnelly-Doman M, Fujioka H, Ghosh A, Moreira L, Jacobs-Lorena M. Driving midgut-specific expression and secretion of a foreign protein in transgenic mosquitoes with AgAper1 regulatory elements. Insect Molecular Biology. 2005;14:271–279. doi: 10.1111/j.1365-2583.2004.00557.x. [DOI] [PubMed] [Google Scholar]
  2. Adolfi A, Gantz VM, Jasinskiene N, Lee HF, Hwang K, Terradas G, Bulger EA, Ramaiah A, Bennett JB, Emerson JJ, Marshall JM, Bier E, James AA. Efficient population modification gene-drive rescue system in the malaria mosquito anopheles stephensi. Nature Communications. 2020;11:5553. doi: 10.1038/s41467-020-19426-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Arik AJ, Hun LV, Quicke K, Piatt M, Ziegler R, Scaraffia PY, Badgandi H, Riehle MA. Increased akt signaling in the mosquito fat body increases adult survivorship. The FASEB Journal. 2015;29:1404–1413. doi: 10.1096/fj.14-261479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barzel A, Naor A, Privman E, Kupiec M, Gophna U. Homing endonucleases residing within inteins: evolutionary puzzles awaiting genetic solutions. Biochemical Society Transactions. 2011;39:169–173. doi: 10.1042/BST0390169. [DOI] [PubMed] [Google Scholar]
  5. Beaghton A, Hammond A, Nolan T, Crisanti A, Godfray HC, Burt A. Requirements for driving antipathogen effector genes into populations of disease vectors by homing. Genetics. 2017;205:1587–1596. doi: 10.1534/genetics.116.197632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bioproject Accession PRJNA397539 24 Individuals From the G3 Anopheles Gambiae Strain. 2017 https://www.ncbi.nlm.nih.gov/bioproject/PRJNA397539/
  7. Bokman SH, Ward WW. Renaturation of Aequorea green-fluorescent protein. Biochemical and Biophysical Research Communications. 1981;101:1372–1380. doi: 10.1016/0006-291X(81)91599-0. [DOI] [PubMed] [Google Scholar]
  8. Bongio NJ, Lampe DJ. Inhibition of plasmodium berghei development in mosquitoes by effector proteins secreted from Asaia sp. Bacteria using a novel native secretion signal. PLOS ONE. 2015;10:e0143541. doi: 10.1371/journal.pone.0143541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Burt A. Site-specific selfish genes as tools for the control and genetic engineering of natural populations. Proceedings of the Royal Society of London. Series B: Biological Sciences. 2003;270:921–928. doi: 10.1098/rspb.2002.2319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Catteruccia F. Impact of genetic manipulation on the fitness of anopheles stephensi mosquitoes. Science. 2003;299:1225–1227. doi: 10.1126/science.1081453. [DOI] [PubMed] [Google Scholar]
  11. Chaerkady R, Kelkar DS, Muthusamy B, Kandasamy K, Dwivedi SB, Sahasrabuddhe NA, Kim MS, Renuse S, Pinto SM, Sharma R, Pawar H, Sekhar NR, Mohanty AK, Getnet D, Yang Y, Zhong J, Dash AP, MacCallum RM, Delanghe B, Mlambo G, Kumar A, Keshava Prasad TS, Okulate M, Kumar N, Pandey A. A proteogenomic analysis of anopheles gambiae using high-resolution Fourier transform mass spectrometry. Genome Research. 2011;21:1872–1881. doi: 10.1101/gr.127951.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Champer J, Yang E, Lee E, Liu J, Clark AG, Messer PW. A CRISPR homing gene drive targeting a haplolethal gene removes resistance alleles and successfully spreads through a cage population. PNAS. 2020a;117:24377–24383. doi: 10.1073/pnas.2004373117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Champer J, Lee E, Yang E, Liu C, Clark AG, Messer PW. A toxin-antidote CRISPR gene drive system for regional population modification. Nature Communications. 2020b;11:1082. doi: 10.1038/s41467-020-14960-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Clement K, Rees H, Canver MC, Gehrke JM, Farouni R, Hsu JY, Cole MA, Liu DR, Joung JK, Bauer DE, Pinello L. CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nature Biotechnology. 2019;37:224–226. doi: 10.1038/s41587-019-0032-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Corby-Harris V, Drexler A, Watkins de Jong L, Antonova Y, Pakpour N, Ziegler R, Ramberg F, Lewis EE, Brown JM, Luckhart S, Riehle MA. Activation of akt signaling reduces the prevalence and intensity of malaria parasite infection and lifespan in anopheles stephensi mosquitoes. PLOS Pathogens. 2010;6:e1001003. doi: 10.1371/journal.ppat.1001003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. de Felipe P, Ryan MD. Targeting of proteins derived from self-processing polyproteins containing multiple signal sequences. Traffic. 2004;5:616–626. doi: 10.1111/j.1398-9219.2004.00205.x. [DOI] [PubMed] [Google Scholar]
  17. Dehairs J, Talebi A, Cherifi Y, Swinnen JV. CRISP-ID: decoding CRISPR mediated indels by Sanger sequencing. Scientific Reports. 2016;6:28973. doi: 10.1038/srep28973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Devenport M, Fujioka H, Jacobs-Lorena M. Storage and secretion of the peritrophic matrix protein Ag-Aper1 and trypsin in the midgut of anopheles gambiae. Insect Molecular Biology. 2004;13:349–358. doi: 10.1111/j.0962-1075.2004.00488.x. [DOI] [PubMed] [Google Scholar]
  19. Dinglasan RR, Devenport M, Florens L, Johnson JR, McHugh CA, Donnelly-Doman M, Carucci DJ, Yates JR, Jacobs-Lorena M. The anopheles gambiae adult midgut peritrophic matrix proteome. Insect Biochemistry and Molecular Biology. 2009;39:125–134. doi: 10.1016/j.ibmb.2008.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Dong Y, Das S, Cirimotich C, Souza-Neto JA, McLean KJ, Dimopoulos G. Engineered anopheles immunity to plasmodium infection. PLOS Pathogens. 2011;7:e1002458. doi: 10.1371/journal.ppat.1002458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Dong S, Fu X, Dong Y, Simões ML, Zhu J, Dimopoulos G. Broad spectrum immunomodulatory effects of anopheles gambiae microRNAs and their use for transgenic suppression of plasmodium. PLOS Pathogens. 2020a;16:e1008453. doi: 10.1371/journal.ppat.1008453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Dong Y, Simões ML, Dimopoulos G. Versatile transgenic multistage effector-gene combinations for Plasmodium falciparum suppression in Anopheles. Science Advances. 2020b;6:eaay5898. doi: 10.1126/sciadv.aay5898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Edwards MJ, Lemos FJ, Donnelly-Doman M, Jacobs-Lorena M. Rapid induction by a blood meal of a carboxypeptidase gene in the gut of the mosquito anopheles gambiae. Insect Biochemistry and Molecular Biology. 1997;27:1063–1072. doi: 10.1016/S0965-1748(97)00093-3. [DOI] [PubMed] [Google Scholar]
  24. Esvelt KM, Smidler AL, Catteruccia F, Church GM. Concerning RNA-guided gene drives for the alteration of wild populations. eLife. 2014;3:e03401. doi: 10.7554/eLife.03401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Fang W, Vega-Rodríguez J, Ghosh AK, Jacobs-Lorena M, Kang A, St Leger RJ. Development of transgenic fungi that kill human malaria parasites in mosquitoes. Science. 2011;331:1074–1077. doi: 10.1126/science.1199115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Galizi R, Doyle LA, Menichelli M, Bernardini F, Deredec A, Burt A, Stoddard BL, Windbichler N, Crisanti A. A synthetic sex ratio distortion system for the control of the human malaria mosquito. Nature Communications. 2014;5:3977. doi: 10.1038/ncomms4977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gantz VM, Jasinskiene N, Tatarenkova O, Fazekas A, Macias VM, Bier E, James AA. Highly efficient Cas9-mediated gene drive for population modification of the malaria vector mosquito anopheles stephensi. PNAS. 2015;112:E6736–E6743. doi: 10.1073/pnas.1521077112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Habtewold T, Tapanelli S, Masters EKG, Hoermann A, Windbichler N, Christophides GK. Streamlined SMFA and mosquito dark-feeding regime significantly improve malaria transmission-blocking assay robustness and sensitivity. Malaria Journal. 2019;18:24. doi: 10.1186/s12936-019-2663-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hafez M, Hausner G. Homing endonucleases: DNA scissors on a mission. Genome. 2012;55:553–569. doi: 10.1139/g2012-049. [DOI] [PubMed] [Google Scholar]
  30. Hammond A, Galizi R, Kyrou K, Simoni A, Siniscalchi C, Katsanos D, Gribble M, Baker D, Marois E, Russell S, Burt A, Windbichler N, Crisanti A, Nolan T. A CRISPR-Cas9 gene drive system targeting female reproduction in the malaria mosquito vector anopheles gambiae. Nature Biotechnology. 2016;34:78–83. doi: 10.1038/nbt.3439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hauck ES, Antonova-Koch Y, Drexler A, Pietri J, Pakpour N, Liu D, Blacutt J, Riehle MA, Luckhart S. Overexpression of phosphatase and tensin homolog improves fitness and decreases Plasmodium falciparum development in anopheles stephensi. Microbes and Infection. 2013;15:775–787. doi: 10.1016/j.micinf.2013.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Isaacs AT, Li F, Jasinskiene N, Chen X, Nirmala X, Marinotti O, Vinetz JM, James AA. Engineered resistance to Plasmodium falciparum development in transgenic anopheles stephensi. PLOS Pathogens. 2011;7:e1002017. doi: 10.1371/journal.ppat.1002017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Isaacs AT, Jasinskiene N, Tretiakov M, Thiery I, Zettor A, Bourgouin C, James AA. Transgenic anopheles stephensi coexpressing single-chain antibodies resist Plasmodium falciparum development. PNAS. 2012;109:E1922–E1930. doi: 10.1073/pnas.1207738109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Ito J, Ghosh A, Moreira LA, Wimmer EA, Jacobs-Lorena M. Transgenic anopheline mosquitoes impaired in transmission of a malaria parasite. Nature. 2002;417:452–455. doi: 10.1038/417452a. [DOI] [PubMed] [Google Scholar]
  35. Janczak M, Bukowski M, Górecki A, Dubin G, Dubin A, Wladyka B. A systematic investigation of the stability of green fluorescent protein fusion proteins. Acta Biochimica Polonica. 2015;62:407–411. doi: 10.18388/abp.2015_1026. [DOI] [PubMed] [Google Scholar]
  36. Kim W, Koo H, Richman AM, Seeley D, Vizioli J, Klocko AD, O'Brochta DA. Ectopic expression of a cecropin transgene in the human malaria vector mosquito anopheles gambiae (Diptera: culicidae): effects on susceptibility to plasmodium. Journal of Medical Entomology. 2004;41:447–455. doi: 10.1603/0022-2585-41.3.447. [DOI] [PubMed] [Google Scholar]
  37. Kyrou K, Hammond AM, Galizi R, Kranjc N, Burt A, Beaghton AK, Nolan T, Crisanti A. A CRISPR-Cas9 gene drive targeting doublesex causes complete population suppression in caged anopheles gambiae mosquitoes. Nature Biotechnology. 2018;36:1062–1066. doi: 10.1038/nbt.4245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lavazec C, Boudin C, Lacroix R, Bonnet S, Diop A, Thiberge S, Boisson B, Tahar R, Bourgouin C. Carboxypeptidases B of anopheles gambiae as targets for a Plasmodium falciparum transmission-blocking vaccine. Infection and Immunity. 2007;75:1635–1642. doi: 10.1128/IAI.00864-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lobo NF, Clayton JR, Fraser MJ, Kafatos FC, Collins FH. High efficiency germ-line transformation of mosquitoes. Nature Protocols. 2006;1:1312–1317. doi: 10.1038/nprot.2006.221. [DOI] [PubMed] [Google Scholar]
  40. Maccallum RM, Redmond SN, Christophides GK. An expression map for anopheles gambiae. BMC Genomics. 2011;12:620. doi: 10.1186/1471-2164-12-620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Meredith JM, Basu S, Nimmo DD, Larget-Thiery I, Warr EL, Underhill A, McArthur CC, Carter V, Hurd H, Bourgouin C, Eggleston P. Site-specific integration and expression of an anti-malarial gene in transgenic anopheles gambiae significantly reduces plasmodium infections. PLOS ONE. 2011;6:e14587. doi: 10.1371/journal.pone.0014587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Moreira LA, Ito J, Ghosh A, Devenport M, Zieler H, Abraham EG, Crisanti A, Nolan T, Catteruccia F, Jacobs-Lorena M. Bee venom phospholipase inhibits malaria parasite development in transgenic mosquitoes. Journal of Biological Chemistry. 2002;277:40839–40843. doi: 10.1074/jbc.M206647200. [DOI] [PubMed] [Google Scholar]
  43. Nash A, Urdaneta GM, Beaghton AK, Hoermann A, Papathanos PA, Christophides GK, Windbichler N. Integral gene drives for population replacement. Biology Open. 2019;8:bio037762. doi: 10.1242/bio.037762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Nirmala X, Marinotti O, Sandoval JM, Phin S, Gakhar S, Jasinskiene N, James AA. Functional characterization of the promoter of the vitellogenin gene, AsVg1, of the malaria vector, anopheles stephensi. Insect Biochemistry and Molecular Biology. 2006;36:694–700. doi: 10.1016/j.ibmb.2006.05.011. [DOI] [PubMed] [Google Scholar]
  45. Noble C, Olejarz J, Esvelt KM, Church GM, Nowak MA. Evolutionary dynamics of CRISPR gene drives. Science Advances. 2017;3:e1601964. doi: 10.1126/sciadv.1601964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Nolan T, Petris E, Müller HM, Cronin A, Catteruccia F, Crisanti A. Analysis of two novel midgut-specific promoters driving transgene expression in anopheles stephensi mosquitoes. PLOS ONE. 2011;6:e16471. doi: 10.1371/journal.pone.0016471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Oberhofer G, Ivy T, Hay BA. Behavior of homing endonuclease gene drives targeting genes required for viability or female fertility with multiplexed guide RNAs. PNAS. 2018;115:E9343–E9352. doi: 10.1073/pnas.1805278115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Oberhofer G, Ivy T, Hay BA. Cleave and rescue, a novel selfish genetic element and general strategy for gene drive. PNAS. 2019;116:6250–6259. doi: 10.1073/pnas.1816928116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Oberhofer G, Ivy T, Hay BA. Gene drive and resilience through renewal with next generation cleave and rescue selfish genetic elements. PNAS. 2020;117:9013–9021. doi: 10.1073/pnas.1921698117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Papathanos PA, Windbichler N, Menichelli M, Burt A, Crisanti A. The vasa regulatory region mediates germline expression and maternal transmission of proteins in the malaria mosquito anopheles gambiae: a versatile tool for genetic control strategies. BMC Molecular Biology. 2009;10:65. doi: 10.1186/1471-2199-10-65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Parish LA, Colquhoun DR, Ubaida Mohien C, Lyashkov AE, Graham DR, Dinglasan RR. Ookinete-interacting proteins on the microvillar surface are partitioned into detergent resistant membranes of anopheles gambiae midguts. Journal of Proteome Research. 2011;10:5150–5162. doi: 10.1021/pr2006268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Pham TB, Phong CH, Bennett JB, Hwang K, Jasinskiene N, Parker K, Stillinger D, Marshall JM, Carballar-Lejarazú R, James AA. Experimental population modification of the malaria vector mosquito, anopheles stephensi. PLOS Genetics. 2019;15:e1008440. doi: 10.1371/journal.pgen.1008440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Raban RR, Marshall JM, Akbari OS. Progress towards engineering gene drives for population control. The Journal of Experimental Biology. 2020;223:jeb208181. doi: 10.1242/jeb.208181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Raz A, Dinparast Djadid N, Zakeri S. Molecular characterization of the carboxypeptidase B1 of anopheles stephensi and its evaluation as a target for transmission-blocking vaccines. Infection and Immunity. 2013;81:2206–2216. doi: 10.1128/IAI.01331-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Reese MG, Eeckman FH, Kulp D, Haussler D. Improved splice site detection in genie. Journal of Computational Biology. 1997;4:311–323. doi: 10.1089/cmb.1997.4.311. [DOI] [PubMed] [Google Scholar]
  56. Reid W, Pilitt K, Alford R, Cervantes-Medina A, Yu H, Aluvihare C, Harrell R, O'Brochta DA. An anopheles stephensi Promoter-Trap: Augmenting Genome Annotation and Functional Genomics. G3: Genes, Genomes, Genetics. 2018;8:3119–3130. doi: 10.1534/g3.118.200347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Schwartz SH, Silva J, Burstein D, Pupko T, Eyras E, Ast G. Large-scale comparative analysis of splicing signals and their corresponding splicing factors in eukaryotes. Genome Research. 2008;18:88–103. doi: 10.1101/gr.6818908. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Scolari F, Siciliano P, Gabrieli P, Gomulski LM, Bonomi A, Gasperi G, Malacrida AR. Safe and fit genetically modified insects for pest control: from lab to field applications. Genetica. 2011;139:41–52. doi: 10.1007/s10709-010-9483-7. [DOI] [PubMed] [Google Scholar]
  59. Shane JL, Grogan CL, Cwalina C, Lampe DJ. Blood meal-induced inhibition of vector-borne disease by transgenic Microbiota. Nature Communications. 2018;9:4127. doi: 10.1038/s41467-018-06580-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Shen Z, Jacobs-Lorena M. A type I peritrophic matrix protein from the malaria vector anopheles gambiae binds to chitin cloning, expression, and characterization. The Journal of Biological Chemistry. 1998;273:17665–17670. doi: 10.1074/jbc.273.28.17665. [DOI] [PubMed] [Google Scholar]
  61. Simões ML, Dong Y, Hammond A, Hall A, Crisanti A, Nolan T, Dimopoulos G. The anopheles FBN9 immune factor mediates plasmodium species-specific defense through transgenic fat body expression. Developmental & Comparative Immunology. 2017;67:257–265. doi: 10.1016/j.dci.2016.09.012. [DOI] [PubMed] [Google Scholar]
  62. Sinden RE. A proteomic analysis of malaria biology: integration of old literature and new technologies. International Journal for Parasitology. 2004;34:1441–1450. doi: 10.1016/j.ijpara.2004.10.005. [DOI] [PubMed] [Google Scholar]
  63. The Anopheles gambiae 1000 Genomes Consortium Genetic diversity of the African malaria vector Anopheles gambiae. Nature. 2017;552:96–100. doi: 10.1038/nature24995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Tuteja N, Verma S, Sahoo RK, Raveendar S, Reddy IN. Recent advances in development of marker-free transgenic plants: regulation and biosafety concern. Journal of Biosciences. 2012;37:167–197. doi: 10.1007/s12038-012-9187-5. [DOI] [PubMed] [Google Scholar]
  65. VenkatRao V, Kumar SK, Sridevi P, Muley VY, Chaitanya RK. Cloning, characterization and transmission blocking potential of midgut carboxypeptidase A in anopheles stephensi. Acta Tropica. 2017;168:21–28. doi: 10.1016/j.actatropica.2016.12.035. [DOI] [PubMed] [Google Scholar]
  66. Volohonsky G, Terenzi O, Soichot J, Naujoks DA, Nolan T, Windbichler N, Kapps D, Smidler AL, Vittu A, Costa G, Steinert S, Levashina EA, Blandin SA, Marois E. Tools for anopheles gambiae transgenesis. G3: Genes, Genomes, Genetics. 2015;5:1151–1163. doi: 10.1534/g3.115.016808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Volohonsky G, Hopp AK, Saenger M, Soichot J, Scholze H, Boch J, Blandin SA, Marois E. Transgenic expression of the Anti-parasitic factor TEP1 in the malaria mosquito anopheles gambiae. PLOS Pathogens. 2017;13:e1006113. doi: 10.1371/journal.ppat.1006113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Wang S, Ghosh AK, Bongio N, Stebbings KA, Lampe DJ, Jacobs-Lorena M. Fighting malaria with engineered symbiotic Bacteria from vector mosquitoes. PNAS. 2012;109:12734–12739. doi: 10.1073/pnas.1204158109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Windbichler N, Menichelli M, Papathanos PA, Thyme SB, Li H, Ulge UY, Hovde BT, Baker D, Monnat RJ, Burt A, Crisanti A. A synthetic homing endonuclease-based gene drive system in the human malaria mosquito. Nature. 2011;473:212–215. doi: 10.1038/nature09937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. World malaria report Geneva: World Health Organization; 2019 2019

Decision letter

Editor: Philipp W Messer1
Reviewed by: Mara Lawniczak2, Eric Marois3

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

Engineered gene drives could potentially be used to spread genes that prevent malaria transmission in mosquitoes. In this study, the authors develop a proof-of-principle of effector components that would be part of a proposed integral gene drive. Such drives differ from standard gene drives by separating the Cas9 and effector components at different loci, with each one having biased inheritance, a useful strategy if the Cas9 has a substantial fitness cost.

Decision letter after peer review:

Thank you for submitting your article "Converting endogenous genes of the malaria mosquito into simple non-autonomous gene drives for population replacement" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Diethard Tautz as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Mara Lawniczak (Reviewer #2); Eric Marois (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

As the editors have judged that your manuscript is of interest, but as described below that additional experiments are required before it is published, we would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). First, because many researchers have temporarily lost access to the labs, we will give authors as much time as they need to submit revised manuscripts. We are also offering, if you choose, to post the manuscript to bioRxiv (if it is not already there) along with this decision letter and a formal designation that the manuscript is "in revision at eLife". Please let us know if you would like to pursue this option. (If your work is more suitable for medRxiv, you will need to post the preprint yourself, as the mechanisms for us to do so are still in development.)

Summary:

This paper demonstrates a number of important steps necessary for implementing the recently proposed "integral gene drive" strategy. In this approach, endogenous mosquito genes are hijacked to express a heterologous effector peptide intended to render mosquitoes resistant to human pathogens. Such drives differ from standard gene drives by separating the Cas9 and effector components at different loci, with each one having biased inheritance. This could be useful if the Cas9 has a substantial fitness cost and could also more easily target conserved sites of important genes compared to a standard drive. While it remains to be seen how effective this approach will be in practice, the paper provides valuable insights into how such gene drives could work in mosquitoes.

Essential revisions:

All reviewers found that overall the manuscript is technically sound. However, it was also felt that some of the stated conclusions were not yet fully supported by the performed analyses. After discussing these issues, we identified the following four major areas that we believe would need to be addressed in a revision in order to make this paper suitable for eLife:

1. Better quantification of target gene expression for AP2 and CP. To demonstrate the potential usefulness of the integral gene drive concepts, more evidence should be provided that the method can indeed reliably insert an intron and genes (at least gRNAs) without significantly reducing the expression of the target gene. So far, a quantitative technique was only used on one of the three targets (Aper1) and showed substantially reduced expression. For the two other targets only Westerns blots are presented. As pointed out in the reviews, these would need additional analysis and controls to be truly quantitative. If this cannot be shown, the authors would have to state this caveat very prominently in the manuscript, as it could severely limit the applicability of the integral gene drive concept.

2. Improved fitness assessment. Reviewers 3 and 4 point towards potential issues of the fitness analysis due to the fact that the lines were not outcrossed before they were interbred. We suggest that these experiments are redone with better controls. If this cannot be done, the caveats and limits would need to be clearly presented.

3. Additional controls for transmission blocking assays are needed if the authors want to sell Scorpine as a promising antimalarial effector. As detailed by Reviewers 2 and 3, the authors should consider adding inbred controls for phenotypic assays relating to transmission. They note themselves that the process of inbreeding can influence infections. It would thus seem important to try to mitigate this noise with replicate transgenic lines from independent G0s and potentially also "control" transgenic lines that have been through inbreeding, or even just several other inbred lines from the G3 background. Reviewer 3 additionally raised questions about the statistical analyses shown in Figure 5, which should be addressed.

4. Expanded treatment of resistance potential. It's unclear how exactly resistance alleles would be dealt with in the integral gene drive framework. At the very least, the authors should significantly expand the discussion of this issue. As pointed out by Reviewer 4, it is also not clear on what evidence the author's conclusion was based "that no end-joining was detected". This should be better explained.

Reviewer #2:

This is a compelling demonstration of a number of important steps that take population replacement gene drive for malaria control closer to reality. I have no major concerns and think the manuscript shows the authors have made substantial progress in (a) taking Integral Gene Drive (which is a recent idea from senior author Windbichler) into mosquitoes, (b) successfully removing marker genes to make the whole system more effective, (c) demonstrating that the approach works to express a molecule to reduce parasite infection rates in the lab while also making it possible to test these effector molecules in natural settings without risk of accidental drive release, and (d) also showing that drive is successful. I think the study is high impact.

Reviewer #3:

Hoermann et al. present a new gene engineering concept for disease vector mosquitoes, whereby endogenous mosquito genes are hijacked to express a heterologous effector peptide intended to render mosquitoes resistant to human pathogens. In addition, a synthetic intron added within the effector-coding sequence will express gRNAs for the CRISPR-Cas9 system, recognizing the transgene's own wild-type insertion locus. In the presence of a source of Cas9, the effector gene is thus able to home into a wild-type chromosome, triggering a gene drive effect that can increase the frequency of the modification in the mosquito population. A fluorescent marker, also cloned within the intron, is used at early steps to track the transgene, but is subsequently removed by Cre/lox excision to restore host gene + effector expression and to result in minimal genetic modification.

This is an extremely elegant procedure and a remarkable technical achievement, especially in such a difficult species as Anopheles gambiae. The choice of midgut-specific promoters to express anti-malaria effectors makes sense to target early stages of developement of parasites, before they had a chance to amplify in the mosquito. Using endogenous regulatory sequences without a need for promoter cloning alleviates the tedious work of individual promoter characterization. The molecular designs are well described, and the results likely to have a large future impact in the development of vector control tools, notwithstanding some weakness in assessing the antiparasitic effect of Scorpine in the transgenic mosquitoes (see below). I agree that this type of transgene should facilitate semi-field or field testing of candidate anti-parasitic effectors, before any true gene drive intervention is envisaged. I fully support publication of this manuscript in eLife, once the following major point concerning Figure 5is processed by the authors.

P. falciparum transmission blocking assays – Figure 5

I have several questions about figure 5.

– Are mosquitoes with 0 parasite taken into account in the calculation of the mean and median? This should be precise in the legend or in Exp procedures.

– Several replicates have been pooled to generate the figure, for each transgenic strain. Is this legitimate? i.e. were the mean oocyst number and prevalence, reflecting the quality of each ookinete culture, similar enough between replicates to allow pooling? If not, it would be more legitimate to show the result of a single representative replicate. Please provide a table with the raw parasite counts of the separate replicates in a Supplementary file so that readers can better judge these results. I note that panel C is very useful.

– I find the bar graph hard to interpret. The median M is represented either as a stroke inside some bars, or overlapping the x axis when M=0. The size of the bar doesn't represent the mean, m. Does it represent a confidence interval? this must be explained in the legend.

Maybe a dot plot where each dot represents the parasite counts of one mosquito would better represent these results?

– From my point of view, mosquito numbers in some of these infections may be too low to yield solid results. Especially in the ScoG-AP2 experiment: 37 mosquitoes in the G3 control with a prevalence of 51% means that only 19 mosquitoes across R=2 replicates contained parasites. This low number is associated with a risk of atypical outliers in the parasite counts, even if the statistical tests presented here show good significance. In the panel C analysis of these values, we see from the size of the squares that the replicate that had the highest statistical significance also had the smallest number of mosquitoes. The replicate with a larger N has only one *. For the Aper1-Sco line, N is large and the statistical significance is high (although panel C shows that one of the 4 replicates showed no difference) but I'm still somewhat unconvinced of the effect of scorpine in this line: the mean only drops from 10 to 6 parasites, prevalence drops from 37 to 21%. Combining this moderate effect with the facts that (1) some replicates sometimes show no Scorpine effect, (2) the Sco-CP line, which has a comparably high level of scorpine expression according to Suppl Figure. 3, shows the exact opposite, i.e. pro-parasitic effect, makes me doubt the antiparasitic effect of scorpine.

In the case of the ScoG-AP2 line, scorpine expression is only 1/10 to 1/8 of the expression in the other two lines, but seems to have a similar effect as in the highest (Aper1) expressing line: one possibility is that fusion to GFP stabilizes Scorpine so that lower expression results in higher activity, but a milder effect would have been logical if scorpine had a dose-dependent effect.

One caveat of these experiments is that the genetic background of the control mosquitoes (G3) is not exactly the same as the transgenics (G3 x KIL). There is a possibility that the KIL background contributed some alleles conferring elevated Plasmodium resistance (or the opposite in the case of Sco-CP). I would find the results more trustable if a control of equivalent genetic background could have been generated for each transgenic strain (in the process of homozygous line selection, the homozygous WT siblings could have been retained to serve as specific controls, though I know how demanding this work would have been…)

Another caveat is that we don't know the precise kinetics (e.g. between 0-36h post blood meal) of the scorpine protein midgut concentration in each transgenic line, and we don't know at what time point after the blood meal parasites would be most susceptible to killing by scorpine (probably between 3 and 24h, time after which they transform into protected cysts).

Taken together, the scorpine data is not highly conclusive and there remains much uncertainty about the efficacy of transgenically expressed Scorpine as an anti-plasmodium molecule. I'm not requesting additional experiments (though future long term assessments of these transgenic lines with new isogenic controls would be very interesting), but I invite the authors to downstate scorpine's potential effectiveness as an antimalarial effector in vivo. This does not decrease the importance of this work of which scorpine is only one aspect. A candidate molecule had to be chosen for these proof-of-principle experiments. Scorpine may not have been a very lucky choice, but its moderate (or opposite) effect should be seen as an interesting result in itself. The way is now open to test other possible candidates.

Reviewer #4:

Gene drives are alleles that bias their inheritance to spread through a population. Engineered gene drives could potentially be used to spread genes that prevent malaria transmission in mosquitoes. In this study, the authors develop a proof-of-principle of effector components that would be part of a proposed integral gene drives. Such drives different from standard gene drives by separating the Cas9 and effector components at different loci, with each one having biased inheritance, a useful strategy if the Cas9 has a substantial fitness cost (though it remains unclear if this is the case). They can also more easily target conserved sites of important genes compared to a standard drive, though this is not unique to the integral gene drive strategy. The Cas9 and effector components would be expressed from natural promoters, with introns and translation skipping utilized so that the original gene works properly and so gRNAs can be expressed within the intron. The authors showed that the effector component of such a drive performed as expected, and that both effectors and the target gene were expressed. Overall, the manuscript is a mostly sound technical demonstration of the effector component of an integral gene drive.

1. It's unclear how exactly resistance alleles would be dealt with in the author's strategy. While an integral gene drive could target an essential gene so that resistance alleles are nonviable, that doesn't seem to be the strategy here, since the authors needed to target a gene with a promoter that would be a good match for their effector. The need for both an essential gene and a suitable promoter in one package may thus limit the use of the integral gene drive strategy. Higher fitness costs associated with disruption of the gene may partially ameliorate this issue, but this was not confirmed in the current study (transgenic strains had lower fitness, but was this due to the drive, the effector, or the reduced expression of the host target gene?).

2. The authors removed their marker genes by surrounding them with LoxP sites and crossing their lines to Cre. This was justified since the authors believed that the presence of the marker would interfere with expression of the target gene, causing fitness issues. However, the authors found no sign of fitness reduction based on anecdotal (?) observations. Were these observations actually quantified, in which case they should be supplemental material? It could be particularly interesting in light of the fact that even without the marker, the transgenic strains suffered fitness effects. It would be nice if the decision to remove the marker was better justified in this section, based on the next section where it was found that the marker interfered with effector expression. Perhaps even combining or reversing the order of the sections would be appropriate (for example, consider first saying that the marker interferes with expression, then mention how this was expected and the marker could be removed, solving the problem).

3. Based on figure 3D-E, it appears that the target host gene has reduced expression even after the marker is removed. This is quite important for future considerations, yet seems to be glossed over. For example, if a target is chosen that can effectively help remove resistance alleles due to fitness costs from disrupting the target gene, this means that the gene drive will also suffer fitness costs.

4. The fitness analysis examining fecundity and hatch rates is not very informative. While similar fitness effects among the transgenic strains lends some weak evidence that inbreeding may account for the fitness reduction, variability between individuals certainly does not (after all, wild-type individuals were also highly variable). Also, if the Cre line has a different background than G3, wouldn't all the lines have received some of this background from prior crosses? Perhaps this could be the answer. It would nonetheless have been better for the authors to outcross the lines before inbreeding them, with similar inbreeding for the wild-type control, before doing this experiment. Because of the issues with this experiment, I'd suggest that it is conducted again with better controls or is moved to the supplement.

5. It's hard to believe that no end-joining took place, even though the last sentence of the results indicates that no end-joining was detected. Did the authors not sequence any progeny with the drive, to look for end-joining products formed from maternally deposited Cas9? Other studies with vasa-Cas9 in Anopheles saw this phenomenon occur at a high rate. For end-joining products formed as an alternative to HDR, was it 21 individuals that were sequenced (nine with Aper1 and twelve form the full AP2 sequencing)?

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "Converting endogenous genes of the malaria mosquito into simple non-autonomous gene drives for population replacement" for further consideration by eLife. Your revised article has been reviewed by two peer reviewers and the evaluation has been overseen by Diethard Tautz as the Senior Editor and a Reviewing Editor.

Summary:

Engineered gene drives could potentially be used to spread genes that prevent malaria transmission in mosquitoes. In this study, the authors develop a proof-of-principle of effector components that would be part of a proposed integral gene drive. Such drives differ from standard gene drives by separating the Cas9 and effector components at different loci, with each one having biased inheritance, a useful strategy if the Cas9 has a substantial fitness cost. They might also be able to more easily target conserved sites of important genes compared to other drive designs. The Cas9 and effector components would be expressed from natural promoters, with introns and translation skipping utilized so that the original gene works properly and so gRNAs can be expressed within the intron. The authors showed that the effector component of such a drive performed as expected, and that both effectors and the target gene were expressed.

The manuscript has been substantially improved but there are still a few remaining issues raised by reviewer #4. We will be happy to accept the manuscript when these points have been addressed, which should only require a few short and easy edits to the text.

Reviewer #2:

The authors have addressed the comments from the first review well and I have no further concerns. I believe this should be published at eLife.

Reviewer #4:

I was reviewer #4 from the previous round. The authors made a good effort to improve their manuscript, and the extra experiments were certainly quite helpful. Some of the issues raised in the first review were successfully addressed. However, based on the data, I'm still not convinced by this manuscript that integral gene drives would offer a substantial advantage over other simpler systems or even that they can reach the capability of other rescue drives without a relatively greater development effort due to complications with target gene selection and expression (again, it is still certainly a possibility, but seems that future studies would really be needed to show this).

On a better note, only a few more revisions are needed in the current manuscript for all conclusions in to be sound and clearly presented. These necessary revisions can probably be done by changing the text without any new data or analysis (the length of my comments here is just an attempt to clarify what I'm suggesting – actual responses and revisions will almost certainly be far shorter).

On the assessment and considering of resistance alleles:

Regarding the new treatment of resistance, the authors say in their discussion that:

"Further studies e.g. in population cages could reveal to what degree the observed classes of mutations would be actively selected against, which would favour faithful transmission of the drive and is one predicted benefit of integral gene drives (33)."

This seems a bit misleading, since in the integral gene drive setup, such mutations could have a lower fitness cost than the drive, potentially strongly selecting against the drive once the pool of wild-type alleles is extinguished. The integral drive system, if anything, seems less able to deal with this sort of resistance allele than other designs unless it targets an essential gene, which the authors have noted does not appear to be the case with their designs in this study. This should be more clearly laid out at this point in the discussion.

Additionally, the amplicon sequencing was perhaps not the optimal experiment to directly measure the rate of paternal carryover. While it is certainly clear from the data that there was substantial maternal carryover, it can't be assessed quantitatively, and it is unclear if there was paternal carryover at all. This is because somatic Cas9 cleavage could account for some altered sequences. Based on the crossed, nearly half the offspring that were sequenced should have both integral drive alleles (almost 100% from homing) and vasa-Cas9 alleles (50% from normal inheritance). Somatic expression of Cas9 and gRNAs in these offspring could produce cleavage in individual cells, resulting in modified target site loci that would subsequently get detected during pooled amplicon sequenced. This needs to be clearly delineated in the results to explain the limitations of the experiment.

Side note: In the future, such an experiment could have been done with only offspring that did not inherit Cas9 (thus preventing acquisition of mutations from leaky somatic cleavage). To avoid amplicon sequencing, the authors could also have sequenced individual offspring (with or without Cas9) and looked for the presence of just 1-2 sequenced, which would confirm alleles by parental deposition (though only if Cas9 cleavage occurred in the very early embryo). The authors could also have done additional crosses with these offspring to isolate complete resistance allele sequences or assess the ability of drive and Cas9 heterozygotes the bias inheritance in a subsequent generation (lack of the ability to do so would be indicative of parental deposition forming resistance alleles).

For the resistance alleles that form in the germline, I'm now satisfied with the additional detail and interpretation. These seem to be below the rate that can be detected by the reasonable-sized experiment, which is quite nice.

On the fecundity/pupa hatch rate assessment:

The new data is quite informative and appears to support the notion that the differences in fecundity between lines is not based on the effector construct. There is no difference between the wild-type line and the GFP/CFP lines in Figure 4A. Yet, all the lines with GFP/CFP removed together with the Cre line have lower fecundity than the wild-type line in Figure 4B. Since removing the fluorescent protein should itself have no downside and probably even marginally improve fitness, this means that the reduced fitness is probably due to crossing with the Cre line. It might be best to spell this out a bit more thoroughly in the results. Additionally, the reduced fitness of the lines after crossing with Cre to remove the fluorescent proteins is now more of a minor point not really related to the integral drive effectors themselves, so perhaps this data should be moved to the supplement, leaving only the new data as a main figure.

For the pupa hatch rate, it should be noted if the relatively low level of significance can survive multiple-testing correction. This seems to be obliquely indicated in the text (that it cannot), but maybe something in the figure legend too so that readers don't focus too much on the "*".

On the other hand, the lack of fitness effects when the fluorescent proteins are present (and reducing the expression of the host gene as per Figure 3) does lend evidence to indicate that the target genes are perhaps not essential (as the authors note), increasing the difficulty in dealing with resistance, as noted above.

On the assessment of host gene expression changes:

The quantitative analysis provided in Figure 3D was certainly a useful addition, confirming the reduced expression of the host gene in two of three effectors, particularly in AP2. This indicates that the integral gene method may have difficulty when working with essential genes, which is likely needed to reduce the accumulation of resistance alleles. Yet, the relatively high expression of CP provides hope that the method could work for some target sites. Still, this may be a substantial issue that the integral drive strategy must overcome when targeting essential genes, and this is not clearly spelled out in the discussion when it certainly should be (next to the discussion of reduced host genes). After all, why bother restoring host gene expression in the first place if it unimportant? You could just insert the effector gene (scorpine or something else) without bothering to add the intron needed for host gene expression.

In line 384 (of the marked-up version) of the discussion, it is noted that the reduced mRNA could lead to normal protein levels. However, another possibility is that even somewhat reduced protein levels are not sufficient to have a fitness impact.

As a side note, the similar analysis of scorpine expression is also of note, in which one of the three constructs had very low expression. This further indicates that the integral gene drive method may not be suitable for all candidate sites. This is most likely true for more standard drive designs as well, though, so the issue here may not be due to the design of the integral design itself. However, the authors should still make this clearer in the Results section and in the discussion that one of the three systems had lower expression of scorpine. It is now only obliquely or indirectly referred to at present, but it needs to be clearly presented as a major result of the paper (even if it is also fine to indicate that the reduction was not likely due to the integral system itself).

eLife. 2021 Apr 13;10:e58791. doi: 10.7554/eLife.58791.sa2

Author response


Essential revisions:

All reviewers found that overall the manuscript is technically sound. However, it was also felt that some of the stated conclusions were not yet fully supported by the performed analyses. After discussing these issues, we identified the following four major areas that we believe would need to be addressed in a revision in order to make this paper suitable for

1. Better quantification of target gene expression for AP2 and CP. To demonstrate the potential usefulness of the integral gene drive concepts, more evidence should be provided that the method can indeed reliably insert an intron and genes (at least gRNAs) without significantly reducing the expression of the target gene. So far, a quantitative technique was only used on one of the three targets (Aper1) and showed substantially reduced expression. For the two other targets only Westerns blots are presented. As pointed out in the reviews, these would need additional analysis and controls to be truly quantitative. If this cannot be shown, the authors would have to state this caveat very prominently in the manuscript, as it could severely limit the applicability of the integral gene drive concept.

We now performed qPCR on all 6 lines for the endogenous host gene, as well as for the effector transgene (Figures 3C and D) allowing us to quantify expression better.

2. Improved fitness assessment. Reviewers 3 and 4 point towards potential issues of the fitness analysis due to the fact that the lines were not outcrossed before they were interbred. We suggest that these experiments are redone with better controls. If this cannot be done, the caveats and limits would need to be clearly presented.

The transgenic founders were outbred to G3 WT for 3 generations. We made this clearer in the Methods section. Once the fluorescent marker is removed, performing further rounds of outcrossing is too costly as it would involve molecular genotyping (there is no trackable visible marker). We now performed additional fitness assays with the lines including the marker-module (Figures 4A and C) as a more relevant comparator.

3. Additional controls for transmission blocking assays are needed if the authors want to sell Scorpine as a promising antimalarial effector. As detailed by Reviewers 2 and 3, the authors should consider adding inbred controls for phenotypic assays relating to transmission. They note themselves that the process of inbreeding can influence infections. It would thus seem important to try to mitigate this noise with replicate transgenic lines from independent G0s and potentially also "control" transgenic lines that have been through inbreeding, or even just several other inbred lines from the G3 background. Reviewer 3 additionally raised questions about the statistical analyses shown in Figure 5, which should be addressed.

We want to be clear that it is not our intention to sell Scorpine as an antimalarial effector in this paper, in fact it is neither mentioned in the title or abstract. The paper describes a new way of generating minimal non-autonomous gene drives and methods for the integration and co-expression of effectors from endogenous loci. If this was not clear, we have made an additional effort to make sure that message of the paper stays focussed and made changes to the results and Discussion sections.

We felt that for the sake of completeness it would nonetheless be necessary to report any obvious or extreme effects that the modifications of the target genes or the expression of Scorpine may have on transmission. The former in particular could have possibly had a strong effect in either direction. We chose Scorpine as a prototype effector, because it was previously reported to have an anti-malarial effect in paratransgenesis, although no transgenic Scorpine line was published before submission of this manuscript. As a reviewer mentions we needed to start somewhere.

We did observe a statistically significant effect on transmission in some of the experiments. Further experiments will be needed to tease apart the various effects that may contribute to this. The reviewers correctly identified inbreeding as a contributor, which is affecting most published homozygous transgenic lines to a greater or lesser degree (using inbreed controls is not a common procedure and could not settle this issue either). More generally, current methods to measure transmission blockage in mosquitoes have many known limitations some of which the reviewers also alluded to. A mixed rearing protocol is currently being established at Imperial and various molecular readouts (for various stages including sporozoites which we haven’t examined here) to deal with the shortfalls of the oocyte counting assay are being worked on by many labs. But dealing with all this here would go beyond the topic and scope of the present manuscript. We have changed the discussion to make sure it is clear why this experiment was done and what we can and cannot conclude from it.

Because these issues and pitfalls are well known, many gene drive papers with a focus on new drive designs for population replacement do not always conduct transmission experiments (for example https://www.pnas.org/content/early/2015/11/18/1521077112 with 700+ citations), perform their experiments without effectors or perform their experiments in Drosophila.

4. Expanded treatment of resistance potential. It's unclear how exactly resistance alleles would be dealt with in the integral gene drive framework. At the very least, the authors should significantly expand the discussion of this issue. As pointed out by Reviewer 4, it is also not clear on what evidence the author's conclusion was based "that no end-joining was detected". This should be better explained.

We now provided two sets of additional data for the current set of experiments with the Vasa-Cas9 driver. Dealing with another reviewer comment we performed two more biological replicates for the homing assays, and this allowed us to isolate more non-fluorescent individuals for Sanger sequencing. However, this is a numbers game, so for example with a 99.55% homing rate and a sample size of n=1980 we still could only find and analyse 9 non-fluorescent individuals for Aper1-ScoGFP and this did not include any non-WT alleles. For CP two replicates showed 100% homing but we managed to obtain some non-fluorescent individuals in replicate 3.

To make our data more meaningful we now also performed amplicon sequencing of all pooled larvae that allows us to look at maternal deposition and germline modification quantitatively. Here we included also WT and the Cas9 strain as a background control to distinguish between pre-existing SNPs and newly emerged indels as a result of CRISPR/Cas9 activity.

The two datasets are looking at slightly different set of chromosomes (Figure 6—figure supplement 3A).

Reviewer #3:

Hoermann et al. present a new gene engineering concept for disease vector mosquitoes, whereby endogenous mosquito genes are hijacked to express a heterologous effector peptide intended to render mosquitoes resistant to human pathogens. In addition, a synthetic intron added within the effector-coding sequence will express gRNAs for the CRISPR-Cas9 system, recognizing the transgene's own wild-type insertion locus. In the presence of a source of Cas9, the effector gene is thus able to home into a wild-type chromosome, triggering a gene drive effect that can increase the frequency of the modification in the mosquito population. A fluorescent marker, also cloned within the intron, is used at early steps to track the transgene, but is subsequently removed by Cre/lox excision to restore host gene + effector expression and to result in minimal genetic modification.

This is an extremely elegant procedure and a remarkable technical achievement, especially in such a difficult species as Anopheles gambiae. The choice of midgut-specific promoters to express anti-malaria effectors makes sense to target early stages of developement of parasites, before they had a chance to amplify in the mosquito. Using endogenous regulatory sequences without a need for promoter cloning alleviates the tedious work of individual promoter characterization. The molecular designs are well described, and the results likely to have a large future impact in the development of vector control tools, notwithstanding some weakness in assessing the antiparasitic effect of Scorpine in the transgenic mosquitoes (see below). I agree that this type of transgene should facilitate semi-field or field testing of candidate anti-parasitic effectors, before any true gene drive intervention is envisaged. I fully support publication of this manuscript in eLife, once the following major point concerning Figure 5 is processed by the authors.

P. falciparum transmission blocking assays – Figure 5

I have several questions about figure 5.

– Are mosquitoes with 0 parasite taken into account in the calculation of the mean and median? This should be precise in the legend or in Exp procedures.

Zeros were included when calculating mean and median and this was now clarified in the manuscript.

– Several replicates have been pooled to generate the figure, for each transgenic strain. Is this legitimate? i.e. were the mean oocyst number and prevalence, reflecting the quality of each ookinete culture, similar enough between replicates to allow pooling? If not, it would be more legitimate to show the result of a single representative replicate. Please provide a table with the raw parasite counts of the separate replicates in a Supplementary file so that readers can better judge these results. I note that panel C is very useful.

It is the common procedure in the field to pool replicates. We added Supplementary Table S6 with all the raw data of the transmission blocking assay.

– I find the bar graph hard to interpret. The median M is represented either as a stroke inside some bars, or overlapping the x axis when M=0. The size of the bar doesn't represent the mean, m. Does it represent a confidence interval? this must be explained in the legend.

Maybe a dot plot where each dot represents the parasite counts of one mosquito would better represent these results?

We agree with the reviewer and converted Figure 5 to a dot blot.

– From my point of view, mosquito numbers in some of these infections may be too low to yield solid results. Especially in the ScoG-AP2 experiment: 37 mosquitoes in the G3 control with a prevalence of 51% means that only 19 mosquitoes across R=2 replicates contained parasites. This low number is associated with a risk of atypical outliers in the parasite counts, even if the statistical tests presented here show good significance. In the panel C analysis of these values, we see from the size of the squares that the replicate that had the highest statistical significance also had the smallest number of mosquitoes. The replicate with a larger N has only one *. For the Aper1-Sco line, N is large and the statistical significance is high (although panel C shows that one of the 4 replicates showed no difference) but I'm still somewhat unconvinced of the effect of scorpine in this line: the mean only drops from 10 to 6 parasites, prevalence drops from 37 to 21%. Combining this moderate effect with the facts that (1) some replicates sometimes show no Scorpine effect, (2) the Sco-CP line, which has a comparably high level of scorpine expression according to Suppl Figure. 3, shows the exact opposite, i.e. pro-parasitic effect, makes me doubt the antiparasitic effect of scorpine.

We thank the reviewer for looking into the details and for spotting a mistake in the ScoG-AP2 experiment, where the analysis of the G3 control only included 37 out of 78 available control mosquitoes. This was corrected.

In the case of the ScoG-AP2 line, scorpine expression is only 1/10 to 1/8 of the expression in the other two lines, but seems to have a similar effect as in the highest (Aper1) expressing line: one possibility is that fusion to GFP stabilizes Scorpine so that lower expression results in higher activity, but a milder effect would have been logical if scorpine had a dose-dependent effect.

One caveat of these experiments is that the genetic background of the control mosquitoes (G3) is not exactly the same as the transgenics (G3 x KIL). There is a possibility that the KIL background contributed some alleles conferring elevated Plasmodium resistance (or the opposite in the case of Sco-CP). I would find the results more trustable if a control of equivalent genetic background could have been generated for each transgenic strain (in the process of homozygous line selection, the homozygous WT siblings could have been retained to serve as specific controls, though I know how demanding this work would have been…)

Another caveat is that we don't know the precise kinetics (e.g. between 0-36h post blood meal) of the scorpine protein midgut concentration in each transgenic line, and we don't know at what time point after the blood meal parasites would be most susceptible to killing by scorpine (probably between 3 and 24h, time after which they transform into protected cysts).

We agree that one has to be cautious to draw a conclusion about the effect of Scorpine and this is also not the paper’s focus. Its mode of action might be locus-depended, and its efficacy might be dosage-dependent, which is not necessarily linear. The fact that we are exploring 3 quite different host genes for expression makes a direct comparison more difficult. Our intention was not to focus on Scorpine, but to explore different loci and design-options to express effector molecules within a novel gene drive framework. (See response to point 3 in essential revisions).

Taken together, the scorpine data is not highly conclusive and there remains much uncertainty about the efficacy of transgenically expressed Scorpine as an anti-plasmodium molecule. I'm not requesting additional experiments (though future long term assessments of these transgenic lines with new isogenic controls would be very interesting), but I invite the authors to downstate scorpine's potential effectiveness as an antimalarial effector in vivo. This does not decrease the importance of this work of which scorpine is only one aspect. A candidate molecule had to be chosen for these proof-of-principle experiments. Scorpine may not have been a very lucky choice, but its moderate (or opposite) effect should be seen as an interesting result in itself. The way is now open to test other possible candidates.

We agree and we have re-phrased several paragraphs to be more sober about Scorpine’s potential.

The reviewer is correct that we needed to start with a candidate effector and Scorpine was used as a proof-of-principle.

Reviewer #4:

Gene drives are alleles that bias their inheritance to spread through a population. Engineered gene drives could potentially be used to spread genes that prevent malaria transmission in mosquitoes. In this study, the authors develop a proof-of-principle of effector components that would be part of a proposed integral gene drives. Such drives different from standard gene drives by separating the Cas9 and effector components at different loci, with each one having biased inheritance, a useful strategy if the Cas9 has a substantial fitness cost (though it remains unclear if this is the case). They can also more easily target conserved sites of important genes compared to a standard drive, though this is not unique to the integral gene drive strategy. The Cas9 and effector components would be expressed from natural promoters, with introns and translation skipping utilized so that the original gene works properly and so gRNAs can be expressed within the intron. The authors showed that the effector component of such a drive performed as expected, and that both effectors and the target gene were expressed. Overall, the manuscript is a mostly sound technical demonstration of the effector component of an integral gene drive.

1. It's unclear how exactly resistance alleles would be dealt with in the author's strategy. While an integral gene drive could target an essential gene so that resistance alleles are nonviable, that doesn't seem to be the strategy here, since the authors needed to target a gene with a promoter that would be a good match for their effector. The need for both an essential gene and a suitable promoter in one package may thus limit the use of the integral gene drive strategy. Higher fitness costs associated with disruption of the gene may partially ameliorate this issue, but this was not confirmed in the current study (transgenic strains had lower fitness, but was this due to the drive, the effector, or the reduced expression of the host target gene?).

We did not know beforehand whether the host genes would turn out to be essential or not. To be more precise, we did not know whether their modification would lead to extreme fitness effects. We can also not rule out that their homozygous disruption (e.g. by NHEJ) does indeed have a fitness costs under either lab or field conditions and that such alleles would be selected against in a population. Generating clear knock-out alleles and studying their effect in population cages is a current undertaking. We made this distinction clearer in the discussion.

Essential genes are not absolutely required for this approach, although they provide additional benefits (this was discussed and modelled in the Nash et al. 2018 paper).

We now performed additional fitness assays with the lines including the marker-module (Figures 4A and C) as a more relevant comparator. These suggest that the modification of host gene expression can be ruled out as a source of major fitness effects.

2. The authors removed their marker genes by surrounding them with LoxP sites and crossing their lines to Cre. This was justified since the authors believed that the presence of the marker would interfere with expression of the target gene, causing fitness issues. However, the authors found no sign of fitness reduction based on anecdotal (?) observations. Were these observations actually quantified, in which case they should be supplemental material? It could be particularly interesting in light of the fact that even without the marker, the transgenic strains suffered fitness effects.

We now performed additional fitness assays with the lines including the marker-module (Figures 4A and C) as a more relevant comparator. These suggest that the modification of host gene expression can be ruled out as a source of major fitness effects.

It would be nice if the decision to remove the marker was better justified in this section, based on the next section where it was found that the marker interfered with effector expression. Perhaps even combining or reversing the order of the sections would be appropriate (for example, consider first saying that the marker interferes with expression, then mention how this was expected and the marker could be removed, solving the problem).

We tried to better justify this step in the text.

3. Based on figure 3D-E, it appears that the target host gene has reduced expression even after the marker is removed. This is quite important for future considerations, yet seems to be glossed over. For example, if a target is chosen that can effectively help remove resistance alleles due to fitness costs from disrupting the target gene, this means that the gene drive will also suffer fitness costs.

We now performed additional qPCR assays to quantify host gene mRNA expression. We improved the discussion to incorporate some of the considerations above. When combined with the new set of fitness experiments, again it appears that the modification of host gene expression can be ruled out as a source of major fitness effects at least in the assays we performed.

4. The fitness analysis examining fecundity and hatch rates is not very informative. While similar fitness effects among the transgenic strains lends some weak evidence that inbreeding may account for the fitness reduction, variability between individuals certainly does not (after all, wild-type individuals were also highly variable). Also, if the Cre line has a different background than G3, wouldn't all the lines have received some of this background from prior crosses? Perhaps this could be the answer. It would nonetheless have been better for the authors to outcross the lines before inbreeding them, with similar inbreeding for the wild-type control, before doing this experiment. Because of the issues with this experiment, I'd suggest that it is conducted again with better controls or is moved to the supplement.

The transgenic founders were outbred to G3 WT for 3 generations. Once the fluorescent marker is removed further rounds of outcrossing are too costly as it would involve molecular genotyping. Every published homozygous transgenic line is inbred to a certain degree, but it is not a common procedure to use inbreed wild-type controls as this ultimately can’t settle the question.

To complete this experiment and address some of the issues mentioned, we now performed additional fitness assays with the lines including the marker-module (Figures 4A and C) as a more relevant comparator. (see comments above).

5. It's hard to believe that no end-joining took place, even though the last sentence of the results indicates that no end-joining was detected. Did the authors not sequence any progeny with the drive, to look for end-joining products formed from maternally deposited Cas9? Other studies with vasa-Cas9 in Anopheles saw this phenomenon occur at a high rate. For end-joining products formed as an alternative to HDR, was it 21 individuals that were sequenced (nine with Aper1 and twelve form the full AP2 sequencing)?

See response to point 4 in essential revisions.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

[…] Reviewer #4:

I was reviewer #4 from the previous round. The authors made a good effort to improve their manuscript, and the extra experiments were certainly quite helpful. Some of the issues raised in the first review were successfully addressed. However, based on the data, I'm still not convinced by this manuscript that integral gene drives would offer a substantial advantage over other simpler systems or even that they can reach the capability of other rescue drives without a relatively greater development effort due to complications with target gene selection and expression (again, it is still certainly a possibility, but seems that future studies would really be needed to show this).

On a better note, only a few more revisions are needed in the current manuscript for all conclusions in to be sound and clearly presented. These necessary revisions can probably be done by changing the text without any new data or analysis (the length of my comments here is just an attempt to clarify what I'm suggesting – actual responses and revisions will almost certainly be far shorter).

We thank reviewer 4 for all the genuinely valuable comments, and the significant time he/she has invested in improving this manuscript. Fine-tuning of the integral gene drive designs is definitely required and further studies are already being undertaken.

On the assessment and considering of resistance alleles:

Regarding the new treatment of resistance, the authors say in their discussion that:

"Further studies e.g. in population cages could reveal to what degree the observed classes of mutations would be actively selected against, which would favour faithful transmission of the drive and is one predicted benefit of integral gene drives (33)."

This seems a bit misleading, since in the integral gene drive setup, such mutations could have a lower fitness cost than the drive, potentially strongly selecting against the drive once the pool of wild-type alleles is extinguished. The integral drive system, if anything, seems less able to deal with this sort of resistance allele than other designs unless it targets an essential gene, which the authors have noted does not appear to be the case with their designs in this study. This should be more clearly laid out at this point in the discussion.

We modified this statement and also added a sentence to further improve this section making clear that while the targeted genes don’t appear to be essential (or rather, could readily be modified), we don’t know for sure that negative effects won’t result from their disruption.

Additionally, the amplicon sequencing was perhaps not the optimal experiment to directly measure the rate of paternal carryover. While it is certainly clear from the data that there was substantial maternal carryover, it can't be assessed quantitatively, and it is unclear if there was paternal carryover at all. This is because somatic Cas9 cleavage could account for some altered sequences. Based on the crossed, nearly half the offspring that were sequenced should have both integral drive alleles (almost 100% from homing) and vasa-Cas9 alleles (50% from normal inheritance). Somatic expression of Cas9 and gRNAs in these offspring could produce cleavage in individual cells, resulting in modified target site loci that would subsequently get detected during pooled amplicon sequenced. This needs to be clearly delineated in the results to explain the limitations of the experiment.

Side note: In the future, such an experiment could have been done with only offspring that did not inherit Cas9 (thus preventing acquisition of mutations from leaky somatic cleavage). To avoid amplicon sequencing, the authors could also have sequenced individual offspring (with or without Cas9) and looked for the presence of just 1-2 sequenced, which would confirm alleles by parental deposition (though only if Cas9 cleavage occurred in the very early embryo). The authors could also have done additional crosses with these offspring to isolate complete resistance allele sequences or assess the ability of drive and Cas9 heterozygotes the bias inheritance in a subsequent generation (lack of the ability to do so would be indicative of parental deposition forming resistance alleles).

We included the possibility of somatic cleavage in the relevant section. This is an important point that got lost during revision and we thank the reviewer for picking up on it.

For the resistance alleles that form in the germline, I'm now satisfied with the additional detail and interpretation. These seem to be below the rate that can be detected by the reasonable-sized experiment, which is quite nice.

On the fecundity/pupa hatch rate assessment:

The new data is quite informative and appears to support the notion that the differences in fecundity between lines is not based on the effector construct. There is no difference between the wild-type line and the GFP/CFP lines in Figure 4A. Yet, all the lines with GFP/CFP removed together with the Cre line have lower fecundity than the wild-type line in Figure 4B. Since removing the fluorescent protein should itself have no downside and probably even marginally improve fitness, this means that the reduced fitness is probably due to crossing with the Cre line. It might be best to spell this out a bit more thoroughly in the results. Additionally, the reduced fitness of the lines after crossing with Cre to remove the fluorescent proteins is now more of a minor point not really related to the integral drive effectors themselves, so perhaps this data should be moved to the supplement, leaving only the new data as a main figure.

We now explain more clearly that inbreeding or the interbreeding steps involving the Cre line likely account for the observed fitness effects (and that the marker removal is not a likely contributor).

For the pupa hatch rate, it should be noted if the relatively low level of significance can survive multiple-testing correction. This seems to be obliquely indicated in the text (that it cannot), but maybe something in the figure legend too so that readers don't focus too much on the "*".

This is correct and depending on the test and correction performed (e.g. Kruskal-Wallis test with Dunn's correction) this comparison is not significant (panel D). However, if given the choice we’d rather not complicate the legend especially as we want to be conservative in reporting any possible fitness effects.

On the other hand, the lack of fitness effects when the fluorescent proteins are present (and reducing the expression of the host gene as per Figure 3) does lend evidence to indicate that the target genes are perhaps not essential (as the authors note), increasing the difficulty in dealing with resistance, as noted above.

On the assessment of host gene expression changes:

The quantitative analysis provided in Figure 3D was certainly a useful addition, confirming the reduced expression of the host gene in two of three effectors, particularly in AP2. This indicates that the integral gene method may have difficulty when working with essential genes, which is likely needed to reduce the accumulation of resistance alleles. Yet, the relatively high expression of CP provides hope that the method could work for some target sites. Still, this may be a substantial issue that the integral drive strategy must overcome when targeting essential genes, and this is not clearly spelled out in the discussion when it certainly should be (next to the discussion of reduced host genes). After all, why bother restoring host gene expression in the first place if it unimportant? You could just insert the effector gene (scorpine or something else) without bothering to add the intron needed for host gene expression.

We added a sentence to put essential genes and the necessity for sufficiently restored expression levels in context.

In line 384 (of the marked-up version) of the discussion, it is noted that the reduced mRNA could lead to normal protein levels. However, another possibility is that even somewhat reduced protein levels are not sufficient to have a fitness impact.

We rephrased the sentence to account for this possibility.

As a side note, the similar analysis of scorpine expression is also of note, in which one of the three constructs had very low expression. This further indicates that the integral gene drive method may not be suitable for all candidate sites. This is most likely true for more standard drive designs as well, though, so the issue here may not be due to the design of the integral design itself. However, the authors should still make this clearer in the Results section and in the discussion that one of the three systems had lower expression of scorpine. It is now only obliquely or indirectly referred to at present, but it needs to be clearly presented as a major result of the paper (even if it is also fine to indicate that the reduction was not likely due to the integral system itself).

We added a small section on the observed differences in expression strength to the discussion.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Hoermann A. 2021. Amplicon-sequencing over the Anopheles gambiae CP, AP2 and Aper1 loci after non-autonomous gene drive. NCBI Sequence Read Archive. PRJNA701314

    Supplementary Materials

    Supplementary file 1. Supplementary Tables S1–S7.

    Table S1. Guide RNA and target site characteristics. The start codon of the target gene is indicated within the gRNA sequence (bold) and the protospacer adjacent motif (PAM) separated via a hyphen. All predicted off-target cleavage sites were found to be located in non-coding (NC) regions and the number of mismatches (MM) is indicated. The number of SNPs within the 24 individuals of the G3 strain and within the Ag1000G is indicated. The SNPs observed for CP in the Ag1000G did not pass the quality control. Table S2. Primers used in this study. Table S3. Plasmids used in this study. Table S4. Transmission rate of control-crosses without Cas9. Table S5. Transmission rates and homing rates. Epos and Eneg refer to individuals with or without the effector construct, respectively. The homing rate e was calculated as follows: e = (n*0.5–Eneg)/(n*0.5)*100. Table S6. Modified sequences identified in the Amplicon sequencing. Table S7. Raw data of the transmission blocking assay.

    elife-58791-supp1.xlsx (39.9KB, xlsx)
    Supplementary file 2. GenBank-DNA-files of donor plasmids pD-Sco-CP, pD-ScoG-AP2, and pD-Aper1-Sco.
    elife-58791-supp2.zip (17.2KB, zip)
    Transparent reporting form

    Data Availability Statement

    All data generated or analysed during this study are included in the manuscript and supporting files. Amplicon-sequencing raw data have been deposited in SRA under accession code PRJNA701314.

    The following dataset was generated:

    Hoermann A. 2021. Amplicon-sequencing over the Anopheles gambiae CP, AP2 and Aper1 loci after non-autonomous gene drive. NCBI Sequence Read Archive. PRJNA701314


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES