Abstract
UDP-glycosyltransferases (UGTs) enzymes are pivotal in insecticide resistance by transforming hydrophobic substrates into more hydrophilic forms for efficient cell elimination. This study provides the first comprehensive investigation of Anopheles funestus UGT genes, their evolution, and their association with pyrethroid resistance. We employed a genome-wide association study using pooled sequencing (GWAS-PoolSeq) and transcriptomics on pyrethroid-resistant An. funestus, along with deep-targeted sequencing of UGTs in 80 mosquitoes Africa-wide. UGT310B2 was consistently overexpressed Africa-wide and significant gene-wise Fst differentiation was observed between resistant and susceptible populations: UGT301C2 and UGT302A3 in Malawi, and UGT306C2 in Uganda. Additionally, nonsynonymous mutations in UGT genes were identified. Gene-wise Tajima's D density curves provide insights into population structures within populations across these countries, supporting previous observations. These findings have important implications for current An. funestus control strategies facilitating the prediction of cross-resistance to other UGT-metabolised polar insecticides, thereby guiding more effective and targeted insecticide resistance management efforts.
Keywords: Insecticide resistance, Vector control, UDP-glycosyltransferases, Genomics, Transcriptomics, Target sequencing
Highlights
-
•
Comprehensive Investigation of UGTs' role in pyrethroid resistance in both laboratory and field An. funestus populations collected from Southern Africa (Malawi), Central West Africa (Cameroon) and East Africa (Uganda).
-
•
Identified UGTs as potential contributors to pyrethroid resistance compromising pyrethroid efficacy and potential cross-resistance to polar insecticides, impacting current vector control strategies.
-
•
Noted consistent overexpression of UGT310B2 in various African regions, confirmed in the FUMOZ colony.
-
•
In Malawi, UGT302A3 and UGT301C2 are the most differentiated detoxification genes, highlighting the potential role of UGTs in detoxification in this region.
-
•
This study highlights the complexity of insecticide resistance and the geographic-specific variability in resistance mechanisms.
1. Introduction
In sub-Saharan Africa, malaria remains a significant cause of morbidity and mortality, particularly among pregnant women and children under 5 years old. It accounts for >96% of malaria-related deaths globally [1]. The primary strategies for malaria control depend heavily on insecticide-based interventions, such as indoor residual spray (IRS) and long-lasting insecticide nets (LLINs) [1,2]. These interventions have been highly successful in reducing malaria cases and associated morbidity, globally preventing over 2 billion malaria cases and 11.7 million malaria-related deaths between 2000 and 2021 [1]. However, several mosquito species have developed multiple and cross-resistance to several insecticides, including pyrethroids the main compound approved by WHO for LLINs. Unless resistance management strategies are designed by elucidating the evolutionary and molecular basis of resistance, recent gains in reducing malaria burden could be lost [3,4].
Anopheles funestus, a major malaria vector across sub-Saharan Africa, has demonstrated resistance to several insecticides, including pyrethroids, jeopardising recent efforts to eradicate and control malaria [[5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19]]. The primary mechanisms of insecticide resistance are target site resistance and metabolic resistance [20]. In An. funestus, metabolic resistance plays a predominant role, as the high expression of efficient detoxifying enzymes allows for the rapid removal or destruction of insecticides [21]. Metabolic resistance involves three-phase metabolic pathways present in all major groups of organisms. These pathways consist of modification, biotransformation, and excretion of toxic insecticides [21]. During Phase I, the toxicity of insecticides is reduced when a reactive and polar group is added to the substrate by a variety of enzymes including, cytochrome P450 monooxygenases (P450s), esterases, alcohol dehydrogenases and aldehyde dehydrogenases. In Phase II, the activated metabolites produced by the Phase I reactions are conjugated with charged species and bio-transformed into more polar and soluble metabolites that can be actively transported [22,23]. The Phase II reactions are catalysed by a variety of transferases, including sulfotransferases, glutathione S-transferases (GSTs) and UDP-glycosyltransferases (UGTs) [24]. Conjugated toxins are then excreted from the cells into the extracellular medium via Phase III, where a variety of membrane-bound transporters are involved, notably ATP-binding cassette (ABC) transporters [24].
Previous studies have highlighted the role of An. funestus P450s enzymes, including CYP6P9a, CYP6P9b, CYP9J11, CYP6Z1, CYP6M7 and CYP9K1, in metabolising pyrethroids, leading to significant depletion and reduced efficacy of pyrethroids-treated bed nets [7,8,14,17,25,26]. Additional classes of detoxification enzymes in An. funestus, such as GSTe2, confer resistance to pyrethroids and DDT through allelic variations that increase metabolic activities, resulting in cross-resistance [15]. The upregulation of other detoxification enzymes belonging to Phase II such as UGTs has been observed in previous transcriptomic studies investigating the molecular basis of resistance to pyrethroid [17,26]. However, despite their established role in detoxification in other insects, UGTs role in pyrethroid resistance in malaria vectors, including An. funestus, remains largely uncharacterised.
UDP-glycosyltransferases (UGTs) constitute a superfamily of enzymes that play a vital role in the biotransformation of various hydrophobic compounds into more hydrophilic products [27]. These enzymes catalyse the covalent addition of a glycosyl group from an active donor, uridine diphosphate (UDP) glucose, to hydrophobic compounds containing hydroxyl, carboxyl, or amino functional groups through the glycosylation reaction [22,[27], [28], [29]]. The resulting glycosides are more polar metabolites that can be easily excreted from the cell by export transporters than the substrate compound. Glucose conjugation is involved in various physiological processes in insects, including pigmentation, cuticle formation (sclerotization) and metabolic detoxification [[30], [31], [32]]. In certain lepidopteran insects, especially, UGT-mediated glycosylation activities are associated with resistance to plant defensive allelochemicals. These activities have been observed in economically important species such as the silkworm Bombyx mori [33], the tobacco hornworms Manduca sexta [34] and the Asian corn borer Ostrinia furnacalis [[35], [36], [37]] UGT-mediated biotransformation of pyrethroids has been suggested in Anopheles sinensis, a common malaria vector in Southeast Asia [38]. Their contribution to resistance against several classes of insecticide has been reported in the Diamondback moth Plutella xylostella and the tomato leafminer Tuta absoluta resistance to chlorantraniliprole [39,40], the housefly Musca domestica resistance to organophosphate [41], and the greenfly cotton aphid Aphis gossypii resistance to neonicotinoid and spirotetramat [42,43].
While genome-wide characterisation of UGT genes, their evolution, and association with insecticide resistance has been conducted in various insects using different sequencing technologies [[44], [45], [46], [47]], UGT genes remain largely uncharacterised in malaria vectors. Although their overexpression in response to pyrethroids exposure has been reported in An. funestus, the investigations of UGTs in this context has been limited compared to other gene families, such as cytochrome p450s or GSTs. Selective sweeps and nonsynonymous mutations associated with UGTs, which could potentially contribute to pyrethroid resistance, have not been previously identified. In this study, we combine genome-wide association of pooled-template sequencing (GWAS-PoolSeq) with transcriptomic analysis of pyrethroids-resistant feild populations of An. funestus and deep sequencing of part of the genome of 80 mosquitoes to comprehensively characterise An. funestus UGTs genome-wide, their evolution, and association with pyrethroid resistance. This study provides the first comprehensive investigation into the role of UGTs role in pyrethroids resistance in An. funestus, the major malaria vector.
2. Methods
2.1. Mosquito collection, rearing and sequencing
The collection, rearing and sequencing of mosquitoes were described in detail previously in [7,11,17,18,26,48]. In brief, two An. funestus laboratory colonies (FANG and FUMOZ) and field mosquitoes from Malawi, Cameroon and Uganda were used in this study. The FUMOZ colony is a multi-insecticide-resistant An. funestus colony derived from Southern Mozambique [49]. While the FANG is an insecticide-susceptible An. funestus colony derived from Angola [49]. Mosquitoes were collected from field populations representing Southern Africa (Malawi), Central West Africa (Cameroon) and East Africa (Uganda). Mosquitoes were collected from Southern Chikwawa (16°1′S, 34°47′E), Malawi in 2002 and January 2014 [13]; Tororo (0°45′N, 34°5′E), Uganda in March 2014 [12], and from Mibellon (6°46′N, 11°70′E), Cameroon in February 2015. The collected F0 mosquitoes were determined to be belonging to the An. funestus group using morphological and molecular identification [50,51]. Genomic DNA was extracted using DNeasy blood and tissue kit (Qiagen, Hilden, Germany) from 40 F0 mosquitoes individually and pooled in equal amounts. Library preparation and whole-genome sequencing by Illumina HiSeq2500 (2 × 150 bp paired-end) were carried out by the Centre for Genomic Research (CGR), University of Liverpool, UK [18,48]. The pooled-sequencing data (PoolSeq) of 40 mosquitoes per pool for F0 populations from field An. funestus populations and two lab strains are available in the European Nucleotide Archive (ENA) under accession numbers PRJEB24384 and PRJEB13485 [18,48].
For transcriptional profiling of pyrethroid resistance in field populations and laboratory colonies, collected F0 gravid mosquitoes were forced to lay eggs using the forced egg-laying method [9]. Egg batches were transported to the Liverpool School of Tropical Medicine (DEFRA) license (PATH/125/2012). Eggs were allowed to hatch in the insectary, and larvae were reared to adulthood in distilled water and fed TetraminTM baby fish food every day. The water of each larvae tray was changed every two days to reduce mortality, according to rearing conditions described previously in [9]. F1 females that are two to five days old were subjected to insecticide resistance bioassays as described previously in [12,13]. F1 females were exposed to permethrin (0.75%) for a varying length of time to define putatively resistant and susceptible mosquitoes. In populations from Malawi and Uganda, susceptible mosquitoes were defined as those that die after 60 min of permethrin (0.75%) exposure and resistant mosquitoes that are still alive after 180 min. Due to lower levels of resistance in the population from Cameroon, susceptible mosquitoes were dead mosquitoes collected after 20 min of exposure and resistant are those that are still alive after 60 min exposure. Total RNA was extracted from pools of 10 female mosquitoes using the Arcturus PicoPure ™ RNA Isolation Kit (Thermo Scientific, MA, USA) as described previously [17,26]. A total of 18 RNA pools were collected for populations from Malawi, Cameroon and Uganda. In each field-collected population, 3 pools of alive mosquitoes after exposure to permethrin and 3 pools of unexposed mosquitoes were collected. In addition, 4 pools each were collected for laboratory colonies FANG and FUMOZ. Library preparation and sequencing by Illumina HiSeq 2500 (2 × 125-bp) were done at the Centre for Genomic Research (CGR), University of Liverpool [17,18]. The pooled RNAseq data are available in the European Nucleotide Archive (ENA) under accession numbers PRJEB24351, PRJEB45224 and PRJEB70998.
2.2. Identification of UGT genes, amino acids alignments, phylogenetic analysis and haplotype networks
An. funestus UGT genes were identified from the recently assembled and annotated FUMOZ and AfunG1 An. funestus genomes, with assembly IDs GCA_003951495.1 and GCF_943734845.2 respectively, based on protein family ID (Pfam ID) PF00201 from the recent protein family database Pfam 34.0 [[52], [53], [54], [55]]. The An. funestus FUMOZ assembly is a genome assembly for the FUMOZ laboratory resistant strain while the AfunG1 assembly is from an individual female specimen from La Lopé, Gabon. An. funestus UGTs amino acid sequences were retrieved from the An. funestus genome annotated protein and then were aligned in Geneious Prime 2022.1.1 (https://www.geneious.com) using Geneious alignment [55]. The signal peptide domain was predicted using SignalP 5.0 server (https://services.healthtech.dtu.dk/service.php?SignalP-5.0).
and transmembrane helices were predicted in the amino acid consensus sequence using TMHMM - 2.0 (https://services.healthtech.dtu.dk/service.php?TMHMM-2.0) [56]. While UGTs conservative motifs and functional domains were identified by alignment with other insect UGTs amino acid sequences and using InterPro scan http://www.ebi.ac.uk/interpro/search/sequence/ [38,44,45].
For the phylogenetic analysis, amino acid sequences of 123 UGT genes from four Diptera species, Anopheles funestus (27), Aedes aegypti (35), Anopheles gambiae (26), Drosophila melanogaster (35) were retrieved from Vectorbase and globally aligned using Clustal Omega alignment built within Geneious Prime 2022.1.1 (https://www.geneious.com) [54]. The phylogenetic tree was built using Geneious built-in FastTree. FastTree uses a modified version of the neighbour-joining algorithm to construct an initial tree that is refined using a maximum-likelihood approach and heuristics are used to speed up the tree-building process [57]. The phylogenetic tree was edited using the online tool, Interactive Tree of Life (http://itol.embl.de/). An. funestus UGT genes were named based on the UGT nomenclature system [58].
To construct the Templeton, Crandall and Sing (TCS) haplotype network, UGT haplotypes for all individuals were extracted from the variant calling files using bcftools [59] resulting in 160 haplotypes for each UGT gene. Haplotypes were aligned using MUSCLE aligner [60] and TCS haplotype network was built using POPART software [61].
2.3. Differential expression analysis of total RNA and analysis of PoolSeq data
All the 21 UGT genes annotated in the FUMOZ genome including the partially sequenced UGT308B3 (AFUN018708) were investigated for association with pyrethroid resistance using GWAS-PoolSeq and Differentially expressed genes (DEGs) analyses.
To identify UGTs that are overexpressed in response to exposure to permethrin, differentially expressed genes (DEGs) analyses between mosquitoes alive after exposure to permethrin and unexposed populations from Malawi, Uganda, and Cameroon were performed using pools of total RNA. In addition, differentially expressed gene analyses were performed by contrasting transcription profiles of populations resistant to permethrin from Malawi, Uganda, Cameroon, and laboratory-resistant colony (FUMOZ) against the transcription profile of laboratory-susceptible (FANG) (See Materials and Methods for details). In each separate contrast, DEGs were determined globally while DEGs of detoxification genes were highlighted including differential expressions of UGT genes (Supplementary Fig. 3). The analysis involves initial pre-processing of raw reads, alignment to the FUMOZ reference genome using the AfunF3.2 annotation (Assembly ID GCA_003951495.1) and count of reads on the gene level. The DEG analysis was conducted using edgeR [62]. Differentially regulated genes were defined in each separate contrast as those with a corrected p-value threshold of <0.05 and log2 fold change >1.
To validate the expression profiles of select UGTs using quantitative real-time PCR, RNA was extracted in pools of 10 from 3 to 5 days old females randomly selected and unexposed to any insecticides using Arcturus PicoPure RNA isolation kit (Applied Biosystems, CA, USA) according to the manufacturer's instructions. Reverse transcription and qRT-PCR were performed using Invitrogen reverse transcriptase kit (Invitrogen, Waltham, CA, USA) and SYBR green master mix kit (Sigma, Aldrich, Germany) respectively according to the manufacturer's instructions. The expression of RNA was calculated using ddCT protocol [63] and normalised to the expression of Ribosomal protein S7 housekeeping gene. The efficiency of the primers was incorporated into the analysis and values were finally converted into their logarithmic forms for normal distribution. The primer sequences are provided in Supplementary Table S9.
For the PoolSeq analysis, DNA pools from An. funestus laboratory colonies (FANG and FUMOZ) and F0 field-collected mosquitoes were examined for a signature of selection in the UGT gene family. The pre-processing of raw reads, alignment and variant calling was described previously [7]. In each population, pools of genomic DNA were analysed by calculating the gene-wise ratio between nucleotide diversity of nonsynonymous polymorphism and nucleotide diversity of synonymous polymorphism (πn/πs) for all genes including the 21 UGT genes using snpgenie [64] for details see [48].
2.4. Design of the SureSelect bait and target enrichment sequencing of candidate resistance regions from laboratory and field-collected colonies
Target sequencing baits were designed using the SureSelect DNA Advanced Design Wizard in the eArray program of Agilent. The design and sequencing of the SureSelect experiment were described in detail previously [7] In summary, a total of 1302 target sequences were included in the enrichment sequencing, constituting 3,059,528 bp. The library preparation and sequencing were performed by the Centre for Genomic Research (CGR), University of Liverpool, using the SureSelect target enrichment custom kit. The libraries were sequenced in 2 × 150 bp paired-end fragments on an Illumina MiSeq with 20 samples per run. The regions targeted by the enrichment sequencing include a selection of detoxification genes potentially involved in insecticide resistance, heat shock proteins, immune response genes and odorant binding proteins (see [7] for further details). In addition, all the genes in the major candidate trait loci associated with pyrethroid resistance previously identified were included in the targeted sequencing. These include a 120 kb region Bacterial Artificial Chromosome (BAC) clone of rp1 (resistance to pyrethroid 1) locus on chromosome 2R and a 113 kb BCA clone of rp2 on chromosome 2 L [19,65].
This fine-scale targeted technique was used to sequence part of the genome of a total of 80 individual mosquitoes from the two lab colonies (FANG and FUMOZ) and F1 field-collected mosquitoes from Cameroon, Malawi and Uganda. For field-collected populations exposure duration for susceptibility assay for each population is described in 2.1, 10 putatively F1 permethrin susceptible females and 10 F1 resistant females were targeted by the SureSelect bait. In addition, 10 mosquitoes were targeted from each laboratory colony FANG, FUMOZ. Centre for Genomic Research (CGR), the University of Liverpool, conducted the library construction and capture using the SureSelect target enrichment custom kit. The libraries were subjected to paired-end sequencing (2 × 150 bp) on an Illumina MiSeq instrument with 20 samples per run [7]. A broad-scale analysis of these data can be found in [7].
2.5. Analysis of the SureSelect data
SureSelect pair-end trimmed reads were aligned using BWA (version 0.7.17) against An. funestus FUMOZ reference genome (Assembly ID GCA_003951495.1) [55]. Sequence alignment map (sam) files were converted to binary alignment map (bam) files using samtools (1.13) [66,67]. Using Picard tools (2.26.3) alignment bam files were sorted according to the position in the reference genome, duplicated reads were marked, and reads were assigned a new read group tag [68]. Variant calling was carried out using freebayes (v1.2.0-dirty) by setting the number of alleles to be considered to 4 using the option (−-use-best-n-alleles 4) to reduce run time and all other options were set to default [69]. Subsequently, resulting variants were filtered using vcffilter (vcflib version 1.0.0_rc2) based on Phred-scaled quality-score >20 (QUAL ≥ 20) keeping 531,726 variants [70]. SnpEff (5.0) was used to annotate and predict the effects of genetic variants between samples and the FUMOZ reference genome [71]. Variants were further filtered by removing SNPs with missing values, removing all indels, and only retaining bi-allelic SNPs. Variants were separated by country to multiple VCF files so when filtered by missing values a maximum number of SNPs will be retained (Supplementary Table S6) [66,67].
2.6. Gene-wise FST differentiation and allele frequency spectrum (AFS) test statistics between pyrethroid resistance and putatively susceptible populations
To conduct population genetics analyses on a gene level from the SureSelect target enrichment sequencing only genes with an average coverage of 100 and above calculated in jvarkit https://github.com/lindenb/jvarkit using merged alignment files of all the FANG and FUMOZ individuals were retained. The target enrichment sequencing covered 807 genes, compromising 6.16% of the total number of genes in the FUMOZ reference genome. UGT genes associated with pyrethroid resistance were detected using gene-wise allele frequency spectrum (AFS)-based methods on biallelic SNPs within targeted regions. For each gene targeted by the enrichment sequencing, gene-wise FST was calculated using PopGenome R package to investigate divergence between resistant (alive) and susceptible (dead) Africa-wide, in each country and between laboratory colonies (FANG and FUMOZ) [72]. A p-value for each gene-wise FST was calculated using bootstrapping process. To understand how much variability there might be in the gene-wise FST values due to a chance we generated 1000 bootstrap samples by resampling from the gene-wise FST values. P-values were determined by comparing the distribution of bootstrap FST values to the observed FST values. Genes with gene-wise FST values that are in the top 0.05 quantiles of genes per chromosome and a p-value <0.05 were considered significant (Supplementary Dataset 5, Fig. 3, Supplementary Fig. 6). Additionally, the genomic fragment spanning the full length of UGTs targeted by enrichment sequencing were analysed in the 80 mosquitoes Africa-wide. Nucleotide diversity (π) was calculated within each population (resistant and susceptible) as well as between populations from Malawi, Uganda, and Cameroon, and both laboratory colonies the FANG and the FUMOZ.
To detect genomic regions impacted by selections, gene-wise Tajima's D neutrality test statistics were estimated for all genes included in the targeted sequencing. Gene-wise population genetics neutrality statistics including Tajima's D (on synonymous SNPs, nonsynonymous SNPs and combined) Fu, and Li‘s D and F tests, Watterson estimator and composite likelihood ratio (CLR) test [73] were calculated for all genes included by the targeted sequencing using PopGenome R package [72]. Coalescent simulation to derive the expected distribution of gene-wise neutrality tests statistics across the loci targeted by the enrichment analysis was generated using the MS program [74]. Tajima's D values were considered significant if they fall at the 0.05 quantiles at both extremities of the simulated Tajima's D density plot (Fig. 4, Supplementary Dataset 6).
2.7. Fst differentiation between pyrethroid resistance and putatively susceptible populations on the SNP level
For all SNPs in the targeted region by SureSelect, Weir & Cockerham's wc FST for two populations and p FST probabilistic approach to detect the difference in allele frequencies between the resistant and susceptible population in every country, Africa-wide and between laboratory colonies (FANG and FUMOZ) was calculated using vcflib (1.0.0_rc2) [70]. R qvalue package (https://github.com/StoreyLab/qvalue) was used to perform a false discovery rate (FDR) estimation for the list of p-values calculated from the p FST test, a p-value of 0.05 were used to extract significantly divergent SNPs from each analysis. The functional effect of each SNP with significant p FST from each comparison on the coding region was determined by overlapping using SNPs genomic positions. Supplementary Table S6 surmises the total number of SNPs identified in the targeted region, along with the significantly divergent SNPs between resistant and susceptible, and the number of predicted functional effects of significant SNPs including non-synonymous SNPs. The functional effect of each SNP as predicted by SnpEff, a p-value quantifying allele frequency difference (p FST) and Weir & Cockerham's FST (wc FST) are provided in (Supplementary Dataset 7).
3. Results
3.1. Characteristics, phylogenetics, and evolution of An. funestus UDP-glycosyltransferases (UGT) genes
The assembled and annotated Anopheles funestus genome for the FUMOZ resistant-strain colony contains 21 UGTs; 20 genes with complete transcripts located on chromosomes 2 and 3, encoding 511 to 550 amino acids (Table 1), and a partially sequenced UGT gene AFUN018708 (65 amino acids) due to incompleteness in the transcriptome assembly [55]. The latest An. funestus assembly from an individual female specimen from La Lopé, Gabon (AfunG1) contains seven more UGT genes and the partially sequenced UGT gene (AFUN018708) from the FUMOZ colony reference was identified to be a partial sequence of AFUN2_007305(b) later named UGT308B3 according to UGT nomenclature [75]. Overall, the number of predicted genes in the AfunG1 assembly (14,819 genes) is higher than the FUMOZ colony assembly (14,176 genes). In total 27 UGT genes were identified in An. funestus from both assemblies. (Table 1) [55]. Genomics and transcriptomics analyses in this paper used the FUMOZ strain reference genome as its annotation is well-integrated within VectorBase.
Table 1.
Family | subfamily | Official name | FUMOZ assembly VectorBase ID | AfunG1 VectorBase ID | Chr | Start | End | Strand | Gene length (bp) |
aa sequence length |
---|---|---|---|---|---|---|---|---|---|---|
UGT301 | A3 | UGT301A3 | AFUN016158 | AFUN2_011791 | 2 | 82,046,643 | 82,050,536 | + | 3893 | 537 |
C2 | UGT301C2 | AFUN004354 | AFUN2_010809 | 2 | 82,055,953 | 82,057,683 | + | 1730 | 528 | |
E3 | UGT301E3 | AFUN016159 | AFUN2_011791 | 2 | 82,044,343 | 82,046,282 | + | 1939 | 531 | |
UGT302 | A3 | UGT302A3 | AFUN019845 | AFUN2_003248 | 3 | 24,375,218 | 24,377,273 | − | 2055 | 532 |
H3 | UGT302H3 | AFUN2_006228 | 3 | 18,529,610 | 18,531,839 | − | 2229 | 527 | ||
J2 | UGT302J2 | AFUN2_001555 | 3 | 18,532,418 | 18,535,025 | + | 2607 | 523 | ||
UGT306 | A3 | UGT306A3 | AFUN016302 | AFUN2_005487 | 3 | 383,756 | 385,945 | − | 2189 | 522 |
C2 | UGT306C2 | AFUN005786 | AFUN2_005888 | 3 | 381,521 | 383,356 | − | 1835 | 514 | |
D2 | UGT306D2 | AFUN011189 | AFUN2_006129 | 3 | 91,183,810 | 91,186,351 | − | 2541 | 517 | |
UGT308 | A3 | UGT308A3 | AFUN2_003087 | 3 | 19,909,980 | 19,911,965 | + | 1985 | 543 | |
B3 | UGT308B3 | AFUN2_007305(b) | 3 | 19,912,087 | 19,916,060 | + | 3973 | 521 | ||
C3 | UGT308C3 | AFUN2_007018 | 3 | 19,907,113 | 19,909,550 | − | 2437 | 518 | ||
D2 | UGT308D2 | AFUN020198 | AFUN2_006461 | 3 | 10,785,724 | 10,787,606 | + | 1882 | 529 | |
F2 | UGT308F2 | AFUN2_008690 | 3 | 19,904,812 | 19,907,016 | + | 2204 | 524 | ||
G2 | UGT308G2 | AFUN004976 | AFUN2_007038 | 2 | 88,001,139 | 88,003,007 | + | 1868 | 521 | |
G3 | UGT308G3 | AFUN009064 | AFUN2_007038 | 2 | 87,998,349 | 88,000,579 | + | 2230 | 521 | |
G4 | UGT308G4 | AFUN019724 | AFUN2_002758 | 2 | 87,995,100 | 87,997,861 | + | 2761 | 524 | |
H2 | UGT308H2 | AFUN2_007305(a) | 3 | 19,912,087 | 19,916,060 | + | 3973 | 518 | ||
UGT309 | B2 | UGT309B2 | AFUN002692 | AFUN2_006612 | 3 | 59,521,361 | 59,523,238 | − | 1877 | 537 |
UGT310 | B2 | UGT310B2 | AFUN011266 | AFUN2_006376 | 2 | 38,876,258 | 38,878,428 | − | 2170 | 517 |
UGT313 | B2 | UGT313B2 | AFUN002865 | AFUN2_001585 | 2 | 73,697,488 | 73,703,548 | + | 6060 | 537 |
UGT314 | A3 | UGT314A3 | AFUN005498 | AFUN2_006345 | 2 | 37,732,412 | 37,740,072 | + | 7660 | 536 |
UGT315 | A3 | UGT315A3 | AFUN000679 | AFUN2_006363 | 3 | 40,270,911 | 40,273,123 | + | 2212 | 550 |
UGT36 | B3 | UGT36B3 | AFUN003590 | AFUN2_010674 | 2 | 90,789,423 | 90,801,509 | − | 12,086 | 537 |
C3 | UGT36C3 | AFUN003593 | AFUN2_011670 | 2 | 90,804,657 | 90,812,858 | − | 8201 | 517 | |
UGT49 | A4 | UGT49A4 | AFUN002058 | AFUN2_013956 | 3 | 3,257,425 | 3,259,981 | + | 2556 | 516 |
UGT50 | B8 | UGT50B8 | AFUN002999 | AFUN2_011767 | 2 | 16,896,116 | 16,933,648 | + | 37,532 | 511 |
Phylogenetic analysis of a total of 123 protein sequences of UGT genes from four Diptera species, Anopheles funestus (27), Aedes aegypti (35), Anopheles gambiae (26), and Drosophila melanogaster (35), divides UGT genes to 20 canonical families according to the UGT nomenclature committee (Fig. 1) (Supplementary Dataset 1) [58]. The phylogenetic tree reveals lineage-specific expansion and interspecific conservation of the UGT families. Five out of the 20 families are common in all four Diptera species (UGT36, UGT49, UGT50, UGT301 and UGT302), while 8 families are specific to Drosophila melanogaster (UGT35, UGT37, UGT303, UGT304, UGT305, UGT307, UGT316, and UGT317) and the remaining 7 families are specific to Culicidae species (UGT306, UGT308, UGT309, UGT310, UGT313, UGT314 and UGT315).
Anopheles funestus UGT genes were distributed into 12 families, of which 9 UGTs were clustered within the UGT308 family (Fig. 1, Table 1). The UGT308 gene family demonstrates significant gene expansion, making it the largest in the overall phylogenetic tree and the largest among the UGT families in An. funestus. There are 3 An. funestus UGT genes each in UGT families UGT301, UGT302 and UGT306 with a close 1:1 orthologue gene between An. funestus and An. gambiae. There are two An. funestus UGT genes in the UGT36 family belonging to subfamilies B and C are close orthologous to the UGTs from An. gambiae and Ae. Aegypti belonging to the same subfamilies. There is only one An. funestus UGT each in UGT49, UGT50, UGT309, UGT310, UGT313, UGT314 and UGT315 families (Fig. 1, Table 1). The species phylogeny among the four Diptera species is mirrored by the branching pattern of the UGT50 family, the An. funestus UGT50 gene is closer to An. gambiae and Ae. Aegypti with a pairwise identity of 93.1% aaID and 79.6% aaID respectively than D. melanogaster DmUgt50B3 with 58.6% aaID (Fig. 1) (Supplementary Dataset 2).
Protein function analysis using the consensus sequence and UGT50B8 amino acid sequence predicted a signal peptide at the N-terminal, a transmembrane domain, and a cytoplasmic tail at the C-terminal domain (Supplementary Fig. 2). A significant similarity is observed at the UGT signature motif of 29 aa-long sequences (consensus: FITHGGLLSTQEAIYHGVPIVGIPFFGDQ) found in the middle of the C-terminal domain with two residues conserved in all UGT genes (G415, P428) [58]. Other conserved signature motif sequences involved in sugar donor binding regions (DBR1 and DBR2) and catalytic mechanisms are found in the C-terminal domain comparable to mammalian and insect UGT genes (Supplementary Fig. 1).
3.2. Studying UGT association with pyrethroid resistance using DNA and RNA pools
The analysis of genomic DNA pools involves the calculation of the gene-wise ratio between nucleotide diversity of nonsynonymous polymorphism and nucleotide diversity of synonymous polymorphism (πn/πs) for all genes including the 21 UGT genes [7,48]. The results are presented in (Supplementary Dataset 3) elucidating the association of genes with pyrethroid resistance. The (πn/πs) ratio gene-wise for all the UGT genes in all the populations is generally <1, implying stabilizing or purifying selection acting against changes in the amino acid sequence, except for UGT315A3 (AFUN000679) in the FUMOZ population where to investigate their association with pyrethroid resistance a positive selection was detected driving changes in the protein sequence.
Results for differentially expressed genes (DEGs) globally in each separate contrast while highlighting detoxification genes including UGT genes are presented in (Supplementary Fig. 3). Differential expression analyses between mosquitoes resistant and unexposed to permethrin in populations from Uganda and Cameroon did not detect significant changes in the expression level of detoxification genes including UGTs. However, the expression contrast in Malawi between resistant and unexposed populations detected significant differential expression of 7 UGT genes; 5 genes were upregulated, and 2 genes were downregulated. In Malawi, 4 of the genes that are upregulated are also upregulated when the resistant population is compared with FANG (Fig. 2). UGT310B2 (AFUN011266) is significantly upregulated in all resistant populations to permethrin when compared to the FANG population, 2.7 FC in FUMOZ, 5.6 in Malawi, 2.6 in Uganda and 3.5 FC in Cameroon (Fig. 2B). While UGT301C2 (AFUN004354) overexpression was specific to the FUMOZ population (2.2 FC) when compared with FANG (Fig. 2, Supplementary Fig. 4, Supplementary Dataset 4). The overexpression of UGT310B2 in FUMOZ compared to FANG was detected using quantitative PCR (qPCR) (Fig. 2C). Detecting UGT301C2 by qPCR was challenging due to its relative overexpression in the FUMOZ colony compared to the FANG (Fig. 2C, Supplementary Dataset 4).
Overall, global transcriptional regulation in resistant populations in response to permethrin exposure, when compared to unexposed populations from the same region, was more evident in Malawi compared to Uganda and Cameroon (Supplementary Fig. 5). In Malawi, a total of 1826 significant DEGs were detected, and 48 of those genes belong to the major detoxification gene families, 23 cytochrome P450s, 12 carboxylesterases, 4 glutathione S-transferases, 7 UDP-glycosyltransferases and 2 ABC transporters. In the population from Cameroon, only a single carboxylesterase gene (AFUN002514) was detected to be upregulated [17]. While the Ugandan resistant population overexpresses three cytochromes P450 genes AFUN015739 (CYP307A1), AFUN020895 and AFUN019365 compared to the unexposed population from the same country (Supplementary Fig. 3, Supplementary Dataset 4) (Supplementary text 1).
3.3. Identifying selection and divergence in an. Funestus UGT genes using targeted sequencing
Gene-wise allele frequency spectrum-based (AFS) summary statistics were calculated using biallelic SNPs within targeted regions. A total of 136,348 bi-allelic polymorphic sites were retained for AFS analysis across 80 mosquitoes Africa-wide, divided into equal numbers of dead and alive across the continent and in each country (see methods). In each country, 91,619, 90,340, and 49,762 polymorphic biallelic sites were detected respectively in Malawi, Uganda and Cameroon (Table S1). Identified polymorphic sites are divided between genes targeted by enrichment sequencing including 61 genes on the X chromosome, 431 genes on Chr2 and 315 genes on Chr3 (807 in total) (Table S1). Many of those genes belong to detoxification gene families, including P450 (66 genes), GSTs (12 genes), COEs (5 genes), UGTs (12 genes), ABC transporters (14 genes), and some of the remaining 698 genes could also be associated with pyrethroid resistance (Supplementary Text 2).
Low polymorphism is present in samples collected from Malawi compared to populations from other countries. Additionally, low polymorphism was detected between laboratory colonies FANG and FUMOZ compared to all field isolates, where 28,566 polymorphic sites were detected. A low level of polymorphism in the field population from Malawi and the FANG laboratory colony, originally from Angola, is expected when FUMOZ, originally from Mozambique, is used as a reference genome since all populations are from Southern Africa (Table S1).
Africa-wide gene-wise FST test between dead and alive mosquitoes across the investigated countries detected a significant gene-wise FST value of 0.07463 for UGT301C2 on the top 0.5 quantiles (Chromosome 2 0.95 quantiles = 0.048) (Fig. 3) (Table S2 and S3). To some extent, a geographical pattern of elevated gene-wise FST for UGT genes was evident in samples collected from Southern African populations (FANG, FUMOZ and Malawi) with elevated gene-wise FST for UGT301C2 and UGT314A3, compared to Uganda (East Africa) and Cameroon (Central West Africa) with a shared high gene-wise FST for UGT306C2 (Fig. 3) (Table S3). The commonly overexpressed UGT310B2 was not included in the targeted sequencing to investigate if there is a link between differentiation and overexpression. UGT301C2 which is overexpressed in the FUMOZ colony is highly differentiated between FUMOZ and FANG populations but not significant. Gene-wise Tajima's D density curve of genes included in the enrichment sequencing may reflect the demographic history of An. funestus population in those countries (Fig. 4 and Supplementary Fig. 7).
3.3.1. Gene-wise differentiation and selection in laboratory colonies (FANG and the FUMOZ) and Malawi
Gene-wise FST differentiation between the susceptible laboratory colony the FANG and the resistant laboratory colony the FUMOZ did not detect UGTs with significant differentiation among the top 0.05 quantiles in respective chromosomes, but UGT301C2 and UGT314A3 on chromosome 2 are the most differentiated UGT genes (Fig. 3) (Table S2). The average gene-wise FST values in the analysis between FANG and FUMOZ were higher than all the other analyses between putatively susceptible and resistant from other countries, revealing the genetic difference between the two geographically distant isolates (Fig. 3) (Table S2). Furthermore, in Malawi (South Africa), significant differentiation of UGT genes UGT301C2 on chromosome 2 and UGT302A3 on chromosome 3 was identified, among the top 0.05 quantiles on respective chromosomes (0.95 quantiles of chromosome 2 = 0.069 and chromosome 3 = 0.085), with gene-wise FST of 0.9142 and 0.12021 respectively. (Fig. 3 and Table S3). Meanwhile, differentiation of UGT314A3 on chromosome 2 in populations from Malawi was similar to the observed differentiation between laboratory colonies (the FANG and the FUMOZ) only higher than 80% of gene-wise FST values and not significant (Fig. 3).
In all populations from Southern Africa including Malawi, FANG colony (derived from Angola) and FUMOZ colony (derived from Mozambique) Tajima's D density curves were close to equilibrium, represented by the coalescent simulation curve (Fig. 4B). In the FUMOZ population, 7 UGT genes out of the 12 UGTs targeted in this study have a high gene-wise Tajima's D than 0.95 quantiles of simulated Tajima's D values and the gene-wise Tajima's D average in FUMOZ is higher than the average of simulated Tajima's D for all chromosomes (Fig. 4) (Table S4 and S5). Similarly, in the susceptible FANG population, Tajima's D average for all chromosomes is higher than the coalescent simulation average (Table S4). This observation may indicate a strong balancing selection or decrease in population size (sudden population contraction) in the laboratory colony populations, probably introduced by laboratory propagation. However, in Malawi, UGT301C2 has a negative Tajima's D value of −1.8243 below the 0.05 quantiles of simulated Tajima's D, deviating from the empirical distribution of gene-wise Tajima's D values on chromosome 2 indicating a recent selective sweep driven by directional selection (Fig. 4, Table S5).
3.3.2. Gene-wise differentiation and selection in central West Africa (Cameroon) and East Africa (Uganda)
When comparing susceptible and resistant populations from Uganda, UGT306C2 on chromosome 3 had a significant differentiation with gene-wise with FST value of 0.0494 within the top 0.05 quantiles, while in Cameroon gene-wise FST for UGT306C2 was high but not significant (Fig. 3, Table S3 and Supplementary Dataset 5).
In Uganda and Cameroon, gene-wise Tajima's D values were predominantly skewed towards negative values. In Cameroon, the average and median of gene-wise Tajima's D per chromosome are below the 0.05 quantiles of simulated Tajima's D for corresponding Chromosomes. There were 7 UGT genes in Cameroon below 0.05 quantiles of simulated Tajima's D and most UGT genes in Uganda have negative Tajima's D values but not within the lowest 5% of simulated Tajima's D for corresponding chromosomes (Fig. 4, Supplementary Fig. 7, Table S4 and S5).
3.4. Analysis of UGTs polymorphism across Africa identified nonsynonymous SNPs potentially associated with resistance
Low gene-wise nucleotide diversity was detected in the susceptible FANG population for all UGT genes compared to other populations probably due to the lack of selection pressure and consistent population genetic drift introduced by laboratory maintenance of the colony (Fig. 5A). The low diversity of the FANG UGTs haplotype is evident in haplotype networks were FANG haplotypes mostly cluster together. Gene-wise nucleotide diversity varies between UGT genes, and the number of polymorphic substitutions relates to the gene size (Fig. B). Analysis of the Templeton, Crandall and Sing (TCS) haplotype tree for UGTs targeted by the enrichment sequencing from 80 mosquitos (160 haplotypes) highlights the high polymorphism of UGTs across Africa with many singleton haplotypes separated by many mutational steps, except for UGT306C2, UGT306A3 and UGT301C2 where predominant haplotypes shared by different populations were detected (Fig. 5C-E).
To identify SNPs that are potentially associated with pyrethroid resistance within UGTs, we identified significantly differentiated SNPs based on p-values quantifying differences in allele frequencies (pFst) between susceptible and resistant populations in each country (Table S6 and S7) In comparison between laboratory colonies FANG and the FUMOZ, 30.5% of targeted UGT SNPs showed significant differentiation, including 26 non-synonymous variants. In Malawi, 11.6% of UGT SNPs are significantly differentiated, with 6 genes containing 14 significant nonsynonymous SNPs. Cameroon shows 5.4% of significantly differentiated UGT SNPs, including 4 nonsynonymous SNPs on 4 genes. Uganda exhibits 8.6% of highly divergent UGT SNPs, with 5 significant nonsynonymous SNPs on 4 UGTs (Supplementary Fig. S8 and S9).
We focused our investigation on significantly differentiated non-synonymous SNPs, especially non-synonymous SNPs that could potentially affect the enzyme catalytic activities occurring close to the conserved motifs involved in binding to the glycosyl donor (Fig. 6). Three Nonsynonymous SNPs were detected at the conserved motif of three different UGTs occurring only in susceptible populations (Supplementary Fig. 11) (Table S8). In addition, nonsynonymous changes were detected between sugar donor binding residues 1 and 2 (DBR1 and 2) in UGT306C2 (c.945 T > A, p.His315Gln) only in 3 haplotypes from the resistant Malawi population, and in UGT302A3 (c.962 A > G, p.Asn321Ser) occurring in a frequency of 7 haplotypes from the total 20 haplotypes of the FUMOZ population. Other mutations were detected outside the conserved domain on the signal peptide, transmembrane domain, carboxylic tail, and the N-terminal domain that is believed to determine substrate specificity. Notably, a nonsynonymous SNP on the N-terminal domain of UGT36C3 (c.226 A > G, p.Thr76Ala) that is fixed in the resistant population from Uganda and occurs in 17/20 haplotype of the putatively susceptible populations (Table S8).
Overall, the most highly differentiated SNPs in analysis between susceptible in resistant populations from the three countries belong to detoxification genes other than UGTs except for Malawi, where SNPs on UGT302A3 are the most highly differentiated SNPs (Supplementary Fig. 9) (Supplementary Dataset 7). A nonsynonymous SNP in the N-terminal of UGT302A3 (c.334C > G, p.Gln112Glu) is among the most highly differentiated SNPs in Malawi, occurring in 12 haplotypes of the resistant population and only 1 haplotype of the susceptible population. While in Cameroon the most highly differentiated SNPs in detoxification genes belong to CYP304B1 and CYP6AK1 and in Uganda belong to GSTE7 and CYP6Z1 (Supplementary Dataset 7).
4. Discussion
It is essential to improve our understanding of insecticide resistance since malaria prevention is heavily dependent on insecticide-based interventions. Although other gene families such as cytochrome p450s or GSTs have been extensively studied, the role of UGT genes in pyrethroid resistance remains largely unexplored in malaria vectors. Our study provides a comprehensive genome-wide characterisation of the UGT gene family in An. funestus and investigates their expression and selection in geographically distinct populations. A major outcome of the study was the detection of overexpressed and significantly differentiated UGTs in field-collected pyrethroid-resistant populations of An. funestus.
4.1. Characterisation and evolution of An. funestus UGTs
The number of UGT genes identified in An. funestus (27) is comparable to that of An. gambiae (26), however both of Ae. aegypti and D. melanogaster contain 35 UGT genes. An. funestus UGTs exhibit typical characteristics of enzymes that catalyse glycosylation at the C-terminal domain containing significant conservation of 29 amino acids signature motifs and UDP-glycosyl donor binding regions (DBR1 and DBR2), as described for other insects [27,45]. On the other hand, the N-terminal domain is highly variable between UGTs and is believed to be involved in substrate specificity [38,44,46]. The ‘loose’ fit of the substrate binding domain at the N-terminal provides a binding site for structurally diverse substrates by the same UGT isoform [22,27]. Additionally, a signal peptide is located at the N-terminal indicating that UGTs protein precursor is destined towards the secretory pathway, while the retention of those proteins inside the ER membrane is mediated by the C-terminal hydrophobic transmembrane domain that spans the membrane with a group I topology. Most of the UGT protein resides in the ER and only a small portion of the protein, particularly the cytoplasmic tail, resides in the cytosol [27,58].
Previous phylogenetic studies of insect UGTs indicate that UGT50 is the only family that is found universally in all insect species [46]. UGT50 family contains one UGT gene from each Diptera species selected for this investigation and the branching pattern of the UGT50 mirrors the species phylogeny among the four Diptera species investigated in this study [46,76]. The UGT308 family is the largest in An. funestus by gene number and in the overall phylogenetic tree and, where nine An. funestus UGTs clustered within this family. The expansion in the UGT308 family was potentially driven by divergence to increase the number of substrates binding for glycosylation and gene duplication, as illustrated by the subfamily UGT308G in An. funestus and UGT308J in Ae. Aegypti. Based on previous investigations in mosquitoes, genes belonging to the UGT308 family were speculated to be involved in insecticide resistance, and the gene expansion may have evolved to resist consistent exposure to insecticides [38,77]. However, in our investigation, we did not detect overexpression of An. funestus UGT308 genes in resistant field-collected populations.
4.2. Overexpression of UGT310B2 Africa-wide in pyrethroid-resistant populations was detected
In this study, DNA and RNA PoolSeq were analysed to identify an association between An. funestus UGTs and pyrethroid resistance. The investigation started by calculating the gene-wise ratio between nonsynonymous and synonymous polymorphisms pN/pS from pooled DNA. The pN/pS genetic test is a powerful population genetic test that requires few assumptions and is a good indicator of selective pressure at a gene level [78]. pN/pS detected what could be a stabilizing or a purifying selection acting against changes in the protein sequence in all UGTs from field collected mosquitoes from Malawi, Cameroon and Uganda. The pN/pS analysis is limited by only detecting selection pressure in protein-coding regions, however, evolutionary changes in regulatory regions of genes associated with permethrin detoxification may affect the timing and expression level of those genes. Such changes cannot be detected by gene-wise population genetic tests using pooled DNA. Therefore, we investigated the transcriptional regulation of An. funestus UGT genes in response to permethrin exposure using total RNA pools. Pooled RNA samples derived from several biologically similar animals, compared to sequencing corresponding individuals individually, reduce the cost of sequencing while retaining similar biological information.
Insecticide resistance surveillance detected an increase in insecticide resistance in Malawi, Cameroon and Uganda in recent times [7,9,13,17]. Changes in transcriptional regulation in response to permethrin exposure between resistant populations and unexposed populations from the same territory were more evident in Malawi than in Uganda and Cameroon. High resistance levels in Uganda and Cameroon may have obscured the difference in gene expression of detoxification genes induced by permethrin exposure. On the other hand, comparing the transcriptional profile of field-resistant populations and the laboratory-resistant (FUMOZ) to that of the laboratory-susceptible colony (FANG) detected a pronounced difference in detoxification genes expression [17]. We detected overexpression of UGT310B2 (AFUN011266) across all pyrethroid-resistant populations. UGT overexpression in other mosquito vector populations, such as An. sinensis and A. gambiae, have been reported and linked to pyrethroid resistance [38,77]. Although the upregulation of UGTs expression in field-resistant mosquitoes was not as prominent as Phase I detoxification enzymes cytochrome P450s, UGTs still play an important role in the metabolic breakdown of pyrethroid insecticides (Supplementary Fig. S3). UGTs conjugate polar hydrophobic compounds with hydrophilic glycosyl groups, producing more polar metabolites that can be easily excreted from the cell by export transporters [22,[27], [28], [29]]. Pyrethroids are mostly composed of nonhydrocarbon chains and cyclic easter or acid groups and typically lack a polar group that can be glycosylated by UGTs [79,80]. Therefore, in the detoxification process of pyrethroids, they undergo initial oxidation by phase I cytochrome P450s, resulting in the production of more polar and reactive metabolites that potentially can be conjugated by UGTs [81].
The overexpression of CYPs compared to other detoxification enzymes indicates their primary role in the detoxification process of pyrethroids. While relative overexpression of UGTs compliments their roles as secondary enzymes interacting with the oxidised pyrethroid, a by-product of CYPs. Previous studies have identified overexpression of An. funestus cytochrome P450 genes, including CYP6P9a, CYP6P9b, CYP9J11, CYP6Z1, CYP6M7 and CYP9K1 in resistant populations. The recombinant protein of those genes expressed in vitro metabolises permethrin with significant depletion, therefore reducing the efficacy of permethrin-treated bed nets [7,8,14,17,25,26]. The identified overexpression of UGT genes in this study is relevant to current vector control strategies and management. UGTs may contribute to cross-resistance against polar insecticides that can be directly glycosylated enhancing their solubility and excretion. Contribution of UGTs to resistance against other insecticides that contain a potential glycosylation site has been reported in the Diamondback moth Plutella xylostella resistance to chlorantraniliprole [39], the housefly Musca domestica resistance to organophosphate [41], and the greenfly cotton aphid Aphis gossypii resistance to neonicotinoid and spirotetramat [42,43].
4.3. Targeted sequencing reveals differentiated UGTs and population structure in investigated countries
Sequencing DNA from pools of individuals is a cost-effective approach for conducting population genetics studies, which are otherwise economically challenging to sequence many individual genomes at high coverage [82]. However, to avoid pooling biases and detect SNPs at an individual resolution while maintaining a cost-effective approach, SureSelect target enrichment sequencing was implemented on individual mosquitoes. Polymorphism detected in the targeted region reflects the geographical distance between the collected populations and the FUMOZ reference genome originally from Mozambique (Southern Africa). As expected, populations from Southern Africa Malawi and the FANG colony (originally from Angola) harbour lower polymorphism than populations collected from Uganda and Cameroon. The gene-wise differentiation (FST) for UGT genes is generally lower than other detoxification genes in all analyses except in Malawi, where UGT302A3 and UGT301C2 are the most differentiated detoxification genes, highlighting the potentially important role of UGTs in detoxification in this region. In Uganda and Cameroon, the gene-wise differentiation (FST) of some cytochrome p450s and GSTs genes were among the highly differentiated genes illustrating their important role in the detoxification mechanism of pyrethroid in those regions (Supplementary Dataset 5).
A genome-wide process could be inferred from gene-wise Tajima's D density curves for genes in targeted regions revealing a history of An. funestus population in investigated locations, with a likely population expansion after a recent bottleneck effect in Cameroon and Uganda. Gene-wise Tajima's D density curves for populations from Cameroon and Uganda are skewed towards negative values, whereas populations from Southern Africa Malawi, FANG (Angola) and FUMOZ (Mozambique) were close to equilibrium, represented by the coalescent simulation curve. Results in this study, support previous findings of population expansion in the western part of An. funestus population range, west of the African Rift Valley [18,83]. A similar pattern is found in co-distributed malaria vectors An. gambiae and An. coluzzii, suggesting that those species have responded to common geographic constraints with a population expansion north of the Congo basin and west of the East African Rift Valley [84].
4.4. Low-frequency nonsynonymous SNPs within UGTs in field populations show a limited directional selection
Haplotype clustering of UGTs did not reveal a strong directional selection in any of the investigated populations. UGT haplotypes clustered randomly with many singleton haplotypes separated except for UGTs that show significant gene-wise divergence or a potential gene-level selective sweep inferred from Tajima's D. A dominant haplotype was detected in UGT301C2 that is significantly differentiated gene-wise in analysis between resistant and susceptible populations in Malawi and Africa-wide. A potential gene-level selective sweep is detected in Malawi at UGT301C2, while in Cameroon the significantly negative Tajima's D is not a deviation from the genome-wide trend. Haplotype clustering was detected for UGT306A3 and C2 both genes recorded significant Tajima's D in Cameroon while UGT306C2 is significantly differentiated between resistant and putatively susceptible populations from Uganda. The haplotype clustering for UGT301C2 and UGT306A3 revealed limited directional selection with a predominant haplotype detected in field-collected populations, while UGT306C2 haplotypes from field populations cluster in more than one node. The limited selection in UGTs might reflect their role in pyrethroid detoxification as phase II enzymes in the detoxification pathway of pyrethroid compared to the previously described selection for CYP9K1 in Uganda, and CYP6P9A and CYP6P9B in populations from Southern Africa (Supplementary Fig. 10) [7,10,11,14].
The resulting averaged values of gene-wise FST can mask or lower the magnitude of a positive selection where a combination of positive and negative selection within the gene at different times along its evolution may cancel each other out. Most of the significantly differentiated SNPs in all investigated populations are synonymous and intronic. Therefore, we focused the investigation on significantly differentiated nonsynonymous at the conserved motifs. Nonsynonymous mutations causing amino acid changes at conserved motifs acting as a binding site to the UDP-glycosyl donor are found in low frequencies in field-collected populations but with a significant differentiation between putatively susceptible and resistant populations. Other nonsynonymous mutations were detected outside the conserved motifs at the signal peptide, transmembrane domain, carboxylic tail, and the N-terminal domain.
Changes in the N-terminal domain might have a detrimental effect on substrate specificity. However, the N-terminal domain generally exhibits greater sequence divergence between UGT isoforms to enable the glycosylation of diverse substrates by the same UGT isoform [22,27]. Research on allelic variations in human UGTs indicated that amino acid substitutions at key residues can affect the enzyme-substrate selectivity, glycosylation activity and overall drug metabolism [[85], [86], [87], [88]]. Unless those nonsynonymous mutations identified in this research are validated using recombinant proteins substantiating their functional role in pyrethroid detoxification is challenging. To capture the complete effect of those nonsynonymous sites in UGTs activity we recommend using a baculovirus expression system in insect cells to account for post-translational modifications that have a significant impact on their activity [89]. Post-translational modification of UGTs specifically N-glycosylation was demonstrated to be important for UGT protein folding and catalytic activity [85,86,89].
It has been established that direct selection of nonsynonymous SNPs causing amino acid alterations in An. funestus detoxification genes such as P450s [7,8,90] and GSTs [15] enhance their detoxification activities. Despite the low frequency of the UGTs nonsynonymous SNPs outlined in this paper they are the first nonsynonymous SNPs to be reported in An. funestus UGTs. The selection detected in An. funestus UGTs nonsynonymous SNPs is limited and further illustrate the secondary role of An. funestus UGTs in pyrethroid resistance. However, the outlined significantly differentiated UGTs nonsynonymous SNPs will help future prediction of cross-resistance to insecticides that can be directly detoxicated by UGTs.
5. Conclusion
In this study, we have provided a comprehensive investigation of the potential role of UGTs in pyrethroid resistance in resistant laboratory colonies and field-collected An. funestus populations. Findings in this study have important implications for current An. funestus vector control strategies and management as UGT enzymes may confer cross-resistance to other polar insecticides, which can be directly detoxified by UGTs. Notably, we have identified a common overexpression of UGT310B2 across various regions in Africa. The overexpression of UGT310B2 in the FUMOZ colony was confirmed using quantitative PCR. The increased expression of UGT genes implies a mechanism by which mosquitoes can rapidly metabolise and eliminate pyrethroid with a risk of cross-resistance to other polar insecticides. This compromises the ability of pyrethroid-based interventions and reduces their effectiveness in controlling An. funestus population. The gene-wise (FST) differentiation for UGT genes is generally lower than other detoxification genes in all analyses except in Malawi, where UGT302A3 and UGT301C2 are the most differentiated detoxification genes, highlighting the potential role of UGTs in detoxification in this region. In addition, SNPs belonging to UGT302A3 were the most highly differentiated SNPs in Malawi between resistant and putatively susceptible based on p-values quantifying differences in allele frequencies (pFST). In Uganda, a significant gene-wise FST differentiation was detected in UGT306C2. The high gene-wise FST divergence of UGT301C2 and UGT306C2 was supported by a limited selection with limited clustering of haplotypes from those regions in the haplotype tree. Gene-wise Tajima's D density curves infer genome-wide process and reveal population structures of An. funestus population from the three countries, supporting previous observations. In addition, this study identified significantly differentiated UGTs nonsynonymous SNPs that might implicate UGTs detoxification activities and produced detailed records of those SNPs.
This study highlights the complexity of insecticide resistance and the geographic-specific variability in resistance mechanisms. This information is crucial for tailoring vector control strategies to specific regions and populations. Further investigations using more recent field-collected populations across Africa should be carried out to explore the role of UGT genes in pyrethroid resistance and predict their potential for cross-resistance to other insecticides.
Authorship contribution statement
Talal Al-Yazeedi: The design of the research, the experimental work, data analysis, data curation, data visualisation and the preparation of the manuscript. Abdullahi Muhammad: Experimental work, reviewing and editing of the manuscript. Seung-Joon Ahn: Data analysis, reviewing and editing of the manuscript. Helen Irving: Data collection. Jack Hearn: The design of the research, data analysis, data collection and reviewing and editing of the manuscript. Charles S. Wondji: The design of the research, funding acquisition and reviewing and editing of the manuscript.
CRediT authorship contribution statement
Talal Al-Yazeedi: Writing – review & editing, Writing – original draft, Visualization, Project administration, Methodology, Formal analysis, Data curation, Conceptualization. Abdullahi Muhammad: Writing – review & editing, Writing – original draft, Methodology, Formal analysis. Helen Irving: Methodology. Seung-Joon Ahn: Writing – review & editing, Writing – original draft, Methodology, Formal analysis. Jack Hearn: Writing – review & editing, Writing – original draft, Supervision, Methodology, Formal analysis. Charles S. Wondji: Writing – review & editing, Writing – original draft, Supervision, Investigation, Funding acquisition, Conceptualization.
Declaration of competing interest
The authors declare that there are no conflicts of interest.
Acknowledgements
This work was supported by a Wellcome Trust Senior Research Fellowships in Biomedical Sciences to Charles S. Wondji (101893/Z/13/Z and 217188/Z/19/Z) and a Bill and Melinda Gates Foundation grant to CSW (INV-006003). For the purpose of open access, the authors have applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.ygeno.2024.110798.
Appendix A. Supplementary data
Data availability
Read data of pooled whole genome sequencing (PoolSeq) are available in the European Nucleotide Archive (ENA) under accession (PRJEB13485 and PRJEB24384). SureSelect data are available under accession numbers PRJEB24520 (Cameroon), PRJEB47287 (Malawi and Uganda), PRJEB24506 (FANG colony) and PRJEB48958 (FUMOZ colony). RNA-seq data are available under accession numbers PRJEB24351 (Cameroon permethrin resistant), PRJEB45224 (FANG and FUMOZ) and PRJEB70998 (Cameroon unexposed, Malawi permethrin resistant, Malawi unexposed, Uganda permethrin resistant and Uganda unexposed).
References
- 1.Organization, W.H . World Health Organization; 2022. World Malaria Report 2022. [Google Scholar]
- 2.Bhatt S., et al. The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015. Nature. 2015;526(7572):207–211. doi: 10.1038/nature15535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hemingway J. The way forward for vector control. Science. 2017;358(6366):998–999. doi: 10.1126/science.aaj1644. [DOI] [PubMed] [Google Scholar]
- 4.Hemingway J., et al. Averting a malaria disaster: will insecticide resistance derail malaria control? Lancet. 2016;387(10029):1785–1788. doi: 10.1016/S0140-6736(15)00417-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Sinka M.E., et al. The dominant Anopheles vectors of human malaria in Africa, Europe and the Middle East: occurrence data, distribution maps and bionomic précis. Parasit. Vectors. 2010;3:117. doi: 10.1186/1756-3305-3-117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Atoyebi S.M., et al. Investigating the molecular basis of multiple insecticide resistance in a major malaria vector Anopheles funestus (sensu stricto) from Akaka-Remo, Ogun state, Nigeria. Parasit. Vectors. 2020;13(1):423. doi: 10.1186/s13071-020-04296-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hearn J., et al. Multi-omics analysis identifies a CYP9K1 haplotype conferring pyrethroid resistance in the malaria vector Anopheles funestus in East Africa. Mol. Ecol. 2022;31(13):3642–3657. doi: 10.1111/mec.16497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ibrahim S.S., et al. The P450 CYP6Z1 confers carbamate/pyrethroid cross-resistance in a major African malaria vector beside a novel carbamate-insensitive N485I acetylcholinesterase-1 mutation. Mol. Ecol. 2016;25(14):3436–3452. doi: 10.1111/mec.13673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Morgan J.C., et al. Pyrethroid resistance in an Anopheles funestus population from Uganda. PloS One. 2010;5(7) doi: 10.1371/journal.pone.0011872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mugenzi L.M.J., et al. A 6.5-kb intergenic structural variation enhances P450-mediated resistance to pyrethroids in malaria vectors lowering bed net efficacy. Mol. Ecol. 2020;29(22):4395–4411. doi: 10.1111/mec.15645. [DOI] [PubMed] [Google Scholar]
- 11.Mugenzi L.M.J., et al. Cis-regulatory CYP6P9b P450 variants associated with loss of insecticide-treated bed net efficacy against Anopheles funestus. Nat. Commun. 2019;10(1):4652. doi: 10.1038/s41467-019-12686-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mulamba C., et al. Widespread pyrethroid and DDT resistance in the major malaria vector anopheles funestus in East Africa is driven by metabolic resistance mechanisms. PloS One. 2014;9(10) doi: 10.1371/journal.pone.0110058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Riveron J.M., et al. Rise of multiple insecticide resistance in Anopheles funestus in Malawi: a major concern for malaria vector control. Malar. J. 2015;14:344. doi: 10.1186/s12936-015-0877-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Riveron J.M., et al. Directionally selected cytochrome P450 alleles are driving the spread of pyrethroid resistance in the major malaria vector <i>Anopheles funestus</i>. Proc. Natl. Acad. Sci. 2013;110(1):252–257. doi: 10.1073/pnas.1216705110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Riveron J.M., et al. A single mutation in the GSTe2 gene allows tracking of metabolically based insecticide resistance in a major malaria vector. Genome Biol. 2014;15(2):R27. doi: 10.1186/gb-2014-15-2-r27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Tchigossou G., et al. Molecular basis of permethrin and DDT resistance in an Anopheles funestus population from Benin. Parasit. Vectors. 2018;11(1):602. doi: 10.1186/s13071-018-3115-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Weedall G.D., et al. A cytochrome P450 allele confers pyrethroid resistance on a major African malaria vector, reducing insecticide-treated bednet efficacy. Sci. Transl. Med. 2019;11(484) doi: 10.1126/scitranslmed.aat7386. [DOI] [PubMed] [Google Scholar]
- 18.Weedall G.D., et al. An Africa-wide genomic evolution of insecticide resistance in the malaria vector Anopheles funestus involves selective sweeps, copy number variations, gene conversion and transposons. PLoS Genet. 2020;16(6) doi: 10.1371/journal.pgen.1008822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wondji C.S., et al. Two duplicated P450 genes are associated with pyrethroid resistance in Anopheles funestus, a major malaria vector. Genome Res. 2009;19(3):452–459. doi: 10.1101/gr.087916.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Jacob M.R., et al. In: Towards Malaria Elimination. Sylvie M., Vas D., editors. Rijeka; IntechOpen: 2018. Insecticide resistance in malaria vectors: An update at a global scale. (p. Ch. 7) [Google Scholar]
- 21.Hemingway J., et al. The molecular basis of insecticide resistance in mosquitoes. Insect Biochem. Mol. Biol. 2004;34(7):653–665. doi: 10.1016/j.ibmb.2004.03.018. [DOI] [PubMed] [Google Scholar]
- 22.Bock K.W. Vertebrate UDP-glucuronosyltransferases: functional and evolutionary aspects. Biochem. Pharmacol. 2003;66(5):691–696. doi: 10.1016/s0006-2952(03)00296-x. [DOI] [PubMed] [Google Scholar]
- 23.Black W.C.T., et al. From global to local-new insights into features of pyrethroid detoxification in vector mosquitoes. Insects. 2021;12(4) doi: 10.3390/insects12040276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Grant D.M. Detoxification pathways in the liver. J. Inherit. Metab. Dis. 1991;14(4):421–430. doi: 10.1007/BF01797915. [DOI] [PubMed] [Google Scholar]
- 25.Riveron J.M., et al. Genome-wide transcription and functional analyses reveal heterogeneous molecular mechanisms driving Pyrethroids resistance in the major malaria vector Anopheles funestus across Africa. G3 Genes|Genomes|Genetics. 2017;7(6):1819–1832. doi: 10.1534/g3.117.040147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wondji C.S., et al. RNAseq-based gene expression profiling of the Anopheles funestus pyrethroid-resistant strain FUMOZ highlights the predominant role of the duplicated CYP6P9a/b cytochrome P450s. G3 Genes|Genomes|Genetics. 2021;12(1) doi: 10.1093/g3journal/jkab352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Meech R., Mackenzie P.I. Structure and function of uridine diphosphate glucuronosyltransferases. Clin. Exp. Pharmacol. Physiol. 1997;24(12):907–915. doi: 10.1111/j.1440-1681.1997.tb02718.x. [DOI] [PubMed] [Google Scholar]
- 28.Burchell B., Coughtrie M.W.H. UDP-glucuronosyltransferases. Pharmacol. Ther. 1989;43(2):261–289. doi: 10.1016/0163-7258(89)90122-8. [DOI] [PubMed] [Google Scholar]
- 29.Meech R., et al. The UDP-glycosyltransferase (UGT) superfamily: new members, new functions, and novel paradigms. Physiol. Rev. 2019;99(2):1153–1222. doi: 10.1152/physrev.00058.2017. [DOI] [PubMed] [Google Scholar]
- 30.Després L., David J.P., Gallet C. The evolutionary ecology of insect resistance to plant chemicals. Trends Ecol. Evol. 2007;22(6):298–307. doi: 10.1016/j.tree.2007.02.010. [DOI] [PubMed] [Google Scholar]
- 31.Wiesen B., et al. Sequestration of host-plant-derived flavonoids by lycaenid butterflyPolyommatus icarus. J. Chem. Ecol. 1994;20(10):2523–2538. doi: 10.1007/BF02036189. [DOI] [PubMed] [Google Scholar]
- 32.Hopkins T.L., Kramer K.J. Insect cuticle Sclerotization. Annu. Rev. Entomol. 1992;37(1):273–302. [Google Scholar]
- 33.Luque T., Okano K., O’Reilly D.R. Characterization of a novel silkworm (Bombyx mori) phenol UDP-glucosyltransferase. Eur. J. Biochem. 2002;269(3):819–825. doi: 10.1046/j.0014-2956.2001.02723.x. [DOI] [PubMed] [Google Scholar]
- 34.Ahmad S.A., T.L. Hopkins, β-Glucosylation of plant phenolics by phenol β-glucosyltransferase in larval tissues of the tobacco hornworm, Manduca sexta (L.) Insect Biochem. Mol. Biol. 1993;23(5):581–589. [Google Scholar]
- 35.Kojima W., et al. Physiological adaptation of the Asian corn borer Ostrinia furnacalis to chemical defenses of its host plant, maize. J. Insect Physiol. 2010;56(9):1349–1355. doi: 10.1016/j.jinsphys.2010.04.021. [DOI] [PubMed] [Google Scholar]
- 36.Cui X., et al. Molecular mechanism of the UDP-glucuronosyltransferase 2B20-like gene (AccUGT2B20-like) in pesticide resistance of Apis cerana cerana. Front. Genet. 2020;11 doi: 10.3389/fgene.2020.592595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Hu B., et al. The expression of Spodoptera exigua P450 and UGT genes: tissue specificity and response to insecticides. Insect Science. 2019;26(2):199–216. doi: 10.1111/1744-7917.12538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zhou Y., et al. UDP-glycosyltransferase genes and their association and mutations associated with pyrethroid resistance in Anopheles sinensis (Diptera: Culicidae) Malar. J. 2019;18(1):62. doi: 10.1186/s12936-019-2705-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Li X., et al. Over-expression of UDP–glycosyltransferase gene UGT2B17 is involved in chlorantraniliprole resistance in Plutella xylostella (L.) Pest Manag. Sci. 2017;73(7):1402–1409. doi: 10.1002/ps.4469. [DOI] [PubMed] [Google Scholar]
- 40.Grant C., et al. Overexpression of the UDP-glycosyltransferase UGT34A23 confers resistance to the diamide insecticide chlorantraniliprole in the tomato leafminer, Tuta absoluta. Insect Biochem. Mol. Biol. 2023;159 doi: 10.1016/j.ibmb.2023.103983. [DOI] [PubMed] [Google Scholar]
- 41.Lee S.-W., et al. Metabolic resistance mechanisms of the housefly (Musca domestica) resistant to pyraclofos. Pesticide Biochemistry and Physiology. 2006;85(2):76–83. [Google Scholar]
- 42.Chen X., et al. UDP-glucosyltransferases potentially contribute to imidacloprid resistance in Aphis gossypii glover based on transcriptomic and proteomic analyses. Pestic. Biochem. Physiol. 2019;159:98–106. doi: 10.1016/j.pestbp.2019.06.002. [DOI] [PubMed] [Google Scholar]
- 43.Pan Y., et al. UDP-glycosyltransferases contribute to spirotetramat resistance in Aphis gossypii glover. Pestic. Biochem. Physiol. 2020;166 doi: 10.1016/j.pestbp.2020.104565. [DOI] [PubMed] [Google Scholar]
- 44.Li X., et al. Characterization of UDP-glucuronosyltransferase genes and their possible roles in multi-insecticide resistance in Plutella xylostella (L.) Pest Manag. Sci. 2018;74(3):695–704. doi: 10.1002/ps.4765. [DOI] [PubMed] [Google Scholar]
- 45.Ahn S.-J., Vogel H., Heckel D.G. Comparative analysis of the UDP-glycosyltransferase multigene family in insects. Insect Biochem. Mol. Biol. 2012;42(2):133–147. doi: 10.1016/j.ibmb.2011.11.006. [DOI] [PubMed] [Google Scholar]
- 46.Huang F.-F., et al. The UDP-glucosyltransferase multigene family in Bombyx mori. BMC Genomics. 2008;9(1):563. doi: 10.1186/1471-2164-9-563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ahn S.-J., Marygold S.J. The UDP-glycosyltransferase family in Drosophila melanogaster: nomenclature update, gene expression and phylogenetic analysis. Front. Physiol. 2021;12 doi: 10.3389/fphys.2021.648481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hearn J., et al. Gene conversion explains elevated diversity in the immunity modulating APL1 gene of the malaria vector Anopheles funestus. Genes (Basel) 2022;13(6) doi: 10.3390/genes13061102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Hunt R.H., et al. Laboratory selection for and characteristics of pyrethroid resistance in the malaria vector Anopheles funestus. Med. Vet. Entomol. 2005;19(3):271–275. doi: 10.1111/j.1365-2915.2005.00574.x. [DOI] [PubMed] [Google Scholar]
- 50.Gillies M.T., Coetzee M. A supplement to the Anophelinae of Africa south of the Sahara. Publ S Afr Inst Med Res. 1987;55:1–143. [Google Scholar]
- 51.Koekemoer L.L., et al. A cocktail polymerase chain reaction assay to identify members of the Anopheles funestus (Diptera: Culicidae) group. Am. J. Trop. Med. Hyg. 2002;66(6):804–811. doi: 10.4269/ajtmh.2002.66.804. [DOI] [PubMed] [Google Scholar]
- 52.Mistry J., et al. Pfam: the protein families database in 2021. Nucleic Acids Res. 2021;49(D1):D412–D419. doi: 10.1093/nar/gkaa913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Blum M., et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 2021;49(D1):D344–D354. doi: 10.1093/nar/gkaa977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Giraldo-Calderón G.I., et al. VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases. Nucleic Acids Res. 2015;43(D1):D707–D713. doi: 10.1093/nar/gku1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Ghurye J., et al. A chromosome-scale assembly of the major African malaria vector Anopheles funestus. Gigascience. 2019;8(6) doi: 10.1093/gigascience/giz063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Krogh A., et al. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 2001;305(3):567–580. doi: 10.1006/jmbi.2000.4315. [DOI] [PubMed] [Google Scholar]
- 57.Price M.N., Dehal P.S., Arkin A.P. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol. Biol. Evol. 2009;26(7):1641–1650. doi: 10.1093/molbev/msp077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Mackenzie P.I., et al. The UDP glycosyltransferase gene superfamily: recommended nomenclature update based on evolutionary divergence. Pharmacogenetics. 1997;7(4):255–269. doi: 10.1097/00008571-199708000-00001. [DOI] [PubMed] [Google Scholar]
- 59.Danecek P., et al. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2) doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Edgar R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Leigh J.W., Bryant D. Popart: full-feature software for haplotype network construction. Methods Ecol. Evol. 2015;6(9):1110–1116. [Google Scholar]
- 62.Robinson M.D., McCarthy D.J., Smyth G.K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Schmittgen T.D., Livak K.J. Analyzing real-time PCR data by the comparative CT method. Nat. Protoc. 2008;3(6):1101–1108. doi: 10.1038/nprot.2008.73. [DOI] [PubMed] [Google Scholar]
- 64.Nelson C.W., Moncla L.H., Hughes A.L. SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data. Bioinformatics. 2015;31(22):3709–3711. doi: 10.1093/bioinformatics/btv449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Wondji C.S., et al. Mapping a quantitative trait locus (QTL) conferring pyrethroid resistance in the African malaria vector Anopheles funestus. BMC Genomics. 2007;8(1):34. doi: 10.1186/1471-2164-8-34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Li H., Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. 2009;25(14):1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Li H., et al. The sequence alignment/map format and SAMtools. Bioinformatics (Oxford, England) 2009;25(16):2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.McKenna A., et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Garrison E. Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv. 2012 (1207.3907(q-bio.GN)) [Google Scholar]
- 70.Garrison E., et al. Vcflib and tools for processing the VCF variant call format. bioRxiv. 2021 doi: 10.1371/journal.pcbi.1009123. (p. 2021.05.21.445151) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Cingolani P., et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin) 2012;6(2):80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Pfeifer B., et al. PopGenome: an efficient Swiss army knife for population genomic analyses in R. Mol. Biol. Evol. 2014;31(7):1929–1936. doi: 10.1093/molbev/msu136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Nielsen R., et al. Genomic scans for selective sweeps using SNP data. Genome Res. 2005;15(11):1566–1575. doi: 10.1101/gr.4252305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Hudson R.R. Generating samples under a Wright–Fisher neutral model of genetic variation. Bioinformatics. 2002;18(2):337–338. doi: 10.1093/bioinformatics/18.2.337. [DOI] [PubMed] [Google Scholar]
- 75.Ayala D., et al. The genome sequence of the malaria mosquito, Anopheles funestus, Giles, 1900 [version 1; peer review: 2 approved] Wellcome Open Research. 2022;7(287) doi: 10.12688/wellcomeopenres.18445.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Neafsey D.E., et al. Mosquito genomics. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes. Science. 2015;347(6217):1258522. doi: 10.1126/science.1258522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Nkya T.E., et al. Insecticide resistance mechanisms associated with different environments in the malaria vector Anopheles gambiae: a case study in Tanzania. Malar. J. 2014;13:28. doi: 10.1186/1475-2875-13-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Yang Z., Bielawski J.P. Statistical methods for detecting molecular adaptation. Trends Ecol. Evol. 2000;15(12):496–503. doi: 10.1016/S0169-5347(00)01994-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Khambay B.P.S., Jewess P.J. In: Comprehensive Molecular Insect Science. Gilbert L.I., editor. Elsevier; Amsterdam: 2005. 6.1 - Pyrethroids; pp. 1–29. [Google Scholar]
- 80.Soderlund D.M. Molecular mechanisms of pyrethroid insecticide neurotoxicity: recent advances. Arch. Toxicol. 2012;86(2):165–181. doi: 10.1007/s00204-011-0726-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Guengerich F.P. Cytochrome P450 research and the journal of biological chemistry. J. Biol. Chem. 2019;294(5):1671–1680. doi: 10.1074/jbc.TM118.004144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Cutler D.J., Jensen J.D. To pool, or not to pool? Genetics. 2010;186(1):41–43. doi: 10.1534/genetics.110.121012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Michel A.P., et al. Rangewide population genetic structure of the African malaria vector Anopheles funestus. Mol. Ecol. 2005;14(14):4235–4248. doi: 10.1111/j.1365-294X.2005.02754.x. [DOI] [PubMed] [Google Scholar]
- 84.Genetic diversity of the African malaria vector Anopheles gambiaeNature. 2017;552(7683):96–100. doi: 10.1038/nature24995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Nakajima M., et al. N-glycosylation plays a role in protein folding of human UGT1A9. Biochem. Pharmacol. 2010;79(8):1165–1172. doi: 10.1016/j.bcp.2009.11.020. [DOI] [PubMed] [Google Scholar]
- 86.Nakamura T., et al. Introduction of an N-glycosylation site into UDP-glucuronosyltransferase 2B3 alters its sensitivity to cytochrome P450 3A1-dependent modulation. Front. Pharmacol. 2016;7:427. doi: 10.3389/fphar.2016.00427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Korprasertthaworn P., et al. Effects of amino acid substitutions at positions 33 and 37 on UDP-glucuronosyltransferase 1A9 (UGT1A9) activity and substrate selectivity. Biochem. Pharmacol. 2012;84(11):1511–1521. doi: 10.1016/j.bcp.2012.08.026. [DOI] [PubMed] [Google Scholar]
- 88.Kim J.Y., et al. Comprehensive variant screening of the UGT gene family. Yonsei Med. J. 2014;55(1):232–239. doi: 10.3349/ymj.2014.55.1.232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Chambers A.C., et al. Overview of the Baculovirus expression system. Curr. Protoc. Protein Sci. 2018;91 doi: 10.1002/cpps.47. (p. 5.4.1-5.4.6) [DOI] [PubMed] [Google Scholar]
- 90.Ibrahim S.S., et al. Allelic variation of cytochrome P450s drives resistance to Bednet insecticides in a major malaria vector. PLoS Genet. 2015;11(10) doi: 10.1371/journal.pgen.1005618. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Read data of pooled whole genome sequencing (PoolSeq) are available in the European Nucleotide Archive (ENA) under accession (PRJEB13485 and PRJEB24384). SureSelect data are available under accession numbers PRJEB24520 (Cameroon), PRJEB47287 (Malawi and Uganda), PRJEB24506 (FANG colony) and PRJEB48958 (FUMOZ colony). RNA-seq data are available under accession numbers PRJEB24351 (Cameroon permethrin resistant), PRJEB45224 (FANG and FUMOZ) and PRJEB70998 (Cameroon unexposed, Malawi permethrin resistant, Malawi unexposed, Uganda permethrin resistant and Uganda unexposed).