Abstract
Matrix-assisted laser desorption-ionisation time of flight mass spectrometry (MALDI-TOF MS) is a powerful analytical method that has been used extensively to identify sample ions of complex mixtures, and biological samples such as proteins, tissues and microorganisms. MALDI-TOF MS has revolutionised clinical microbiology with accurate, rapid, and inexpensive species-level identification of microbes. MALDI-TOF MS technology generates spectral signatures and matches them to a library of similar organisms using bioinformatics pattern matching. The use of MALDI-TOF MS for entomological samples has been explored by multiple groups with proven efficacy at differentiating between closely related species, as well as detecting pathogens in different vectors. The low cost per sample processing, rapid turnaround and robustness are attractive for surveillance of vector control programs. Libraries are built in-house for institutional usage, although a multi-user platform with sharing of spectra and data would be attractive. Only a few studies have strived to make their libraries publicly available. Here, we outline a stepwise approach for creating an in-house MALDI-TOF MS library and subsequent query, using malaria vector species identification as a case study for entomological samples. A protocol and video of the methodology are also shared. Moreover, the libraries related to this publication have been deposited in public repository (https://doi.org/10.7910/DVN/VYQFNO37) for anyone with MALDI-TOF MS equipment to adapt.
Introduction
Vector control remains a cornerstone of malaria prevention, with vector surveillance playing a critical role in monitoring and evaluating control and research programs. However, for surveillance to be truly informative, high-quality data must be collected consistently across space and time, extending beyond simple vector densities and morphological identification. A major challenge is that molecular assays (essential for understanding vector species composition) are prohibitively expensive when applied to thousands of mosquitoes annually. As countries progress toward malaria elimination, gaining deeper insights into vectorial systems and transmission patterns becomes essential for designing end-game strategies. This underscores the urgent need for cost-effective tools that enable large-scale sample processing, ensuring robust data collection to drive informed decision-making in malaria control and elimination efforts.
MALDI-TOF MS represents one such tool. The technology has revolutionised diagnostics in the field of clinical microbiology allowing for faster turn-around time from sample preparation to obtaining precise and accurate results at a lower cost per sample [1–4]. Recently, MALDI-TOF has found valuable application in entomological surveillance. Numerous studies have assessed its performance, accuracy, and reliability for mosquito species and sub-species identification [5–8]. MALDI-TOF MS has also been evaluated for the identification of other arthropods, including Drosophila, [9,10], fleas [11], lice [12], ticks [13–16], Ceratopogonides (biting midges), and sand flies [17–19]. It has also has also been used to differentiate between infected and non-infected ticks [15,16,20]; Plasmodium-infected and non-infected An. stephensi [21] and Wolbachia-infected and non-infected Aedes aegypti [22]. MALDI-TOF MS has also been shown to distinguish between/among blood meals from different hosts [5,23–25], age grading of malaria vectors [26] and insecticide resistance profiling [27].
Creating in house libraries can be challenging and often the number of institutions that have access to the instrument but that use it beyond microbiology is limited. We aim to demystify the process and share a detailed workflow for creating a MALDI-TOF MS library using malaria vector species identification as a case study. When well optimised, the spectra obtained from a single run can be used to develop other libraries and query other parameters including, for example, infectious status [21], age/parity [26], presence endosymbionts such as Microsporidia MB or Wolbachia [22], and potentially insecticide resistance [27]. Blood-engorged mosquito abdomens can be used for differentiation of blood meal sources [5].
The workflow involves a series of steps that begins with dissection of mosquitoes into three anatomical compartments: legs and wings, cephalothorax, and abdomen. Dissection is followed by protein extraction and acquisition of spectral profiles. Guided by results from gold standard tests (ELISA, PCR, taxonomy), a sub-set of high-quality spectra is then selected and used for training the machine algorithm and create a reference database (spectral library) (training dataset).
A high-quality spectrum should have a high number of well-defined, intense peaks with good signal-to-noise ratio, consistent peak positions across replicates, minimal background noise, and a set of prominent “marker peaks” specific to the analysed molecule, allowing for reliable identification and comparison against a reference database. The remaining spectra are used to determine the accuracy of the reference database in identifying unknown samples (validation dataset). The match level between the unknown sample and the reference database is estimated using the log score value (≥1.8 is considered a reliable identification). Once libraries are created these can be continuously improved by inclusion of additional reference spectra capturing potential variation between strains due to geographical difference brought about by genetic variations limited arising from limited genetic exchange/gene flow.
Here we share the methodological approach to creating MALDI TOF MS libraries for entomological samples using malaria vector species identification as an exemplary case study. Moreover, the libraries related to this publication have been deposited in public repository (https://doi.org/10.7910/DVN/VYQFNO37) for anyone with MALDI-TOF MS equipment to adapt.
Materials and methods
Mosquito sampling
Mosquito samples were collected from 2019–2022 in three distinct ecological zones with different species composition viz., Kilifi, Taita-Taveta and Kwale Counties of Kenya. A description of mosquito sampling and processing is detailed in Karisa et al [5].
Ethical consideration and approvals
This study did not involve human samples; only mosquitoes were collected and analyzed. However, since collections were conducted at the household level, ethical approval was obtained from the Kenya Medical Research Institute Scientific and Ethics Review Unit (KEMRI SERU 4945) before commencing the study. Several protocols were used during the mosquito collection exercise. Informed consent was sought from all participating households. In some cases, verbal consent was obtained, while in others, written consent or permission was provided in the presence of local administration before setting up traps for mosquito collection.
Sample preparation and gold standards
The step-by-step protocol of the process here briefly described, is also available on protocols.io (dx.doi.org/10.17504/protocols.io.n2bvjdy3xvk5/v1) and is included as a supplementary file with this article (S1 File). Briefly, Anopheles mosquitoes were individually dissected into different body parts: legs, wings, cephalothorax and abdomen. The legs and wings or one half of the cephalothorax were used for species identification by PCR as the gold standard reference [28–30] for the classification of MALDI-TOF MS spectra. The remaining half of the cephalothorax was homogenized, proteins were extracted and loaded onto a steel MALDI-target plate (Bruker Daltonics) as described earlier [5]. The prepared plate was then introduced into the Microflex machine (Bruker Daltonics) for spectral acquisition.
MALDI-TOF MS processing
The stepwise process of spectra acquisition, pre-processing, library creation and query (validation) has been previously described [5]. Furthermore, the step-by-step protocol of the process is also available on protocols.io (dx.doi.org/10.17504/protocols.io.n2bvjdy3xvk5/v1) and is included as a supplementary file with this article (S1 File).
Results
Mosquito molecular identification and spectra acquisition
A total of 2,332 mosquitoes, collected in Kenya [5], were morphologically identified as either Anopheles gambiae s.l or Anopheles funestus s.l and used for development of a MALDI-TOF MS species identification library. A total of 1,971 (85%) of the mosquitoes were accurately identified to species using molecular tools as quality control of reference library. The remaining 15% did not render sufficient DNA for PCR or were found not to be an anopheline by sequencing. Out of the 1,971 samples, various mosquito species were identified, including: An. gambiae s.s, An. arabiensis, An. merus, An. quadriannulatus, An. funestus s.s, An. rivulorum, and An. leesoni, making up the majority [5]. Of the 1,971 samples 5.2% produced poor-quality spectra and were therefore excluded, leaving a total of 1,782 for database creation and query (validation).
Reference spectral library creation
Reference libraries were created with 87 well characterized sample spectra belonging to An. gambiae (An. gambiae s.s – 10, An. arabiensis – 26, An. merus – 19, An. quadriannulatus – 14) and An. funestus (An. funestus s.s – 6, An. rivulorum – 6 and An. leesoni – 6) complexes were used for database creation. The libraries related to this publication have been deposited in public repository (https://doi.org/10.7910/DVN/VYQFNO37) for anyone with MALDI-TOF MS equipment to adapt.
Reference spectral library query
Out of all the mosquito analysed by MALDI-TOF MS, 1,712/1782 were correctly identified with an accuracy of 96.1% [5]. The database was particularly precise in distinguishing An. gambiae and An. funestus complex members which clustered into three distinct branches: i) An. vaneedeni, An. funestus and An. parensis clustered, ii) An. gambiae complex, and iii) An. rivulorum and An. leesoni (Fig 1).
Fig 1. MSP dendrogram of MALDI-TOF MS spectra used for database creation for species identification and An. vaneedeni and An. parensis [5].
However, a few samples had low LSV values, suspected to be due to either protein degradation, physiological status or residual blood meals in the head and thorax (Fig 2). The application of the weighted LSV approach on the An. gambiae s.l samples that had ambiguous identification during querying, resulted in correct species identification congruent with the molecular analysis results, further improving the accuracy of species identification to 97.5% [5].
Fig 2. LSVs obtained after a database query with MS spectra belonging to members of An. gambiae and An. funestus complexes.

Horizontal dashed lines represent the cut-off point for reliable identification (LSV > 1.8). Abbreviations: A.U., arbitrary units, An., Anopheles, s.s, sensu stricto, LSV., Log Score value [5].
Impact of preservation method on mosquito identification and LSV distribution
With a well curated, preservation methods do not significantly impact on mosquito identification and LSV distribution as we noted that there was no significant difference (Kruskal-Wallis test, p = 0.19) in the median LSV among different methods of preservation (Fig 3).
Fig 3. Log Score Value (LSVs) obtained after MSP reference database query with MS spectra based on method of preservation.
Horizontal dashed lines represent the cut-off point for reliable identification (LSV > 1.8) [5].
Discussion
This article outlines a comprehensive procedure for utilisations of MALDI-TOF MS for malaria vector species identification. The accuracy in distinguishing between sibling species of the two Anopheles’ species complex (An. funestus s.l and An. gambiae s.l) demonstrates the discriminative power of MALDI-TOF MS. In the context of surveillance, the wide range of species representation should reflect as much as possible the diversity of mosquitoes in each study area. Different mosquito species play varied roles in disease transmission and have different feeding and resting proclivities. The composition of vectors paints a picture of the type of rigor required for effective entomological surveillance and the obvious benefit of using a highly sensitive tool such as MALDI-TOF MS. The spectra obtained from the head and thorax can simultaneously measure several entomological parameters if libraries are created for that purpose. In different studies, the head and thorax spectral pattern has been successfully used for species identification [5], pathogens infection status determination [21], endosymbionts [22] and age-grading [26]. More recently, a proof-of-concept study demonstrated that spectral patterns from the head and thorax were used for discriminating between resistant and susceptible mosquitoes [27]. Here, we propose using the head and thorax spectral for all future MALDI TOF MS spectra acquisition for malaria vector profiling. To make this technology accessible to all stakeholders, including national public health agencies and partners responsible for vector surveillance, it is essential to harmonize protocols and methodologies across research groups and institutions. Standardizing approaches will enable the development of a shared platform, ensuring consistency, comparability, and broader adoption of this technology in national malaria control programs.
In conclusion, the proposed work flow enables the creation of in-house MALDI-TOF MS libraries with relative ease (protocol and associated video are attached as supplementary files). MALDI TOF MS can be used to robustly identify mosquito species. The high accuracy and ability to distinguish complex species and visualise species relationships make this method valuable in entomological monitoring of primary Afrotropical malaria vectors and can be expanded to include other mosquito disease vectors.
Supporting information
This PDF contains the detailed step-by-step protocol for constructing MALDI-TOF MS reference database for insect identification.
(PDF)
Acknowledgments
We are grateful to the Scientific and technical teams at the Centre for Geographic Medicine Research Coast, Kilifi for help in design and implementation of this work. Many thanks to the technical and field staff (Festus Yaa, Gabriel Nzai, Robert Mwakesi and Martha Muturi) who devoted their time and assisted in the field collection of samples. This paper has been published with the permission of the Director of the Kenya Medical Research Institute (KEMRI).
Associated content
- Mosquito Database Creation Using MALDI TOF MS video demonstration available on YouTube (https://youtu.be/btS0hsx9yNw). The video shows the experimental steps of MALDI TOF MS database creation.
Data Availability
Sample data are fully available without restriction on Harvard Dataverse - https://doi.org/10.7910/DVN/VYQFNO
Funding Statement
MM - Investment 057212 - Bill and Melinda Gates Foundation - USA MR - FLR_R1_190497 - The Royal Society FLAIR fellowship grant- UK The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Dhiman N, Hall L, Wohlfiel SL, Buckwalter SP, Wengenack NL. Performance and cost analysis of matrix-assisted laser desorption ionization-time of flight mass spectrometry for routine identification of yeast. J Clin Microbiol. 2011;49(4):1614–6. doi: 10.1128/JCM.02381-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Dieckmann R, Helmuth R, Erhard M, Malorny B. Rapid classification and identification of salmonellae at the species and subspecies levels by whole-cell matrix-assisted laser desorption ionization-time of flight mass spectrometry . Appl Environ Microbiol. 2008;74(24):7767–78. doi: 10.1128/AEM.01402-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Seng P, Drancourt M, Gouriet F, La Scola B, Fournier P-E, Rolain JM, et al. Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis. 2009;49(4):543–51. doi: 10.1086/600885 [DOI] [PubMed] [Google Scholar]
- 4.Tran A, Alby K, Kerr A, Jones M, Gilligan PH. Cost Savings Realized by Implementation of Routine Microbiological Identification by Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry. J Clin Microbiol. 2015;53(8):2473–9. doi: 10.1128/JCM.00833-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Karisa J, Ominde K, Tuwei M, Bartilol B, Ondieki Z, Musani H, et al. Utility of MALDI-TOF MS for determination of species identity and blood meal sources of primary malaria vectors on the Kenyan coast. Wellcome Open Res. 2024;8:151. doi: 10.12688/wellcomeopenres.18982.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Müller P, Pflüger V, Wittwer M, Ziegler D, Chandre F, Simard F, et al. Identification of cryptic Anopheles mosquito species by molecular protein profiling. PLoS One. 2013;8(2):e57486. doi: 10.1371/journal.pone.0057486 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yssouf A, Socolovschi C, Flaudrops C, Ndiath MO, Sougoufara S, Dehecq J-S, et al. Matrix-assisted laser desorption ionization--time of flight mass spectrometry: an emerging tool for the rapid identification of mosquito vectors. PLoS One. 2013;8(8):e72380. doi: 10.1371/journal.pone.0072380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Yssouf A, Parola P, Lindström A, Lilja T, L’Ambert G, Bondesson U, et al. Identification of European mosquito species by MALDI-TOF MS. Parasitol Res. 2014;113(6):2375–8. doi: 10.1007/s00436-014-3876-y [DOI] [PubMed] [Google Scholar]
- 9.Campbell PM. Species differentiation of insects and other multicellular organisms using matrix‐assisted laser desorption/ionization time of flight mass spectrometry protein profiling. Systematic Entomology. 2005;30(2):186–90. doi: 10.1111/j.1365-3113.2004.00279.x [DOI] [Google Scholar]
- 10.Feltens R, Görner R, Kalkhof S, Gröger-Arndt H, von Bergen M. Discrimination of different species from the genus Drosophila by intact protein profiling using matrix-assisted laser desorption ionization mass spectrometry. BMC Evol Biol. 2010;10:95. doi: 10.1186/1471-2148-10-95 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Yssouf A, Socolovschi C, Leulmi H, Kernif T, Bitam I, Audoly G, et al. Identification of flea species using MALDI-TOF/MS. Comp Immunol Microbiol Infect Dis. 2014;37(3):153–7. doi: 10.1016/j.cimid.2014.05.002 [DOI] [PubMed] [Google Scholar]
- 12.Ouarti B, Laroche M, Righi S, Meguini MN, Benakhla A, Raoult D, et al. Development of MALDI-TOF mass spectrometry for the identification of lice isolated from farm animals. Parasite. 2020;27:28. doi: 10.1051/parasite/2020026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Boucheikhchoukh M, Laroche M, Aouadi A, Dib L, Benakhla A, Raoult D, et al. MALDI-TOF MS identification of ticks of domestic and wild animals in Algeria and molecular detection of associated microorganisms. Comp Immunol Microbiol Infect Dis. 2018;57:39–49. doi: 10.1016/j.cimid.2018.05.002 [DOI] [PubMed] [Google Scholar]
- 14.Karger A, Kampen H, Bettin B, Dautel H, Ziller M, Hoffmann B, et al. Species determination and characterization of developmental stages of ticks by whole-animal matrix-assisted laser desorption/ionization mass spectrometry. Ticks Tick Borne Dis. 2012;3(2):78–89. doi: 10.1016/j.ttbdis.2011.11.002 [DOI] [PubMed] [Google Scholar]
- 15.Yssouf A, Almeras L, Berenger J-M, Laroche M, Raoult D, Parola P. Identification of tick species and disseminate pathogen using hemolymph by MALDI-TOF MS. Ticks Tick Borne Dis. 2015;6(5):579–86. doi: 10.1016/j.ttbdis.2015.04.013 [DOI] [PubMed] [Google Scholar]
- 16.Yssouf A, Almeras L, Terras J, Socolovschi C, Raoult D, Parola P. Detection of Rickettsia spp in ticks by MALDI-TOF MS. PLoS Negl Trop Dis. 2015;9(2):e0003473. doi: 10.1371/journal.pntd.0003473 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kaufmann C, Ziegler D, Schaffner F, Carpenter S, Pflüger V, Mathis A. Evaluation of matrix-assisted laser desorption/ionization time of flight mass spectrometry for characterization of Culicoides nubeculosus biting midges. Med Vet Entomol. 2011;25(1):32–8. doi: 10.1111/j.1365-2915.2010.00927.x [DOI] [PubMed] [Google Scholar]
- 18.Kumsa B, Laroche M, Almeras L, Mediannikov O, Raoult D, Parola P. Morphological, molecular and MALDI-TOF mass spectrometry identification of ixodid tick species collected in Oromia, Ethiopia. Parasitol Res. 2016;115(11):4199–210. doi: 10.1007/s00436-016-5197-9 [DOI] [PubMed] [Google Scholar]
- 19.Mathis A, Depaquit J, Dvořák V, Tuten H, Bañuls A-L, Halada P, et al. Identification of phlebotomine sand flies using one MALDI-TOF MS reference database and two mass spectrometer systems. Parasit Vectors. 2015;8:266. doi: 10.1186/s13071-015-0878-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Fotso Fotso A, Mediannikov O, Diatta G, Almeras L, Flaudrops C, Parola P, et al. MALDI-TOF mass spectrometry detection of pathogens in vectors: the Borrelia crocidurae/Ornithodoros sonrai paradigm. PLoS Negl Trop Dis. 2014;8(7):e2984. doi: 10.1371/journal.pntd.0002984 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Laroche M, Almeras L, Pecchi E, Bechah Y, Raoult D, Viola A, et al. MALDI-TOF MS as an innovative tool for detection of Plasmodium parasites in Anopheles mosquitoes. Malar J. 2017;16(1):5. doi: 10.1186/s12936-016-1657-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rakotonirina A, Caruzzo C, Ballan V, Kainiu M, Marin M, Colot J, et al. Wolbachia detection in Aedes aegypti using MALDI-TOF MS coupled to artificial intelligence. Sci Rep. 2021;11(1):21355. doi: 10.1038/s41598-021-00888-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Niare S, Almeras L, Tandina F, Yssouf A, Bacar A, Toilibou A, et al. MALDI-TOF MS identification of Anopheles gambiae Giles blood meal crushed on Whatman filter papers. PLoS One. 2017;12(8):e0183238. doi: 10.1371/journal.pone.0183238 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Tandina F, Niaré S, Laroche M, Koné AK, Diarra AZ, Ongoiba A, et al. Using MALDI-TOF MS to identify mosquitoes collected in Mali and their blood meals. Parasitology. 2018;145(9):1170–82. doi: 10.1017/S0031182018000070 [DOI] [PubMed] [Google Scholar]
- 25.Tandina F, Niare S, Almeras L, Davoust B, Doumbo OK, Raoult D, et al. Identification of mixed and successive blood meals of mosquitoes using MALDI-TOF MS protein profiling. Parasitology. 2020;147(3):329–39. doi: 10.1017/S003118201900163X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Mohammad N, Naudion P, Dia AK, Boëlle P-Y, Konaté A, Konaté L, et al. Predicting the age of field Anopheles mosquitoes using mass spectrometry and deep learning. Sci Adv. 2024;10(19):eadj6990. doi: 10.1126/sciadv.adj6990 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Almeras L, Costa MM, Amalvict R, Guilliet J, Dusfour I, David J-P, et al. Potential of MALDI-TOF MS biotyping to detect deltamethrin resistance in the dengue vector Aedes aegypti. PLoS One. 2024;19(5):e0303027. doi: 10.1371/journal.pone.0303027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Scott JA, Brogdon WG, Collins FH. Identification of single specimens of the Anopheles gambiae complex by the polymerase chain reaction. Am J Trop Med Hyg. 1993;49(4):520–9. doi: 10.4269/ajtmh.1993.49.520 [DOI] [PubMed] [Google Scholar]
- 29.Koekemoer LL, Kamau L, Hunt RH, Coetzee M. A cocktail polymerase chain reaction assay to identify members of the Anopheles funestus (Diptera: Culicidae) group. Am J Trop Med Hyg. 2002;66(6):804–11. doi: 10.4269/ajtmh.2002.66.804 [DOI] [PubMed] [Google Scholar]
- 30.Lobo NF, Laurent BS, Sikaala CH, Hamainza B, Chanda J, Chinula D. Unexpected diversity of Anopheles species in Eastern Zambia: implications for evaluating vector behavior and interventions using molecular tools. Sci Rep. 2015;5:17952. [DOI] [PMC free article] [PubMed] [Google Scholar]


