Skip to main content
Malaria Journal logoLink to Malaria Journal
. 2024 Mar 26;23:86. doi: 10.1186/s12936-024-04915-0

Rapid assessment of the blood-feeding histories of wild-caught malaria mosquitoes using mid-infrared spectroscopy and machine learning

Emmanuel P Mwanga 1,2,, Idrisa S Mchola 1, Faraja E Makala 1, Issa H Mshani 1,2, Doreen J Siria 1,2, Sophia H Mwinyi 1,2, Said Abbasi 1, Godian Seleman 1, Jacqueline N Mgaya 1, Mario González Jiménez 3, Klaas Wynne 3, Maggy T Sikulu-Lord 4, Prashanth Selvaraj 5, Fredros O Okumu 1,2,6,7,#, Francesco Baldini 2,#, Simon A Babayan 2,#
PMCID: PMC10964711  PMID: 38532415

Abstract

Background

The degree to which Anopheles mosquitoes prefer biting humans over other vertebrate hosts, i.e. the human blood index (HBI), is a crucial parameter for assessing malaria transmission risk. However, existing techniques for identifying mosquito blood meals are demanding in terms of time and effort, involve costly reagents, and are prone to inaccuracies due to factors such as cross-reactivity with other antigens or partially digested blood meals in the mosquito gut. This study demonstrates the first field application of mid-infrared spectroscopy and machine learning (MIRS-ML), to rapidly assess the blood-feeding histories of malaria vectors, with direct comparison to PCR assays.

Methods and results

Female Anopheles funestus mosquitoes (N = 1854) were collected from rural Tanzania and desiccated then scanned with an attenuated total reflectance Fourier-transform Infrared (ATR-FTIR) spectrometer. Blood meals were confirmed by PCR, establishing the ‘ground truth’ for machine learning algorithms. Logistic regression and multi-layer perceptron classifiers were employed to identify blood meal sources, achieving accuracies of 88%–90%, respectively, as well as HBI estimates aligning well with the PCR-based standard HBI.

Conclusions

This research provides evidence of MIRS-ML effectiveness in classifying blood meals in wild Anopheles funestus, as a potential complementary surveillance tool in settings where conventional molecular techniques are impractical. The cost-effectiveness, simplicity, and scalability of MIRS-ML, along with its generalizability, outweigh minor gaps in HBI estimation. Since this approach has already been demonstrated for measuring other entomological and parasitological indicators of malaria, the validation in this study broadens its range of use cases, positioning it as an integrated system for estimating pathogen transmission risk and evaluating the impact of interventions.

Keywords: Anopheles, Human blood index machine learning, Transfer learning, VectorSphere

Background

Effective entomological surveillance requires systematic collection, analysis, and interpretation of data on insects that transmit pathogens in different localities. It is essential for assessing risks and guiding the planning and implementation of vector control strategies, as well for monitoring, and evaluation of those strategies [1]. The likelihood of pathogen transmission can vary widely, depending on factors such as the presence of competent vectors, favourable climatic conditions, the presence of vulnerable human populations and the presence of other vertebrate hosts, which may sustain the vector populations [1]. Other factors may include the diversity of vector species in the area, their population dynamics, their behaviours in and around human dwellings such as the timing and location of biting, their resting behaviours and host preferences of these vectors [2, 3].

Anopheles mosquitoes are considered particularly hazardous due to their propensity to feed on, and thus transmit pathogens to, humans, notably malaria, which causes approximately 620,000 deaths and about 250 million cases annually [4]. Compared to mosquitoes from other regions, the Afro-tropical malaria vectors are particularly dangerous in this regard due to their comparatively greater preference for humans over other vertebrates [2]. This attribute, which is generally estimated as the human blood index, has been considered an important measure of the stability of malaria in different settings [5]; and is known to be highest in major malaria vectors, including Anopheles gambiae, Anopheles funestus and Anopheles coluzzii, which appear to be particularly well adapted synanthropes [6]. Following closely is Anopheles arabiensis, which can be an opportunistic vector species capable of blood-feeding readily on either humans or cattle, depending on availability [2, 3, 7]. Consequently, while this behaviour poses a notable risk for the transmission of zoonotic pathogens in addition to malaria, An. arabiensis is also a far less competent vector of malaria than either An. gambiae, An. funestus or An. coluzzii [811].

While anthropophagy (i.e. preference for feeding on humans) in malaria vectors can be augmented by the degree of endophily (i.e. preference for indoor resting), this behaviour can also be attenuated under high degrees of exophily (i.e. preference for outdoor resting). For example, An. funestus is known for being both highly anthropophilic and highly endophilic [2, 9], enforcing its major role in malaria transmission [9, 10] though there are settings where it is known to bite outdoors early in the morning [12, 13] or to feed on non-human hosts [14]. On the other hand, mosquitoes that rest indoors are more likely to feed on human host, while mosquitoes that prefer to rest outdoors are more likely to feed on non-human host [2, 15]. This might be due to mosquitoes feeding on the first host they encounter when presented with multiple hosts in the same environment [7], or to the use of bed nets preventing access to human hosts [16, 17]. Overall, accurate determination of the blood-feeding histories of malaria vector species is an important indicator of their feeding behaviour, their role in ongoing malaria transmission and the overall risk exposure of people within those settings.

Methods for investigating the blood-meal sources in mosquitoes include several techniques: the precipitin test observes the formation of a white precipitate resulting from the interaction between a saline extract of the blood meal and a suitable antiserum from a known host, indicating the presence of an antigen–antibody interaction [18]; microsphere assays is a molecular-based assay involving uniquely labelled microspheres with host species-specific capture probes to detect host blood meals [19]; microsatellite assays analyse short tandem repeat sequences in the mosquito’s DNA to identify blood sources based on unique genetic markers [20]; enzyme-linked immunosorbent assays (ELISA) detect immunoglobulin G (IgG) from blood-fed mosquito samples [21]; and polymerase chain reactions (PCR) target mitochondrial cytochrome b to identify arthropod blood meal sources [22]. ELISA and PCR, the most common techniques for studying host blood meals in mosquitoes, have played a crucial role in understanding mosquito host preference since the early 1980s and emerged as powerful tools due to their sensitivity [2126]. These methods have evolved over time with modification to enhance accuracy and efficiency. ELISA, for instance, utilizes two basic procedures: indirect ELISA, where an antiserum is used to trap a particular IgG [23], and direct ELISA, which relies solely on the antibody-enzyme conjugate to attach to host-specific IgG in the bloodmeal [21, 24], currently preferred for its simplicity over indirect ELISA. PCR, being more sensitive due to specific primers targeting host DNA, has evolved from conventional PCR, which amplified human host DNA extracts at the human tyrosine hydrolase (TC-11 or HUMTHO1) and VWA (HUMVWFA31) [25, 26], to the current multiplexed PCR capable of detecting five mammalian blood meals in mosquitoes in a single step (i.e., by size-differentiated DNA fragment on agarose gels) [22]. While these techniques offer significant advantages, they also come with challenges such as being time-consuming, laborious, and require repeated use of expensive reagents, not always readily available in rural laboratories where field collections are conducted. Moreover, ELISA assays, one of the most widely used technique, are prone to high levels of cross-reactivity, occasionally failing to sufficiently distinguish between human and hon-human blood meals [27]. Since field collections do not always yield synchronous physiological states, some of the blood meals may have been partly digested, which might also compound the detection capability of current methods [28].

In a recent study, our team demonstrated that machine learning models trained on mid-infrared spectra data collected from mosquitoes fed on different hosts (4000 cm−1 to 400 cm−1 frequencies) (MIRS-ML) could accurately distinguish vertebrate blood meals in laboratory-reared An. arabiensis mosquitoes without the need for molecular techniques [29]. However, it was also noted that field validation would be necessary for multiple reasons. Firstly, in field settings, the time post-feeding is unknown, and the mosquitoes may have multiple blood meals, occasionally from multiple sources. Secondly, unlike laboratory settings where the age of mosquitoes is known, field mosquitoes vary in age and may have taken their 2nd, 3rd, or 4th meals. Thirdly, the amount of blood in the mosquito gut may be small in the field due to increased disturbance during feeding compared to controlled laboratory conditions, and lastly, the genetic variability for blood sources is higher in the field. Overcoming these challenges would enable the potential use of MIRS-ML in real-world field scenarios. We, therefore, concluded from the initial laboratory study that whereas the technique offers a unique opportunity to rapidly test individual mosquitoes for blood-type and other attributes, assessing blood-feeding histories of wild malaria mosquitoes would provide an opportunity to test its potential field validation.

The current study aimed to analyse the blood-feeding preferences of wild-caught malaria mosquitoes, by using MIRS-ML models to identify the sources of their blood meals. The study also explored how well the models trained using laboratory-reared mosquitoes can be applied to field-collected samples by incorporating specific transfer learning techniques previously used for predicting the species identification and age of mosquitoes collected in different countries [30, 31]. The ultimate goal of the work was to demonstrate the utility of this approach for field applications. Implementing these models in the field would significantly enhance the knowledge of mosquito feeding behaviours and disease transmission, potentially informing more effective vector control strategies against multiple mosquito-borne diseases [3239].

Methods

Mosquito collection and processing

Mosquitoes were sampled from five sites in Tulizamoyo, a rural village in Ulanga district, southeastern Tanzania (8.3544° S, 36.7054° E). To capture a comprehensive range of blood-meals, collections were conducted as follows: (a) indoors using CDC light traps and resting buckets throughout the night (6:30 PM–6:30 AM) and Prokopack aspirators during the early morning (5:30 AM–6:30 AM); (b) outdoors in peri-domestic areas, including outdoor kitchens, with the same night and early morning methods; and (c) around animal sheds, again using resting buckets at night and Prokopack aspirators in the morning.

The collected mosquitoes were sorted by taxa and physiological states [40]. All blood-fed Anopheles females were killed with chloroform and preserved individually in 1.5 mL Eppendorf tubes containing silica gel desiccant afterwards. The mosquitoes were kept for 5 days at 5 °C before scanning (see below). In total, 1854 blood-fed (76% An. funestus and 24% An. arabiensis) females were examined.

Mid-infrared spectrometer scanning

The abdomens of all blood-fed An. funestus and An. arabiensis were scanned. An attenuated total reflection Fourier-transform infrared (ATR-FTIR) ALPHA II spectrometer (Bruker optics) was used to collect the infrared spectra of dried mosquito abdomens over a spectral range of 4000–400 cm−1, with a 2 cm−1 resolution. The absorbance data obtained from scanning the mosquito abdomens provides insights into the biochemical makeup, e.g. the protein and lipid concentrations present in the blood meal, which are indicative of the vertebrate source of the blood meal [29]. Each mosquito was scanned 32 times and the spectra were averaged. Scanning was done inside the Ifakara Health institute’s Vector Biology Laboratory, the VectorSphere.

Identification of blood meals from different vertebrate hosts using PCR

Following MIRS analysis, mosquito carcasses were subjected to a multiplex PCR assay to identify the vertebrate origins of their blood meals as either from humans, cows, goats, dogs, or pigs. A multiplexed PCR assay was used targeting the cytochrome b (cytB) gene following the Kent et al. protocol [22]. DNA was extracted using DNAzol® with a final volume of 20 µl per sample. The PCR mix included 5 µl of DNA, 1 µl each of 20 µM universal and species-specific primers, and 12.5 µl of One Taq Quick Load 2X master mix. Amplification conditions were: 95 °C for 5 min, 29 cycles of 95 °C for 1 min, 58 °C for 1 min, 72 °C for 1 min, and a final extension at 72 °C for 7 min. Products were run on a 2% agarose gel with Classic view stain and imaged under UV light with the Kodak Logic 100 system. PCR results were used as the “ground truth” to train and validate machine learning algorithms. The PCR products were run on a 2% agarose gel with Classic view stain and imaged under UV light with the Kodak Logic 100 system, assessed in comparison to the known fragment sizes for different hosts (Kent et al. [22] as shown in Table 1). PCR results provided “ground truth” data to train machine learning.

Table 1.

Amplified DNA fragments from different blood meal hosts

Host blood Fragment size (base pairs)
Human 334
Bovine 561
Goat 132
Dog 680
Pig 453

Confirmation of the identity of sibling species in the An. funestus group

Using DNA extracted from the same mosquitoes, a multiplex PCR protocol by Koekemoer et al. [41] was used to identify and distinguish between sibling species within the An. funestus group.

Training machine learning models to identify and distinguish between blood meal types

The analysis was carried out in Python 3.9 using the Scikit-learn [42] and Keras [43] libraries for the machine learning tasks. Supervised machine learning was exclusively trained with wild-caught An. funestus females dataset (N = 751), consisting of human-blood fed (n = 167) and bovine blood-fed (n = 584) mosquitoes, in order to predict blood meal sources for field-collected mosquitoes. Before performing model training and prediction, the classes were balanced by randomly under-sampling the over-represented blood meal class to match the under-represented classes [i.e. human-blood fed (n = 167) and bovine blood-fed (n = 167) mosquitoes]. The remaining samples from the random under-sampling were later included in the unseen data/test data for overall prediction. Field collected An. arabiensis were not used for model training since there were only 256 (human blood-fed (n = 2) and bovine blood-fed (n = 254)) of them in the total sample set. Additionally, prior to model training, the spectra were cleaned of water vapor absorption bands and carbon dioxide (CO2) interference bands then standardized by rescaling to zero mean and a variance of 1 to ensure consistency and uniformity. The following algorithms were tested and compared to select the one with the highest predictive accuracy and precision: K-nearest neighbours (KNN), Logistic Regression (LR), Support Vector Machine (SVM), Gradient Boosting (XGB), Random Forest (RF), and Multilayer Perceptron (MLP). The best-performing model was selected based on predictive accuracy and refined it through hyperparameter tuning. This optimized model was then validated using fivefold cross-validation. Once the model was validated, it was tested using a balanced set of unseen spectra from human blood-fed (n = 17) and bovine blood-fed (n = 17) mosquito samples derived from the under-sampling process.

A second-stage model evaluation was conducted using a larger but imbalanced set of test samples consisting predominantly of spectra from bovine-fed mosquitoes (n = 688) and a small number of spectra from human-fed mosquitoes (n = 19). While the datasets used for both the model training and the first stage testing consisted of only An. funestus, this larger dataset used for the second stage testing also included a small number of blood-fed An. arabiensis (n = 254), which had been excluded from model training.

Lastly, a transfer learning technique was implemented to predict field data by initially training machine learning models with laboratory data and then augmenting with small quantities of field data as follows. In this context, deep learning framework was utilized due to their direct provision of pre-trained models and pre-build transfer learning capabilities, which differs from traditional machine learning algorithms. Spectral data from a previous study were utilized [29], which involved laboratory-reared mosquitoes to train the deep learning model. This earlier study used age-synchronized lab-reared An. arabiensis fed on four different host types, cattle, goat, chicken and humans [29]. This pre-existing data was used here to train an MLP deep learning model within the Keras framework, but only the mosquitoes fed on human blood (n = 409) and bovine blood (n = 454) were included. Then, the model was augmented with a small subset of newly collected data from wild mosquitoes to assess the amount of field data needed for effective transfer learning. The resulting MLP model was then utilized to classify the sources of blood meals in wild-collected mosquitoes from two different test sets: a near-balanced set of test samples (human blood-fed (n = 177) and bovine blood-fed (n = 120)) derived from the under-sampling process, and an imbalanced set of test samples consisting predominantly of spectra from bovine-fed mosquitoes and a small number of spectra from human-fed mosquitoes; the second test set included 784 bovine blood-fed and 122 human blood-fed mosquito samples.

While accuracy was the primary evaluation metric for the model, additional metrics, namely recall (true positive rate), precision (positive predictive value), and F1-scores were also employed for a comprehensive performance assessment. The recall score, indicating the ability of the model to identify all actual positives and minimize false negatives, was calculated as the proportion of accurately identified blood meal hosts out of the total blood-fed mosquitoes within each category. Precision, reflecting the success of the model in avoiding false positives, was measured as the proportion of correctly classified blood meal host/source against all the positive predictions of that model for each blood meal category. Lastly, the F1 score, a harmonic mean of precision and recall, was computed to gauge the balanced performance of the model in accurately classifying blood meal host sources. A higher F1 score denotes superior model efficacy, with a score of 1 indicating perfect precision and recall.

Estimating the human blood index (HBI) using results from PCR and MIRS-ML approaches

The proportion of mosquito blood meals obtained from humans were estimated through predictions generated by MIRS-ML based approaches and compared them to the outcomes of PCR analysis. The definitive ‘ground truth’ HBI (human-fed/total blood-fed mosquitoes) was calculated using PCR results, while MIRS-ML based prediction were used for comparison.

Results

PCR-based identification of blood meals from different vertebrate hosts

A total of 1854 samples were examined (Table 2). Of these 45.2% of the mosquitoes had consumed bovine blood, 9% human blood, 3.7% dog blood, and 1.4% a mixture of human and bovine blood. Another 0.3% had fed on either a mix of human and dog blood or bovine and dog blood. Notably, 40.1% of all samples remained unamplified, possibly due to prolonged host-blood digestion within the mosquito abdomen [28] or the presence of blood from other vertebrates not targeted by the list of primers used in the study.

Table 2.

Number of amplified host blood meal sources of wild-caught Anopheles mosquitoes

Host blood An. funestus group An. arabiensis Total count (%)
Bovine blood 584 254 838 (45.2)
Human blood 167 2 169 (9)
Dog blood 65 3 68 (3.7)
Human and bovine blood 26 26 (1.4)
Bovine and dog blood 5 5 (0.3)
Human and dog blood 5 5 (0.3)
Unamplified 553 190 743 (40.1)
Total 1405 449 1854 (100)

Confirmation of the identity of sibling species in the Anopheles funestus group

Additional PCR was conducted to determine the species composition of An. funestus that blood-fed on bovine and humans. These tests revealed that 99% of the successfully amplified bovine blood-fed samples were An. funestus, with Anopheles rivolurum and Anopheles vaneedeni making up 0.7%–0.1%, respectively. Anopheles funestus also accounted for 100% of the amplified samples from mosquitoes that had fed on human blood.

Using machine learning models to identify and distinguish between blood meal types

As humans and cattle were found to be the predominant hosts (Table 2), the ML models were exclusively trained using labels from An. funestus human blood-fed (n = 167) and bovine blood-fed (n = 584). To address the imbalance, the bovine blood-fed class was under-sampled at random to match the under-represented class (i.e. human-blood fed (n = 167) and bovine blood-fed (n = 167) mosquitoes) [44].

LR achieved the highest in-sample prediction accuracy at 80% (Fig. 1A). After hyperparameter tuning, the LR model predicted the previously unseen balanced set of test samples with an overall accuracy of 88%,–94% for bovine and 82% for human blood meal classifications (Fig. 1B). The summarization of this result on a confusion matrix shows that about 6% of mosquitoes blood-fed on bovine were misclassified as human blood-fed, and 18% of human blood-fed mosquitoes were misclassified as bovine blood-fed (Fig. 1B).

Fig. 1.

Fig. 1

A Comparison of machine learning algorithms; KNN K-nearest neighbours, LR Logistic regression, SVM Support vector machine, XGB Extreme Gradient boosting, RF Random forest. B A confusion matrix from the LR classifier’s predictions on the balanced set of test samples of wild An. funestus blood-fed on human and bovine. C A confusion matrix from the LR classifier’s predictions of the imbalanced set of test samples of wild mosquitoes blood-fed on human and bovine

Moreover, when all the remaining samples were included in the test set to create a larger but imbalanced dataset, the LR model classified all the previously unseen spectra with an overall accuracy of 78%, predicting bovine blood-fed and human blood-fed mosquitoes with 73%–82% accuracy, respectively (Fig. 1C). Additionally, a lower precision was observed for the minority class (i.e. Human). Additional metrics (precision, recall and F1 statistics) and the number of test samples are in Table 3.

Table 3.

Precision, recall, and F1-score of the LR classifier in classifying Bovine and human blood-meal sources in out-of-sample wild malaria mosquitoes

Host blood Precision Recall F1-score No. test samples
Model testing using a balanced set of test samples
 Bovine 0.84 0.94 0.89 17
 Human 0.93 0.82 0.87 17
Model testing using an imbalanced set of test samples
 Bovine 0.99 0.79 0.88 688
 Human 0.09 0.79 0.17 19

Using ML models trained with laboratory data to classify host blood meals of field-collected mosquitoes

Although the initial model trained with field data yielded a relative high accuracy performance, the effectiveness of a model trained using laboratory data from an earlier study was evaluated [29], for classifying the host blood meals of field-collected samples. Indeed, the advantage of this approach is that it would allow to create models using laboratory samples, which are easier to produce and balance between different hosts.

After training a baseline MLP model, a small subset of field spectra was incorporated using transfer learning which can allow generalization with minimal re-calibrations [30]. Transfer learning exhibited a significant enhancement in classification accuracy, increasing from 76% to approximately 90% (Fig. 2A). This level of accuracy was achieved by integrating, into the MLP model trained with laboratory data up to 100 field samples, evenly split between human-fed and bovine-fed classes. Specifically, on the balanced set of test samples, the MLP model achieved a classification accuracy of 90% for bovine blood meal sources and 91% for human blood meal sources (Fig. 2B).

Fig. 2.

Fig. 2

A The accuracy of classifying unseen blood-meal sources in field mosquitoes significantly increased from 76 to 90% when using a training set of up to 100 field mosquitoes for transfer learning. The mean accuracy is depicted by the solid line, while the shaded ribbon represents the standard deviation of the mean across 10 models. B A confusion matrix from the transfer learning model for classifying human and bovine blood meals in field mosquitoes from the balanced set of test samples. C A confusion matrix from the transfer learning model’s classification prediction of the imbalanced set of test samples of wild mosquitoes blood-fed on human and bovine

Moreover, on the imbalanced set of test samples (784 bovine blood-fed and 122 human blood-fed), the MLP model improved and achieved an overall accuracy of 94%–98% for bovine and 90% for human blood-fed mosquitoes (Fig. 2C). The precision, recall, F1-score metrics, and the number of test samples are presented in Table 4.

Table 4.

Precision, recall, and F1-score of the transfer learning model (i.e. MLP) in classifying out-of-sample bovine and human blood-meal sources in wild malaria mosquitoes

Host blood Precision Recall F1-score No. test samples
Model testing using a balanced set of test samples
 Bovine 0.91 0.90 0.90 120
 Human 0.90 0.91 0.90 117
Model testing using an imbalanced set of test samples
 Bovine 0.98 0.98 0.98 784
 Human 0.88 0.90 0.89 122

Lastly, to assess whether MIRS-ML could be used to estimate human blood index (HBI), which reflects the proportion of mosquito blood meals derived from humans, the predictions by MIRS-ML were compared against standard HBI values obtained by PCR. It was observed that LR predictions, when solely based on field data, slightly underestimated the HBI by 6% compared to PCR results. On the other hand, the predictions obtained by the model that used transfer learning were much more accurate in estimating HBI; and even minimal number of samples included in the re-calibration model well aligned with the PCR-based standard HBI (Fig. 3).

Fig. 3.

Fig. 3

Estimation of the HBI by the transfer learning (i.e. MLP-TL, Multilayer perceptron-transfer learning) compared to PCR when using a training set of up to 100 field mosquitoes for transfer learning. The solid line represents the average HBI, while the shaded ribbon illustrates the standard deviation across 10 iterations

Discussion

Human blood index (HBI), which reflects the tendency of mosquitoes to feed on humans compared to other vertebrates, is vital for assessing malaria transmission dynamics and the level of stability of transmission [5]. Current techniques for determining mosquito blood meal sources are slow, labour-intensive, and expensive due to the need for costly reagents. They are also susceptible to errors, such as false positives from cross-reactivity with other antigens or due to the partial digestion of blood meals in the mosquito digestive system. Yet, as malaria endemic countries move towards elimination, there is a pressing need for simpler, more cost-effective methods that can be deployed at scale in malaria-endemic countries to improve entomological surveillance and evaluate the effectiveness of malaria control interventions.

This study demonstrates the first-ever field application of the simple mid-infrared spectroscopy and machine learning (MIRS-ML) approach for predicting the blood-feeding histories of malaria vector in rural Africa. Beyond this, the study also demonstrates the transferability of the laboratory-trained MIRS-ML models to identify and classify host blood meals in field-collected samples through the utilization of transfer learning techniques. For validation, PCR as the ‘ground truth’ was used to determine the actual blood-feeding histories of the field-collected mosquitoes; and examined a total of 1854 blood-fed Anopheles mosquitoes.

Based on the PCR analysis, most of the mosquitoes blood-fed on humans or bovines, and only a very small percentage had fed on other hosts, such as dogs and pigs. Given the inherent limitations of the PCR, classification of blood meals in 41% of the samples was impossible, possibly because they fed on a host other than those tested in this study and therefore could not be amplified with the primers used. Nonetheless, only mosquitoes confirmed to have fed on either humans or bovines were included in this analysis, as they were the vast majority; thus binary machine learning classifiers were trained for blood-meal prediction. The capability of the MIRS-ML models to classify mosquito blood-meal sources was demonstrated, achieving an accuracy of 88%, when using 338 spectra data collected from field samples (169 human-fed and 169 from bovine-fed mosquitoes). This demonstrates a realistic opportunity to deploy such simple methods for estimating HBI, thereby extending the capability of infrared-AI based systems already well demonstrated for tracking several other entomological attributes [45].

In prior work using age-synchronized laboratory-reared mosquitoes, the focus was on predicting blood-meal sources with An. arabiensis, where the MIRS-ML approach achieved a classification accuracy of–98% for four blood meal sources (bovine, human, goat and chicken) [29]. Whereas the mosquitoes used in that earlier study were only 6–8 h post-feeding, this current study included a broader range of age groups and natural variation in the degrees of digestion of the bloodmeals. This current study therefore strongly demonstrates the potential of the MIRS-ML approach for realistic field surveillance, even when the time of actual blood-feeding and digestion stages is unknown upon sample collection and preparation.

A major achievement in the present work is the demonstration of the transferability of laboratory-trained models to field samples through the application of transfer learning. The transferability of laboratory-trained models achieved a classification accuracy of 90% in predicting blood-meal sources for field-collected An. funestus. The base laboratory model was initially trained using spectra data from blood-fed An. arabiensis [29], which was then augmented by incorporating a small subset (n = 100, with 50 samples each from humans and bovine blood-fed An. funestus spectra) of field-collected data into the model. This implies that the technique can be extended to assess blood-meal sources in the abdomens of Afrotropical malaria vectors, as the species would not be a confounding factor in this case. It also implies that the generalizability of this model will cut across laboratory and field sample prediction, and therefore, sample origin might not be a confounding factor. Since field-collected mosquitoes were likely of varied ages, and therefore mosquito age, a factor readily classifiable by MIRS-ML models [30], is also unlikely to be a confounder, and can be overcome by similar transfer learning approaches. The results presented here corroborate with previous studies in which the utilization of transfer learning successfully generalized predictions of mosquito age and species across different countries and laboratories [30, 31]. This approach effectively accounts for the inherent variability of mosquitoes from different environmental and ecological settings or genetic backgrounds, which could otherwise limit the generalizability of ML models trained on mosquito spectra data to new mosquito populations. Indeed, the genetic variability for blood meals in the field is likely high, and blood-fed mosquitoes collected during the study contained a mixture of fully engorged and partially consumed blood meals.

Partial digestion or low quantity of ingested blood meals, could potentially impair the capability of MIRS-ML to accurately identify or differentiate between various blood meals, thereby affecting the Human Blood Index (HBI) estimates. To mitigate this, it is advised against including gravid mosquitoes in samples and recommended to preserve all blood-fed mosquitoes immediately upon collection to halt any biochemical changes before spectroscopy. Currently investigating this phenomenon, preliminary studies have demonstrate a notable decrease in MIRS-ML accuracy after 36 h post-feeding (Mgaya et al. (unpublished), which coincides with gravidity in a typical 2–3 day gestation period under optimal conditions. In this paper, field models closely aligned with PCR outcomes, considered as the benchmark, despite the inability to precisely determine the gestational stage of mosquitoes at the time of collection each morning post-trapping. Moreover, earlier studies by Mukabana et al. [28], have successfully used PCR to amplify host DNA up to 32 h post-feeding after which the host DNA is degraded. Crucially, the analysis only incorporated samples that yielded successful PCR amplification of host DNA for MIRS-ML training, discarding all non-amplified samples. This selection criterion may inadvertently introduce bias since the partially or fully digested blood meals may be the ones least likely to yield good-quality host DNA. Future models should therefore include samples of mosquitoes that have blood-fed on known hosts, 1–4 days post-feeding to evaluate the efficacy of MIRS-ML across various stages, including gravid and post-oviposition states. Lastly, though the model was already trained on a large number of mosquitoes, it is recommended to increase these sample sizes and obtain mosquitoes from different sampling locations so as to neutralize effects such as partial blood-meals and partial digestion, as well as any effects of environmental or microclimatic factors affecting blood feeding and digestion.

Indeed, increasing the number of field samples for transfer learning not only enhanced the classification accuracy for field blood-fed mosquitoes but also improved the precision in estimating the HBI in comparison to the ‘ground truth’ PCR method. This indicates that the technique has the potential to be a reliable method for estimating HBI, capable of generalizing HBI estimations in field-collected mosquitoes as effective as PCR. Therefore, it can provide valuable information to national malaria control programs regarding the feeding preferences of malaria mosquitoes.

Despite the successes of this technique, there remain several gaps. Firstly, it is unclear whether the technique can detect mixed blood meals, a situation that is more likely to occur in the field, remains unanswered, warranting future investigation. Secondly, PCR and ELISA remains highly sensitive and specific, known for their accuracy in detecting host DNA and specific protein from blood meals, even in small amounts, respectively. Although MIRS-ML has demonstrated notable accuracies in detecting mosquito blood meals, its performance, being highly sensitive and specific, depends on the quantity and quality of the training data and machine learning algorithms used. This robustness of the model will contribute to its ability to handle variations. Thirdly, the machine learning models in this study were trained using An. funestus mosquitoes that had blood-fed on humans and bovines. This choice was made because most mosquito samples collected from the field contained either human or bovine blood in their abdomens, while only a minority had dog blood or mixed blood-meals. Consequently, the available samples were insufficient to adequately train the machine learning models to detect mosquito blood-meal sources from hosts other than humans and bovines. In their current state, these models would face challenges in field deployment since they will not be capable of identifying blood-meal sources from other potential hosts often found in human dwellings such as goat, pig, and chicken. However, considering that the transferability of the laboratory-trained models for field sample prediction has also been demonstrated, the deployment of these models could involve initially training them on laboratory data, which can be generated in large quantities. Additionally, this approach allows for the inclusion of a wider range of hosts, ensuring accurate mosquito blood-meal source prediction from all common hosts typically found near human dwellings, including humans, bovines, goats, dogs, pigs and chickens. Thus, once validated, MIRS-ML approaches have the potential to make significant contributions to understanding the dynamics of disease transmission involving humans, livestock, wildlife, and vectors. Specifically, they could offer valuable insights into scenarios where mosquitoes have opportunities to feed on multiple host species.

Interestingly, despite its anthropophilic behaviour, An. funestus, the main vector in the study area, was found to also blood-feed on bovines. This finding is consistent with previous studies that demonstrated a potential switch in host choice by An. funestus from humans to cattle [46, 47]. In brief, given the circumstances of the collections, this observation may be explained by several factors: Firstly, the houses where mosquito collections were conducted had been supplied with intact bed nets before the collections started, which might have created a physical barrier, reducing mosquito exposure to humans [48]; and forcing mosquitoes to use alternative blood sources in the surrounding areas as previously documented by Iwashita et al. [48]. Secondly, it might have been a result of the zoopotentiation effect, which refers to the increased tendency of mosquitoes to feed on humans living near livestock [49, 50], especially when livestock in close proximity to human dwellings emit heat and odour cues that attract mosquitoes. In such circumstances, not only do zoophagic mosquitoes find additional blood sources that they already prefer, but even the naturally anthropophagic mosquitoes may also accidentally feed on cattle when host cues become mixed nearby. There is a lot of evidence suggesting that zoopotentiation may increase malaria transmission risk by creating an alternative source of bloodmeals, consequently increasing both mosquito survival rates and abundance [5155]. This interaction of mosquitoes between humans and non-human hosts may also elevate the likelihood of transmitting parasitic helminths and zoonotic pathogens [3239, 56].

Infrared spectroscopy and machine learning methods have already been demonstrated for several other use cases, such as age-grading mosquitoes [30, 31, 5759], detection of pathogens inside mosquitoes [60], identification of mosquito species [30] and even detection of parasites in human blood [6163]. This demonstration of its usefulness for analysing the blood-feeding histories of mosquitoes in both the laboratory (as previously shown [29]) and the field (this current study), underscores the unique potential of the technology as a one-stop system for comprehensive analysis of entomological and parasitological indicators of malaria and other mosquito-borne diseases.

Conclusion

In conclusion, the study marks the pioneering application of mid-infrared spectroscopy combined with machine learning (MIRS-ML) for rapid assessment of blood-feeding patterns in field-collected malaria vectors. By successfully classifying the blood meals of wild An. funestus female mosquitoes, it has been demonstrated that, regardless of whether the ML models were trained with MIR spectra from field-collected conspecific females or from laboratory-reared An. arabiensis, MIRS-ML has the accuracy, precision and overall potential for identifying and distinguishing between different host blood meals. By comparing results with multiplex PCR assays, which was considered the 'ground truth', MIRS-ML achieved high classification accuracies of 88%–90% with logistic regression and multi-layer perceptron classifiers, respectively. Notably, the study also confirms the effectiveness of transfer learning in adapting laboratory-trained models for field data analysis. The MIRS-ML methodology represents a scalable, cost-efficient alternative to traditional, more labour-intensive blood meal analysis methods, and has the added advantage of estimating the human blood index (HBI) with only slight overestimation. Since this technology has already been demonstrated for several other entomological and parasitological surveys, this study demonstrates its extended capability and potential as a “one-stop” system for comprehensive analysis of entomological and parasitological indicators of malaria and other mosquito-borne diseases. This advancement is crucial for malaria-endemic regions seeking simpler analytical methods to enhance entomological surveillance or to evaluate the impact of disease control efforts. The marginal discrepancies in HBI estimation do not detract from the method's utility, rather they highlight the transformative potential of MIRS-ML in facilitating comprehensive surveillance and providing deeper insights into malaria transmission dynamics.

Acknowledgements

The authors extend their sincere gratitude to the field technicians and community members who contributed to the collection of wild mosquitoes. The authors also express their gratitude to the administration team for their consistent administrative support. Furthermore, the authors appreciate the steadfast support received from local government officials in Ulanga district for their unwavering support throughout this study

Abbreviations

MIRS-ML

Mid-infrared spectroscopy and machine learning

PCR

Polymerase chain reaction

ATR-FTIR Spectrometer

Attenuated total reflection—Fourier transform infrared spectrometer

Author contributions

EPM, SB, FB, KW, PS, MTS and FOO conceived the study. EPM, SA, FOO, and FB developed the study's protocol. GS, FEM and EPM collected the data. ISM, SA, and EPM performed molecular assays. EPM carried out data analysis and ML training. EPM wrote the manuscript. EPM, DJS, SHM,NJM, IHM, MGJ, KW, MTS, PS, SB, FB and FOO reviewed and edited drafts of the manuscript. All authors have read and approved the final manuscript.

Funding

This study was supported by the Wellcome Trust Masters Fellowship in Tropical Medicine & Hygiene (Grant No. 214643/Z/18/Z) awarded to EPM and the Medical Research Council (MRC) [MR/P025501/1] awarded to FB. FB is supported by the Academy Medical Sciences Springboard Award (ref:SBF007/100094). FOO was supported by a Howard Hughes Medical Institute (HHMI)-Gates International Research Scholarship (Grant No. OPP1099295), and Bill and Melinda Gates foundation (INV003079). SAB is supported by the Bill and Melinda Gates Foundation (INV-030025) and Royal Society (ICA/R1/191238).

Availability of data and materials

The mid-infrared spectral datasets generated and analysed during the current study, as well as code for the analyses is available at [GitHub].

Declarations

Ethics approval and consent to participate

Ethical approval for this study was obtained from Ifakara Health Institute Institutional Review Board (Ref. IHI/IRB/No: 41-2020), and from the Medical Research Coordinating Committee (MRCC) at the National Institute of Medical Research (NIMR), Ref: NIMR/HQ/R.8a/Vol. IX/3557.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Fredros O. Okumu, Francesco Baldini, and Simon A. Babayan co-supervised this work equally.

References

  • 1.WHO . Malaria surveillance, monitoring & evaluation: a reference manual. Geneva: World Health Organization; 2018. [Google Scholar]
  • 2.Takken W, Verhulst NO. Host preferences of blood-feeding mosquitoes. Annu Rev Entomol. 2013;58:433–453. doi: 10.1146/annurev-ento-120811-153618. [DOI] [PubMed] [Google Scholar]
  • 3.Tirados I, Costantini C, Gibson G, Torr SJ. Blood feeding behaviour of the malarial mosquito Anopheles arabiensis: implications for vector control. Med Vet Entomol. 2006;20:425–437. doi: 10.1111/j.1365-2915.2006.652.x. [DOI] [PubMed] [Google Scholar]
  • 4.WHO . World malaria report. Geneva: World Health Organization; 2022. [Google Scholar]
  • 5.Kiswewski AE, Mellinger A, Spielman A, Malaney P, Sachs SE, Sachs J. A global index representing the stability of malaria transmission. Am J Trop Med Hyg. 2004;70:486–498. doi: 10.4269/ajtmh.2004.70.486. [DOI] [PubMed] [Google Scholar]
  • 6.Killeen GF. Characterizing, controlling and eliminating residual malaria transmission. Malar J. 2014;13:330. doi: 10.1186/1475-2875-13-330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tedrow RE, Rakotomanga T, Nepomichene T, Howes RE, Ratovonjato J, Ratsimbasoa AC, et al. Anopheles mosquito surveillance in Madagascar reveals multiple blood feeding behavior and Plasmodium infection. PLoS Negl Trop Dis. 2019;13:e0007176. doi: 10.1371/journal.pntd.0007176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lemasson JJ, Fontenille D, Lochouarn L, Dia I, Simard F, Ba K, et al. Comparison of behavior and vector efficiency of Anopheles gambiae and An. arabiensis (Diptera:Culicidae) in Barkedji, a Sahelian area of Senegal. J Med Entomol. 1997;34:396–403. doi: 10.1093/jmedent/34.4.396. [DOI] [PubMed] [Google Scholar]
  • 9.Kaindoa EW, Matowo NS, Ngowo HS, Mkandawile G, Mmbando A, Finda M, et al. Interventions that effectively target Anopheles funestus mosquitoes could significantly improve control of persistent malaria transmission in south–eastern Tanzania. PLoS ONE. 2017;12:e0177807. doi: 10.1371/journal.pone.0177807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Mapua SA, Hape EE, Kihonda J, Bwanary H, Kifungo K, Kilalangongono M, et al. Persistently high proportions of Plasmodium-infected Anopheles funestus mosquitoes in two villages in the Kilombero valley. South-Eastern Tanzania Parasite Epidemiol Control. 2022;18:e00264. doi: 10.1016/j.parepi.2022.e00264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Matowo NS, Kulkarni MA, Messenger LA, Jumanne M, Martin J, Mallya E, et al. Differential impact of dual-active ingredient long-lasting insecticidal nets on primary malaria vectors: a secondary analysis of a 3-year, single-blind, cluster-randomised controlled trial in rural Tanzania. Lancet Planet Health. 2023;7:e370–e380. doi: 10.1016/S2542-5196(23)00048-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Moiroux N, Gomez MB, Pennetier CC, Elanga E, Djènontin A, Chandre F, , et al. Changes in Anopheles funestus biting behavior following universal coverage of long-lasting insecticidal nets in Benin. J Infect Dis. 2012;206:1622–1629. doi: 10.1093/infdis/jis565. [DOI] [PubMed] [Google Scholar]
  • 13.Omondi S, Kosgei J, Musula G, Muchoki M, Abong’o B, Agumba S, et al. Late morning biting behaviour of Anopheles funestus is a risk factor for transmission in schools in Siaya western Kenya. Malaria J. 2023; 22:366. [DOI] [PMC free article] [PubMed]
  • 14.Meza FC, Kreppel KS, Maliti DF, Mlwale AT, Mirzai N, Killeen GF, et al. Mosquito electrocuting traps for directly measuring biting rates and host-preferences of Anopheles arabiensis and Anopheles funestus outdoors. Malar J. 2019;18:83. doi: 10.1186/s12936-019-2726-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Smith A. The attractiveness of an adult and child to a. gambiae. East Afr Med J. Nairobi. 1956;33(10):409–410. [PubMed] [Google Scholar]
  • 16.Mbogo CNM, Baya NM, Ofulla AVO, Githure JI, Snow RW. The impact of permethrin-impregnated bednets on malaria vectors of the Kenyan coast. Med Vet Entomol. 1996;10:251–259. doi: 10.1111/j.1365-2915.1996.tb00739.x. [DOI] [PubMed] [Google Scholar]
  • 17.Charlwood JD, Graves PM. The effect of permethrin-impregnated bednets on a population of Anopheles farauti in coastal Papua New Guinea. Med Vet Entomol. 1987;1:319–327. doi: 10.1111/j.1365-2915.1987.tb00361.x. [DOI] [PubMed] [Google Scholar]
  • 18.Gomes LAM, Duarte R, Lima DC, Diniz BS, Serrão ML, Labarthe N. Comparison between precipitin and ELISA tests in the bloodmeal detection of Aedes aegypti (Linnaeus) and Aedes fluviatilis (Lutz) mosquitoes experimentally fed on feline, canine and human hosts. Mem Inst Oswaldo Cruz. 2001;96:693–695. doi: 10.1590/S0074-02762001000500020. [DOI] [PubMed] [Google Scholar]
  • 19.Thiemann TC, Brault AC, Ernest HB, Reisen WK. Development of a high-throughput microsphere-based molecular assay to identify 15 common bloodmeal hosts of Culex mosquitoes. Mol Ecol Resour. 2012;12:238–246. doi: 10.1111/j.1755-0998.2011.03093.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Ansell J, Hu J-T, Gilbert SC, Hamilton KA, Hill AVS, Lindsay SW. Improved method for distinguishing the human source of mosquito blood meals between close family members. Trans R Soc Trop Med Hyg. 2000;94:572–574. doi: 10.1016/S0035-9203(00)90092-0. [DOI] [PubMed] [Google Scholar]
  • 21.Beier JC, Perkins PV, Wirtz RA, Koros J, Diggs D, Gargan TP, et al. Bloodmeal identification by direct enzyme-linked immunosorbent assay (ELISA), tested on Anopheles (Diptera: Culicidae) in Kenya. J Med Entomol. 1988;25:9–16. doi: 10.1093/jmedent/25.1.9. [DOI] [PubMed] [Google Scholar]
  • 22.Kent RJ, Norris DE. Identification of mammalian blood meals in mosquitoes by a multiplexed polymerase chain reaction targeting cytochrome b. Am J Trop Med Hyg. 2005;73:336–342. doi: 10.4269/ajtmh.2005.73.336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Burkot TR, DeFoliart GR. Bloodmeal sources of Aedes triseriatus and Aedes vexans in a southern Wisconsin forest endemic for La Crosse encephalitis virus. Am J Trop Med Hyg. 1982;31:376–381. doi: 10.4269/ajtmh.1982.31.376. [DOI] [PubMed] [Google Scholar]
  • 24.Edrissian GH, Hafizi A. Application of enzyme-linked immunosorbent assay (ELISA) to identification of Anopheles mosquito bloodmeals. Trans R Soc Trop Med Hyg. 1982;76:54–56. doi: 10.1016/0035-9203(82)90017-7. [DOI] [PubMed] [Google Scholar]
  • 25.Polymeropoulos MH, Xiao H, Rath DS, Merrill CR. Tetranucleotide repeat polymorphism at the human tyrosine hydroxylase gene (TH) Nucleic Acids Res. 1991;19:3753. [PMC free article] [PubMed] [Google Scholar]
  • 26.Kimpton C, Walton A, Gill P. A further tetranucleotide repeat polymorphism in the vWF gene. Hum Mol Genet. 1992;1:287. doi: 10.1093/hmg/1.4.287. [DOI] [PubMed] [Google Scholar]
  • 27.Chow E, wirtz RA, Scott TW. Identification of blood meals in Aedes aegypti by antibody sandwich enzyme-linked immunosorbent assay. J Am Mosq Control Assoc. 1993;9:196–205. [PubMed] [Google Scholar]
  • 28.Mukabana RW, Takken W, Seda P, Killeen GF, Hawley WA, Knols BGJ. Extent of digestion affects the success of amplifying human DNA isolated from blood meals of Anopheles gambiae (Diptera: Culicidae) Bull Entomol Res. 2002;92:233–239. doi: 10.1079/BER2002164. [DOI] [PubMed] [Google Scholar]
  • 29.Mwanga EP, Mapua SA, Siria DJ, Ngowo HS, Nangacha F, Mgando J, et al. Using mid-infrared spectroscopy and supervised machine-learning to identify vertebrate blood meals in the malaria vector Anopheles arabiensis. Malar J. 2019;18:187. doi: 10.1186/s12936-019-2822-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Siria DJ, Sanou R, Mitton J, Mwanga EP, Niang A, Sare I, et al. Rapid age-grading and species identification of natural mosquitoes for malaria surveillance. Nat Commun. 2022;13:1501. doi: 10.1038/s41467-022-28980-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mwanga EP, Siria DJ, Mitton J, Mshani IH, González-Jiménez M, Selvaraj P, et al. Using transfer learning and dimensionality reduction techniques to improve generalisability of machine-learning predictions of mosquito ages from mid-infrared spectra. BMC Bioinformatics. 2023;24:11. doi: 10.1186/s12859-022-05128-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.White NJ. Plasmodium knowlesi: the fifth human malaria parasite. Clin Infect Dis. 2008;46:172–173. doi: 10.1086/524889. [DOI] [PubMed] [Google Scholar]
  • 33.Pialoux G, Gaüzère B-A, Jauréguiberry S, Strobel M. Chikungunya, an epidemic arbovirosis. Lancet Infect Dis. 2007;7:319–327. doi: 10.1016/S1473-3099(07)70107-X. [DOI] [PubMed] [Google Scholar]
  • 34.Bird BH, Ksiazek TG, Nichol ST, MacLachlan NJ. Rift Valley fever virus. J Am Vet Med Assoc. 2009;234:883–893. doi: 10.2460/javma.234.7.883. [DOI] [PubMed] [Google Scholar]
  • 35.Petersen LR, Jamieson DJ, Powers AM, Honein MA. Zika virus. N Engl J Med. 2016;374:1552–1563. doi: 10.1056/NEJMra1602113. [DOI] [PubMed] [Google Scholar]
  • 36.Barrett ADT, Monath TP. Epidemiology and ecology of yellow fever virus. Adv Virus Res. 2003;61:291–317. doi: 10.1016/S0065-3527(03)61007-9. [DOI] [PubMed] [Google Scholar]
  • 37.Halstead SB. Dengue virus–mosquito interactions. Annu Rev Entomol. 2007;53:273–291. doi: 10.1146/annurev.ento.53.103106.093326. [DOI] [PubMed] [Google Scholar]
  • 38.Campbell GL, Marfin AA, Lanciotti RS, Gubler DJ. West Nile virus. Lancet Infect Dis. 2002;2:519–529. doi: 10.1016/S1473-3099(02)00368-7. [DOI] [PubMed] [Google Scholar]
  • 39.Endy TP, Nisalak A. Japanese Encephalitis virus: ecology and epidemiology. Curr Top Microbiol Immunol. 2002;267:11–48. doi: 10.1007/978-3-642-59403-8_2. [DOI] [PubMed] [Google Scholar]
  • 40.Gillies MT, Coetzee M. A supplement to the anophelinae of the South of the Sahara (Afrotropical Region) Publ South African Inst Med Res. 1987;55:1–143. [Google Scholar]
  • 41.Koekemoer LL, Kamau L, Hunt RH, Coetzee M. A cocktail polymerase chain reaction assay to identify members of the Anopheles funestus (Diptera: Culicidae) group. Am J Trop Med Hyg. 2002;66:804–811. doi: 10.4269/ajtmh.2002.66.804. [DOI] [PubMed] [Google Scholar]
  • 42.Pedregosa F, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–2830. [Google Scholar]
  • 43.Chollet F. Keras. The python deep learning library. KerasIo. 2015. http://keras.io
  • 44.Lemaitre G, Nogueira F, Aridas CK. Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach learn Resea. 2017;18:1–5. [Google Scholar]
  • 45.Mshani IH, Siria DJ, Mwanga EP, Sow BBD, Sanou R, Opiyo M, et al. Key considerations, target product profiles, and research gaps in the application of infrared spectroscopy and artificial intelligence for malaria surveillance and diagnosis. Malar J. 2023;22:346. doi: 10.1186/s12936-023-04780-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Githeko AK, Service MW, Mbogo CM, Atieli FK, Juma FO Origin of blood meals in indoor and outdoor resting malaria vectors in Western Kenya. Acta Trop. 1994;58:307–316. doi: 10.1016/0001-706X(94)90024-8. [DOI] [PubMed] [Google Scholar]
  • 47.Katusi GC, Hermy MRG, Makayula SM, Ignell R, Govella NJ, Hill SR, et al. Seasonal variation in abundance and blood meal sources of primary and secondary malaria vectors within Kilombero Valley Southern Tanzania. Parasit Vectors. 2022;15:479. doi: 10.1186/s13071-022-05586-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Iwashita H, Dida GO, Sonye GO, Sunahara T, Futami K, Njenga SM, et al. Push by a net, pull by a cow: can zooprophylaxis enhance the impact of insecticide treated bed nets on malaria control? Parasit Vectors. 2014;7:52. doi: 10.1186/1756-3305-7-52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Donnelly B, Berrang-Ford L, Ross NA, Michel P. A systematic, realist review of zooprophylaxis for malaria control. Malar J. 2015;14:313. doi: 10.1186/s12936-015-0822-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Hasyim H, Dhimal M, Bauer J, Montag D, Groneberg DA, Kuch U, et al. Does livestock protect from malaria or facilitate malaria prevalence? A cross-sectional study in endemic rural areas of Indonesia. Malar J. 2018;17:302. doi: 10.1186/s12936-018-2447-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bouma M, Rowland M. Failure of passive zooprophylaxis: cattle ownership in Pakistan is associated with a higher malaria prevalence. Trans Roy Soc Trop Med Hyg. 1995;89:351–353. doi: 10.1016/0035-9203(95)90004-7. [DOI] [PubMed] [Google Scholar]
  • 52.Bøgh C, Clarke SE, Walraven GEL, Lindsay SW. Zooprophylaxis, artefact or reality? a paired-cohort study of the effect of passive zooprophylaxis on malaria in the Gambia. Trans R Soc Trop Med Hyg. 2002;96:593–596. doi: 10.1016/S0035-9203(02)90320-2. [DOI] [PubMed] [Google Scholar]
  • 53.Bøgh C, Clarke SE, Pinder M, Sanyang F, Lindsay SW. Effect of passive zooprophylaxis on malaria transmission in the Gambia. J Med Entomol. 2001;38:822–828. doi: 10.1603/0022-2585-38.6.822. [DOI] [PubMed] [Google Scholar]
  • 54.Saul A. Zooprophylaxis or zoopotentiation: the outcome of introducing animals on vector transmission is highly dependent on the mosquito mortality while searching. Malar J. 2003;2:1–18. doi: 10.1186/1475-2875-2-32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Sota T, Mogi M. Effectiveness of zooprophylaxis in malaria control: a theoretical inquiry with a model for mosquito populations with two bloodmeal hosts. Med Vet Entomol. 1989;3:337–345. doi: 10.1111/j.1365-2915.1989.tb00240.x. [DOI] [PubMed] [Google Scholar]
  • 56.Derua YA, Alifrangis M, Magesa SM, Kisinza WN, Simonsen PE. Sibling species of the Anopheles funestus group, and their infection with malaria and lymphatic filarial parasites, in archived and newly collected specimens from northeastern Tanzania. Malar J. 2015;14:104. doi: 10.1186/s12936-015-0616-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Gonzalez-Jimenez M, Babayan SA, Khazaeli P, Doyle M, Walton F, Reedy E, et al. Prediction of malaria mosquito species and population age structure using mid-infrared spectroscopy and supervised machine learning. Wellcome Open Res. 2019;4:76. doi: 10.12688/wellcomeopenres.15201.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Mayagaya VS, Michel K, Benedict MQ, Killeen GF, Wirtz RA, Ferguson HM, et al. Non-destructive determination of age and species of Anopheles gambiae s.l using near-infrared spectroscopy. Am J Trop Med Hyg. 2009;81:622–630. doi: 10.4269/ajtmh.2009.09-0192. [DOI] [PubMed] [Google Scholar]
  • 59.Lambert B, Sikulu-Lord MT, Mayagaya VS, Devine G, Dowell F, Churcher TS. Monitoring the age of mosquito populations using near-infrared spectroscopy. Sci Rep. 2018;8:5274. doi: 10.1038/s41598-018-22712-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Maia MFF, Kapulu M, Muthui M, Wagah MGG, Ferguson HMM, Dowell FEE, et al. Detection of Plasmodium falciparum infected Anopheles gambiae using near-infrared spectroscopy. Malar J. 2019;18:85. doi: 10.1186/s12936-019-2719-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Mwanga EP, Minja EG, Mrimi E, Jiménez MG, Swai JK, Abbasi S, et al. Detection of malaria parasites in dried human blood spots using mid-infrared spectroscopy and logistic regression analysis. Malar J. 2019;18:341. doi: 10.1186/s12936-019-2982-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Khoshmanesh A, Dixon MWA, Kenny S, Tilley L, McNaughton D, Wood BR. Detection and quantification of early-stage malaria parasites in laboratory infected erythrocytes by attenuated total reflectance infrared spectroscopy and multivariate analysis. Anal Chem. 2014;86:4379–4386. doi: 10.1021/ac500199x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Roy S, Perez-Guaita D, Andrew DW, Richards JS, McNaughton D, Heraud P, et al. Simultaneous ATR-FTIR based determination of malaria parasitemia, glucose and urea in whole blood dried onto a glass slide. Anal Chem. 2017;89:5238–5245. doi: 10.1021/acs.analchem.6b04578. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The mid-infrared spectral datasets generated and analysed during the current study, as well as code for the analyses is available at [GitHub].


Articles from Malaria Journal are provided here courtesy of BMC

RESOURCES