Abstract
Simple Summary
Wing venation traits are used to identify honey bee subspecies. While several wing-based tools are available, they suffer from weaknesses that were addressed by the recently developed software DeepWings©. This software allows fully automated identification of wing images in a friendly, free, and rapid manner. Here, we sought to test DeepWings© on 14,816 wing images representing 2601 colonies sampled in the native areas of three widespread subspecies in Europe: the Iberian honey bee (Apis mellifera iberiensis), the dark honey bee (Apis mellifera mellifera), both belonging to the M lineage, and the Carniolan honey bee (Apis mellifera carnica), belonging to the C lineage. DeepWings© classification of these colonies largely matched the endemic M and C lineages, with proportions of 71.4% and 97.6%, respectively. At the subspecies-level the matching proportions were 89.7% for the Iberian honey bee, 41.1% for the dark honey bee and 88.3% for the Carniolan honey bee, which can be explained by DeepWings© sometimes confounding closely related subspecies and, more importantly, by genetic pollution. A comparison between DeepWings© data and molecular data revealed that the agreement between the two is weaker when there is genetic pollution. Our results suggest that DeepWings© is a valuable tool for honey bee identification, which can be used not only for breeding and conservation but also for research purposes.
Abstract
DeepWings© is a software that uses machine learning to automatically classify honey bee subspecies by wing geometric morphometrics. Here, we tested the five subspecies classifier (A. m. carnica, Apis mellifera caucasia, A. m. iberiensis, Apis mellifera ligustica, and A. m. mellifera) of DeepWings© on 14,816 wing images with variable quality and acquired by different beekeepers and researchers. These images represented 2601 colonies from the native ranges of the M-lineage A. m. iberiensis and A. m. mellifera, and the C-lineage A. m. carnica. In the A. m. iberiensis range, 92.6% of the colonies matched this subspecies, with a high median probability (0.919). In the Azores, where the Iberian subspecies was historically introduced, a lower proportion (85.7%) and probability (0.842) were observed. In the A. m mellifera range, only 41.1 % of the colonies matched this subspecies, which is compatible with a history of C-derived introgression. Yet, these colonies were classified with the highest probability (0.994) of the three subspecies. In the A. m. carnica range, 88.3% of the colonies matched this subspecies, with a probability of 0.984. The association between wing and molecular markers, assessed for 1214 colonies from the M-lineage range, was highly significant but not strong (r = 0.31, p < 0.0001). The agreement between the markers was influenced by C-derived introgression, with the best results obtained for colonies with high genetic integrity. This study indicates the good performance of DeepWings© on a realistic wing image dataset.
Keywords: Apis mellifera subspecies, wing geometric morphometrics, honey bee classification, honey bee conservation, introgression
1. Introduction
The honey bee, Apis mellifera L. (Hymenoptera: Apidae), differentiated into over 30 subspecies [1,2,3,4,5], which belong to four main evolutionary lineages native to (i) western and north-eastern Europe, and north-western China (lineage M), (ii) central and south-eastern Europe (lineage C), (iii) Africa (lineage A), and (iv) the Near East and Central Asia (lineage O). Europe is the cradle of 10 such subspecies, of which eight belong to the M and C lineages. While the European M lineage comprises only A. m. mellifera and A. m. iberiensis, it spreads across a wider geographical and climatically more diverse area than the other six subspecies of C-lineage ancestry. This area extends from the Iberian Peninsula to southern Scandinavia and from Britain and Ireland to the Ural Mountains [5]. In contrast, the native area of the six C-lineage subspecies is limited to the Apennine and Balkan peninsulas, bordered at the north by the Alps and the Carpathians, and at the south by Sicily and the west Aegean islands [5]. Remarkably, despite the greater potential of the M lineage to adapt to more extreme environments, the C lineage includes two of the three subspecies favored in apiculture: A. m. carnica and A. m. ligustica. These, together with the O-lineage A. m. caucasia, were introduced worldwide due to their perceived gentle behavior and high productivity [5,6,7,8,9,10]. As a result, in many places of the M-lineage distributional range, particularly north of the Pyrenees, the genetic integrity of the native A. m. mellifera was threatened by gene flow from those foreign subspecies [9,10,11,12,13,14]. In an attempt to restore and protect the A. m. mellifera gene pool, conservation efforts sprouted in Europe [15] and an association (SICAMM–Societas Internationalis pro Conservatione Apis melliferae melliferae) for its protection was founded in 1994. These efforts require tools to identify colonies before they can be moved to conservation areas or to monitor the efficiency of isolated mating stations. On the other hand, identification tools may also be useful to C-lineage queen breeders in Eastern and Southeastern Europe.
The molecular tool kit for honey bee identification includes different markers of the mitochondrial (e.g., tRNAleu-cox 2 intergenic region) and nuclear DNA (e.g., microsatellites and single nucleotide polymorphisms [SNPs]) [16,17,18,19,20,21,22,23,24,25]. Although this toolkit proved to be powerful for honey bee identification [25,26,27,28,29,30,31,32,33], its use by beekeepers is very limited [34]. This is because molecular methods require trained personnel in addition to costly equipment and reagents, making genetic analysis of colonies unaffordable to beekeepers. In contrast, morphometric methods are cost-effective, need only a microscope with an attached camera, and, when based on wing traits, can more easily be implemented by beekeepers [23,35].
Subspecies identification by classical morphometry comprises 42 characters, including the size of anatomical structures, such as proboscis, femur, tergite, and sternites; discrete classes of pigmentation; length and width of wings, and angles in wing venation [5]. However, the manual measuring of this full-body set is very time-consuming and only a subset of wing characters proved to be informative for subspecies discrimination [23]. One of the most intensive assessments of wing shape variation is provided by the Discriminant Analysis with Numerical Output (DAWINO) method. This method is based on 30 characters extracted from vein lengths, their ratios, and vein angles. On the other side of the spectrum are the methods that only require estimations of the Cubital Index (CI), Hantel Index (HI), and/or Discoidal Shift Angle (DSA). These are popular amongst many beekeepers involved in the conservation of A. m. mellifera [34] as they are simple and can be implemented by semi-automatic software, such as BeeMorph (http://www.hockerley.plus.com/ accessed on 1 July 2022 ) or CBeeWings (http://www.cybis.se/cbeewing/ accessed on 1 July 2022). The problem is that these software packages provide a less reliable identification than that obtained from character-intensive methods. In addition, they require manual annotation of the vein junctions from which the indexes are calculated, which is time-consuming and prone to error.
Another way of assessing wing shape variation is through wing geometric morphometrics (WGM). This method is recognized as robust and reliable in insect taxonomy and is widely used in honey bee subspecies identification for varying purposes, including conservation [36,37,38,39,40,41,42]. WGM uses the coordinates defined by 19 landmarks located in the vein junctions to capture variation in wing shape [43] and can be implemented by the software IdentiFly [36]. However, IdentiFly is a semi-automatic tool that requires several steps before the wings can be fully identified, which makes its use difficult for the layman. The most recent advance in WGM uses deep learning to automatically extract the 19 landmarks from the right forewing of honey bee workers [44]. The approach is implemented by the software DeepWings©, which allows fully automated identification of honey bees. DeepWings© is very friendly, requiring only that the users drag the wing images into a file drop zone. Then, for each analyzed wing, it automatically retrieves the classification probabilities for the top three subspecies and the estimation of CI, HI, and DSA along with the landmark coordinates. DeepWings© is offered as a free web service at https://deepwings.ipb.pt. (accessed on 1 November 2022)
Herein, we employed DeepWings© to identify 2601 colonies from the analysis of 14,816 wings. These colonies were located in 15 countries, covering the native ranges of A. m. iberiensis, A. m. mellifera and A. m. carnica. In addition, we compared the wing shape data with molecular data for a subset of A. m. iberiensis and A. m. mellifera colonies. Our objectives were (i) to evaluate the functionality of DeepWings© when processing a massive number of wing images of varying quality and produced by different persons; (ii) to assess how closely the colonies identified by DeepWings© matched the endemic subspecies distribution, with an emphasis on M-lineage subspecies; and (iii) to assess the association between the identification produced by DeepWings© and that inferred from molecular markers.
2. Materials and Methods
2.1. Wing Samples
A total of 14,816 right forewing images of workers were obtained from 2601 colonies located in 15 countries (Figure 1, Table S1). The sampling effort encompassed the native distribution of the (i) M-lineage subspecies A. m. iberiensis (Portugal, Spain, and historical introduction in the Azores), A. m. mellifera (Belgium, France, Ireland, Poland, Russia, Sweden, Switzerland, UK), and (ii) C-lineage subspecies A. m. carnica (Croatia, Hungary, Moldavia, Romania, and Slovenia). Samples were collected from hives, except for Hungary and Poland, where the great majority were collected from flowers. However, these samples most likely represent independent colonies, given the > 3 km distance between sampling locations. Some of the samples of A. m. mellifera were collected from protected apiaries (Table S1). The number of wings per colony varied between one (all samples collected from flowers and samples from Groix) and 39, with a median of 5. Most wings were photographed using a stereomicroscope attached to a digital camera, with variable quality and dpi resolution (Figure S1).
Figure 1.
DeepWings© classification at the individual wing and colony levels. Sections in each donut chart represent the proportion of wings (inner ring) or colonies (outer ring) classified into each subspecies. Sample sizes of individual wings/colonies are indicated for each location. In the cases of Groix (France), Poland, and Hungary, each colony is represented by a single wing, so that classification for wings and colonies is coincident. AVI-Avignon, BEL-Belgium, CHE-Switzerland, ESP-Spain, FAI-Faial, FLO-Flores, GRA-Graciosa, GRO-Groix, HRV-Croatia, HUN-Hungary, IRL-Ireland, MDA-Moldova, OUE-Ouessant, PIC-Pico, POL-Poland, PRT-Portugal, ROU-Romania, RUS-Russia, SJO-São Jorge, SMA-Santa Maria, SMI-São Miguel, SVN-Slovenia, SWE-Sweden, TER-Terceira, and WAL-Wales.
2.2. DeepWings© Analysis
Given that the samples were collected from the native ranges of A. m. iberiensis, A. m. mellifera, and A. m. carnica, wing images were classified using the five subspecies classifier of DeepWings© [44], as it is more accurate than the 26 subspecies classifier. Wing images were entered for each colony in batches varying between 4 and 39, except for samples from Groix, Poland, and Hungary. For these locations, because there was only one wing per colony, 40 images (the maximum accepted by the program) were loaded simultaneously.
The output of DeepWings© used in this study included lineage and subspecies classification. The five-subspecies classification model of DeepWings© predicts the probability that a given wing belongs to A. m. iberiensis, A. m. mellifera, A. m. carnica, A. m. ligustica, or A. m. caucasia and corresponding lineage. The software retrieves the three highest prediction probabilities for each wing individually and for an average wing estimated from the wing batch dragged into the file drop zone. DeepWings© constructs the average wing by averaging the coordinates of each of the 19 landmarks across all the wings processed in one batch [44]. When all the wings from a colony are simultaneously uploaded, as is carried out here, the estimated average wing represents the colony and the classification at the colony level can be retrieved for the average wing. We used the wings output at both individual and colony levels. The wings were assigned to one of the five subspecies based on the highest prediction probability, even if the probability was low.
2.3. Association between Wing Data and Molecular Data
The association between DeepWings© classification and that obtained from molecular markers (microsatellites and SNPs) was assessed for 1214 colonies sampled from the native ranges of A. m. iberiensis (Portugal, Spain) and A. m. mellifera (France, Ireland, Wales, Russia). For each colony, the highest prediction probability of belonging to M lineage, as inferred from the 19 landmark coordinates using DeepWings©, was compared against the corresponding M-lineage membership proportion, as inferred from different sets of microsatellites or SNPs (Table S1) using the software Structure [45] or Admixture [46], respectively. The molecular dataset was generated in previous works (see Table S1).
2.4. Statistical Analysis
The probability data obtained from the average wing for the subspecies identified by DeepWings© did not follow a normal distribution, as per the Kolmogorov–Smirnov test. Accordingly, the summary statistics were presented for each location as medians and interquartile ranges (median, interquartile range [IQR]; Tables S2 and S3). The distributions of the probability data points were compared among the identified subspecies in each dataset using the Mann–Whitney U test or the Kruskal–Wallis test, followed by the Dunn’s multiple comparison test with statistical significance levels (p) adjusted by Bonferroni. The association between the probability of belonging to the M lineage, as inferred by DeepWings© from wing shape data, and the membership proportion in the M lineage, as inferred from microsatellite or SNP data, was assessed using the Spearman’s rank-order correlation coefficient (r). All statistical tests were conducted on Graph Pad Prism version 5.01 for Windows, GraphPad Software, San Diego, CA, USA.
3. Results
3.1. Classification of the Total Wings Dataset
A total of 14,816 worker forewings, representing 2601 colonies, were processed by DeepWings©. From these, 856 (5.8%) were rejected by the software as the 19 landmarks could not be annotated due to different image problems (Figure S1). The rejected subset included 106 wings from Poland and 7 from Hungary. Since the colonies of these two countries were mostly represented by single wings, the total number of classified colonies decreased to 2488. In 82.8% (709) of the discarded wings, rejection was mainly due to the very low resolution of the images and noisy background, leading DeepWings© to aggregate close landmarks. The remaining 147 (17.2%) images displayed some kind of corruption, including missing landmarks (9, 1.1%), folded or twisted wings (9, 1.1%), presence of artifacts on the images (37, 4.3%), overlapping wings (43, 5.0%), or broken wings with missing landmarks (49, 5.7%). The proportion of rejected wings varied among datasets, ranging from 0.0% (Portugal, 0 wings) to 39.1% (Ouessant, France, 43 wings). This meant an average (± SD) success in automatic landmark annotation of 92.23% ± 9.36 across the individual datasets.
The final sample sizes identified by the five-subspecies classifier of DeepWings© were 13,960 for wings and 2488 for colonies. Each wing was classified into the subspecies that showed the highest prediction probability, ranging from as low as 0.300 to 1.000, with a median of 0.968. The classification results at the colony level (inferred from the average wing) were similar, as the highest probability ranged from 0.309 to 1.000, with a median of 0.944. The highest median probability was obtained for wings identified as A. m. mellifera (median probability = 0.999, IQR = 0.023) and the lowest for wings identified as A. m. caucasia (0.702, 0.339). Analysis of the 2488 colonies further confirmed this pattern, with A. m. mellifera reaching a median of 0.994 (0.075) and A. m. caucasia 0.636 (0.334). This result makes sense, as colonies were not sampled in the native distribution of the Caucasian subspecies.
Table 1 shows the percentages of wings and colonies, sampled within the range of A. m. iberiensis, A. m. mellifera, and A. m. carnica, for the top two probabilities and for an arbitrary 0.950 probability threshold. For the sake of this table presentation, all the colonies from the Azores were included in the range of A. m. iberiensis, as this subspecies was originally introduced by the Portuguese settlers in historical times [47]. Despite the hybrid zone reported in the southern part of Poland [5], Polish colonies were included in the range of A. m. mellifera, as the great majority of them originated from elsewhere. Finally, all the colonies from Romania were included in the range of A. m. carnica, as the other native subspecies of Romania, A. m. macedonica [5], is not represented in the five-subspecies classifier and the majority of colonies were sampled in the A. m. carnica native range. As shown in Table 1, most wings and colonies were assigned to the expected subspecies. The highest proportions were observed for A. m. iberiensis for both wings (77.2%) and colonies (89.7%) and the lowest for A. m. mellifera (wings: 67.1%; colonies: 41.1%). However, when the 0.950 threshold was applied, the highest proportions of wings (86.7%) and colonies (97.3%) assigned to the expected subspecies were obtained for A. m. carnica in its native range.
Table 1.
Percentages of wings/colonies classified by DeepWings© into each one of the five subspecies according to the origin of the samples (native ranges of A. m. iberiensis, A. m. mellifera, and A. m. carnica). Percentages are shown for the top two classification probabilities and considering a probability threshold > 0.950.
| A. m. iberiensis Native Range | A. m. mellifera Native Range | A. m. carnica Native Range | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Subspecies | 1st Highest Probability | 2nd Highest Probability | Probability > 0.950 | 1st Highest Probability | 2nd Highest Probability | Probability > 0.950 | 1st Highest Probability | 2nd Highest Probability | Probability > 0.950 | 
| A. m. iberiensis | 77.2/89.7 | 11.2/5.8 | 75.0/88.4 | 11.3/5.8 | 45.0/31.8 | 6.7/3.0 | 1.6/0.0 | 9.8/3.0 | 0.5/0.0 | 
| A. m. mellifera | 9.8/4.0 | 58.8/76.3 | 19.2/9.6 | 67.1/41.1 | 12.8/12.2 | 80.0/51.4 | 4.5/1,9 | 15.1/5.4 | 1.9/0.4 | 
| A. m. ligustica | 6.8/4.7 | 10.98.3 | 3.5/1.5 | 6.5/12.5 | 16.5/28.1 | 3.1/6.0 | 20.0/9.2 | 48.9/77.2 | 10.6/2.2 | 
| A. m. carnica | 3.8/1.0 | 7.7/3.8 | 1.9/0.5 | 11.5/35.9 | 7.0/12.9 | 9.3/38.7 | 72.0/88.3 | 17.9/9.8 | 86.7/97.3 | 
| A. m. caucasia | 2.4/0.5 | 11.4/5.9 | 0.4/0.0 | 3.7/4.8 | 18.7/15.0 | 0.8/0.9 | 1.8/0.5 | 8.3/4.6 | 0.3/0.0 | 
The classification proportions of wings and colonies calculated using the highest prediction probability are shown by country in Figure 1. As before, the majority of the individual wings and colonies met the expectations concerning the native range of subspecies or lineages. The classification of the individual wings did not completely match the classification inferred from the average wing for colonies, although the proportions were similar (Tables S2 and S3). However, often, the number of subspecies identified from individual wings was higher than that identified from colonies. For example, in Portugal, A. m. carnica (17 wings) and A. m. caucasia (13 wings) were only identified at the individual wing level. When the classification was conducted at the colony level, these two subspecies were no longer detected. Because colony-level classification is more meaningful for apiculture than individual-level classification, the following results will be presented only for colonies. Furthermore, the common practice of wing-based identification is to average out intra-colony variation through analysis of multiple wings per colony [25].
3.2. Classification of Colonies Sampled in the Native Range of A. m. iberiensis
Of the 651 colonies sampled in the A. m. iberiensis native range, 603 (92.6%) were classified as A. m. iberiensis, with a median probability of 0.919 (0.225). A higher proportion was found in Portugal (95.7%) than in Spain (91.8%), with median probabilities of 0.935 (0.210) and 0.918 (0.226), respectively (Table S3, Figure 2). The second most detected subspecies was A. m. mellifera, representing only 4 (2.9%) and 38 (7.4%) colonies in Portugal and Spain, with median probabilities of 0.976 (0.141) and 0.998 (0.005), respectively. While in Portugal, no differences were found in the distribution of the classification probabilities between the two M-lineage subspecies (U = 180.00, p > 0.05), in Spain, A. m. mellifera showed an unexpectedly higher median probability (U = 2490.00, p = 0.0001). A. m. ligustica was also detected in Iberia, although with a residual proportion (0.5%) and a low median probability (0.676, 0.340). A. m. carnica and A. m. caucasia colonies were detected exclusively in Spain, with the former representing only one colony (0.2%), with a probability of 0.994, and the latter representing two colonies (0.4%), with a median probability of 0.520 (0.172). When analyzed at the lineage level, nearly all colonies (99.1%) were assigned to the expected M-lineage.
Figure 2.
Classification probabilities calculated for colonies from the average wing with DeepWings©. Each dot represents a colony. Boxplots represent the median, interquartile range, maximum, and minimum. Groups with the same letter on top have similar distributions in the classification probabilities. Groups with different letters have different distributions in the classification probabilities for a significance level of 0.05.
In the Azores, A. m. iberiensis was also the most frequently identified subspecies on six of the eight sampled islands (Figure 1), although with a lower median probability (0.888, 0.274) than that found in mainland colonies (0.919, 0.225; Table S3). The highest median probability was observed for São Jorge (0.953, 0.210) and the lowest for Santa Maria (0.769, 0.318), where 23 (88.5%) and 49 (98.0%) colonies had an average wing shape closer to A. m. iberiensis, respectively. Only Graciosa and Terceira had a substantial proportion of C-lineage, with 73.7% and 33.3% of the colonies classified as A. m. ligustica. The median probability was higher for Graciosa (0.801, 0.349) than for Terceira (0.679, 0.288), but these were not significantly different from the median probabilities obtained for A. m. iberiensis in both islands (Graciosa: U = 25.00, p = 0.79; Terceira: U = 649.00, p = 1.00). A low proportion of A. m. ligustica (4 colonies, 5.4%) was also detected on Pico, although with a significantly (U = 50.00, p = 0.04) lower median probability (0.637, 0.280) than that obtained for A. m. iberiensis (0.881, 0.260).
3.3. Classification of Colonies Sampled in the Native Range of A. m. mellifera
Of the 1008 colonies sampled in the A. m. mellifera native range, 414 (41.1%) were classified as A. m. mellifera, with a median probability of 0.994 (0.082). Except for Avignon (France), Wales (UK), and Poland, the remaining locations had a high proportion of colonies classified as A. m. mellifera (Table S3, Figure 2). The two colonies from Belgium had wing shapes matching A. m. mellifera, with a high median probability (1.000, 0.000). In Russia, 50 (96.2%) colonies showed high classification probabilities for A. m. mellifera (0.998, 0.002), and only two were classified as A. m. iberiensis and A. m. ligustica, but with low probabilities: 0.571 and 0.537, respectively. A lower proportion of colonies from Ouessant (8, 72.7%), Groix (23, 63.9%), Ireland (39, 78.0%), Switzerland (5, 55.6%), and Sweden (16, 84.2%) were classified as A. m. mellifera, despite their high probabilities (0.991 ≤ median ≤ 1.000, 0.002 ≤ IQR ≤ 0.088). In these locations, A. m. iberiensis was the second most frequently identified subspecies, with a lower median probability in Ireland (U = 47.00, p = 0.005) and Groix (U = 73.00, p = 0.04). Therefore, when classified at the lineage level, most colonies (>93.0%) matched the expected M lineage (Figure 1).
A high proportion of colonies from Avignon (63.2%), Wales (41.2%), and Poland (64.1%) had average wings more similar to subspecies of C- and O-lineage ancestries than to A. m. mellifera (Figure 1). In Poland, colonies were classified as A. m. carnica (44.5%) more frequently than as A. m. ligustica (14.1%) or A. m. caucasia (5.5%), with a significantly (0.53 < z < 7.07, 1.82 × 10−11 < p < 1.99 × 10−6) higher median probability of 0.983 (0.148) vs. 0.855 (0.268) or 0.641 (0.316), respectively. In contrast, in Wales, colonies classified as A. m. caucasia (23.5%) were more frequent than A. m. ligustica (11.8%) or A. m. carnica (5.9%).
3.4. Classification of Colonies Sampled in the Native Range of A. m. carnica
Of the 368 colonies sampled in the A. m. carnica native range, 325 (88.3%) were classified as A. m. carnica, with a median probability of 0.984 (0.089). The highest proportion was found in Slovenia (95.2%), with a median probability of 0.993 (0.040), followed by Romania (91.1%), with a median probability of 0.989 (0.117), Croatia (90.6%), with a median probability of 0.983 (0.058), Hungary (84.1%), with a median probability of 0.982, (0.121), and finally Moldova (50.0%), with the lowest median probability (0.695, IQR = 0.471), as shown in Table S3, Figure 2. Colonies classified as A. m. ligustica were also found in these five countries, but with lower proportions (4.8% for Slovenia, 7.8% for Romania, 8.8% for Croatia, 10.2% for Hungary, and 30.0% for Moldova) and significantly lower probabilities in Croatia (median = 0.664, IQR = 0.248; U = 122.00, and p = 6.27 × 10−8) and Romania (median = 0.815, IQR = 0.252; U = 134, and p = 0.020). In Hungary, the probabilities of colonies classified as A. m. ligustica (median = 0.922, IQR = 0.122) and A. m. mellifera (median = 0.790, IQR = 0.240) were similar to those of A. m. carnica (H(2) = 5.245, p = 0.072). Colonies classified as A. m. mellifera (1.9%) and A. m. caucasia (0.5%) were rare (Figure 1) and showed low median probabilities, varying between 0.600 (A. m. caucasia in Moldova) and 0.790 (A. m. mellifera in Hungary, Table S3).
3.5. Association between Wing Data and Molecular Data
The association between the probability of belonging to the M lineage, as inferred by DeepWings© from wing shape data, and the membership proportion in the M lineage, as inferred from microsatellite or SNP data, is shown for all samples and locations in Figure 3. A good agreement between the morphological and molecular markers was observed for most of the samples from Spain, mainland Portugal, Santa Maria, São Miguel, São Jorge, Faial, Flores, Groix, Ouessant, Ireland, and Bashkortostan, as they lie in the upper quarter of both the X and Y-axis. This is not the case for the other locations, as samples are more scattered in the two-dimensional space, with many of them exhibiting high values for the molecular marker (X-axis) and low values for the morphological marker (Y-axis) or, less commonly, the opposite. For example, in the dataset of Terceira, 23.0% of the samples lie in the upper quarter for the molecular marker (>0.75) and in the lower quarter for the morphological marker (<25), indicating a poor agreement between the two. Nonetheless, for the whole dataset (N = 1214), a significant association was found between the two markers, as revealed by Spearman’s correlation test (r = 0.31; 0.25 < 95% confidence interval < 0.36; p < 0.0001).
Figure 3.
Scatter plots showing the probability of belonging to M lineage (Y-axis) vs. membership proportions in M lineage (X-axis) for individual datasets and for the whole dataset. Classification probabilities were inferred from wing images using the software DeepWings©, whereas membership proportions were inferred from microsatellites (Groix, Ireland and Russia) or SNPs (Portugal, Spain, Azores, Avignon, Ouessant, and Wales) using the software Structure or Admixture, respectively. Each dot represents a colony.
4. Discussion
In this study, 14,816 wings representing 2601 colonies from 15 countries and covering the native ranges of A. m. iberiensis, A. m. mellifera, and A. m. carnica were analyzed using the WGM approach implemented by DeepWings©. This large and diverse dataset of wing images, originating from such a wide geographical range and acquired using varied image acquisition systems, offered a unique opportunity to test the performance of DeepWings© under real conditions. Moreover, the interaction with the numerous wing image contributors (beekeepers and researchers), who have different experiences and needs, allowed us to introduce several improvements in the software. These included (i) estimation of CI, HI, and DSA; (ii) display of the landmark-annotated wing images; (iii) production of a table with the landmark coordinates; (iv) inference of an average wing from a batch of wings, allowing classification of a colony from multiple wings; and (v) display of the three best classifications for the analyzed wings. DeepWings© successfully classified 94.2% of the wings, consistent with the rate predicted by the software developers [44].
The classification of the European colonies largely matched the endemic M and C-lineages, with proportions of 71.5% and 97.6%, respectively, as the top two probabilities were typically assigned to subspecies sharing lineage ancestry. However, when analyzed at the subspecies level, the matching proportions decreased, as samples collected in the A. m. mellifera range were often classified as A. m. iberiensis (the reverse was less frequent) and samples collected in the A. m. carnica range were often classified as A. m. ligustica. These findings are not surprising given that subspecies belonging to the same evolutionary lineage share a recent ancestor and are, therefore, genetically closely related [48]. Furthermore, European subspecies are largely parapatric and meet in natural contact zones where admixture occurs [28,30,49]. More importantly, due to beekeeping activities involving large-scale colony transhumance and queen trading, many subspecies belonging to the same or different lineages now occur in artificial sympatry, leading to further erosion of the boundaries between subspecies and to the breakdown of subspecies integrity [10,32,50,51,52,53]. These phenomena help explain the large dispersion of the probability values observed for the different locations, both within and between subspecies (Figure 2). Alternatively, but not mutually exclusive, DeepWings© is unable to recognize the full spectrum of natural variation, therefore failing the accurate classification of many colonies. The reference database used by the classification module of DeepWings© was constructed from a small subset of the original wings analyzed by Ruttner in his seminal taxonomic work [44]. Therefore, it only partially covers the natural variation in wing shape patterns that existed at the time for each subspecies. This limitation is further aggravated by the circumstance that Ruttner’s collection was assembled over 50 years ago, and wing venation patterns can change through time, as recently reported for Romanian populations [41].
The detection of wing venation patterns corresponding to the divergent A. m. ligustica and A. m. carnica in France, Switzerland, the UK, Ireland, Poland, and Russia, therefore mismatching the expected M lineage, can be explained by beekeeper-mediated gene flow and is consistent with molecular surveys reporting variable C-derived introgression in A. m. mellifera across Europe [9,10,11,12,13,25,50,54,55,56,57,58,59]. However, A. m. iberiensis detected with high probabilities (> 0.950) in colonies located far from the native range in Iberia (French islands, Ireland, the United Kingdom, Switzerland, Sweden, Poland, and Russia) was likely confounded by DeepWings© with its close relative A. m. mellifera, as international trading of Iberian queens is very uncommon. If this is true, Switzerland and Groix showed a particularly high rate of misclassification, with 33.3% and 30.6% of the colonies labeled as A. m. iberiensis, respectively. In the other locations, the rates were lower (1.9%–20.0%), but still higher than expected, considering that DeepWings© classification accuracy reported for A. m. mellifera was 0.950% [44]. This finding calls for an improvement of the software to increase its discriminating power, which implies expansion of the reference database used for training the current version of DeepWings© with wings from collections other than that of Ruttner. DeepWings© is a dynamic tool that can be easily upgraded to include more wings of each subspecies and/or more subspecies by adding their images to retrain a classification model using machine learning [44].
In contrast to the findings north of the Pyrenees, only a small proportion of the colonies (34 from Spain and 3 from Portugal, 5.7%) examined in the A. m. iberiensis native range were recognized as A. m. mellifera with probabilities above 0.950, indicating that DeepWings© performed relatively well. While misclassification of these colonies cannot be ruled out, the detection of wing shapes matching A. m. mellifera can also be explained by the clinal patterns of variation that were recurrently reported for Iberian populations, with populations from northern Spain being genetically closer to A. m. mellifera than populations from southern Spain and Portugal [27,28,31,60,61]. However, the six colonies (0.92%) with wing shapes closer to the foreign lineages (although with low probabilities) than to the endemic M lineage were likely misidentified, as suggested by the molecular data (Figure 3). Notably, while the membership proportions inferred from the molecular marker were nearly invariable and above 0.94, the probabilities inferred from the wing marker were scattered along the Y-axis in Figure 3. The disagreement between the two markers does not necessarily imply that the observed wing variation originates from a classification artefact. Since the colonies analyzed here were sampled across three north–south transects, therefore covering the entire native range of A. m. iberiensis, it is possible that the probability data reflects genuine variation in the wing venation [28].
In the Azores, where A. m. iberiensis was introduced in the XVI century [62], a higher proportion of colonies was assigned to the C-lineage, as compared to Iberia. This finding was particularly noticeable for Graciosa, where most of the colonies were assigned to the C-lineage, and is compatible with high introgression levels obtained from mtDNA markers [47]. Recurrent importations of foreign queens to sustain a breeding program run in the 1980s and 1990s can explain the results of the Azores [47]. Remarkably, this breeding effort left a strong signature on the genetic makeup of all honey bee populations, except on those from São Miguel and Santa Maria. Unlike the populations of Faial, Graciosa, São Jorge, Pico, and Flores, which showed wide variation in both markers, populations from São Miguel and, in particular, from Santa Maria closely resembled Iberian honey bees. While foreign alleles could be purged by genetic drift, it is also possible that selection acted to restore in these two easternmost islands the gene pool that was historically introduced from mainland Iberia.
Similar to the Iberian wings, most Polish wings classified by DeepWings© originate from north–south sampling transects [58]. Yet, the results could not be more divergent between the two areas of the native range of M-lineage. While in Iberia nearly all the colonies matched the endemic subspecies, in Poland, DeepWings© detected a strong diversification of lineages and subspecies. Over 67.1% of the colonies had wing venation patterns more similar to the C-lineage A. m. carnica and A. m. ligustica, to the O-lineage A. m. caucasia, and to the M-lineage A. m. iberiensis than to A. m. mellifera. While A. m. iberiensis could be confounded with A. m. mellifera, detection of the other three subspecies is compatible with the existence of a natural hybrid zone in southern Poland, where the three lineages come together [49], as well as with long-standing importations of foreign queens [34,63]. However, detection of a high proportion of wings assigned with high probability to A. m. ligustica was unexpected, as molecular studies largely reported in Poland the presence of A. m. carnica, but not of the Italian bee [39,58]. Given that C-lineage subspecies are closely related, and therefore, difficult to differentiate, even by molecular markers [17], it is possible that DeepWings© is swapping the two subspecies, as is likely happening with A. m. mellifera and A. m. iberiensis. Alternatively, but not mutually exclusive, A. m. ligustica genes can were introduced in Poland by undocumented importations of Italian queens and/or by the documented and steadily increasing importations of the artificial strain Buckfast [63]. This is a plausible hypothesis because, contrary to Poland, in Croatia, Hungary, Romania, and Slovenia the proportions of wings matching A. m. ligustica were low, as would be expected in a territory where A. m. carnica is endemic and favored by local beekeepers [5]. Moreover, this finding is consistent with the presence in these countries of mitochondrial and nuclear alleles of A. m. ligustica ancestry [29,33,64,65,66,67,68]. Another possibility is that wing images belonging to C-lineage subspecies unrepresented in the DeepWings© reference database and not included in the five-subspecies classifier (e.g., A. m. cecropia and A. m. macedonica) were identified as A. m. ligustica. This could very well be the case of several colonies sampled east and south of the Carpathian mountain ridge in Romania and Moldova, where A. m. macedonica and A. m. carpatica occurs naturally [5,64,67], that were assigned with low probabilities to A. m. ligustica, A. m. caucasia, and even to the other Romania-native subspecies A. m. carnica.
The association between wing and molecular data, assessed for the colonies sampled in the M-lineage native range, was highly significant but not very strong. Convergence of the two markers was variable and dependent on the integrity of the gene pools. They largely agreed in Iberia, Groix, Ouessant, Santa Maria, São Miguel, Ireland, and Russia, which are known for harboring honey bee populations with high genetic integrity [27,28,47,60,61,69,70,71,72,73,74]. Yet, they often disagreed in the areas where the M-lineage gene pool was threatened by a history of importations of foreign queens, such as in France, UK, or the Azores [9,12,15,47,50,54]. Previous research found that morphological and molecular markers can produce congruent results, supporting the validity of morphometric methods [38,39,75,76]. However, morphometric methods have limitations, especially when dealing with hybridized populations [39,77]. In these populations, the larger variation range of the markers and their overlapping distribution may account for a decrease in the resolution power of the morphometric methods.
While the genetic basis of wing venation is unknown, it is possible that wing traits are encoded by a few genes. Hence, wing markers likely cover a limited portion of the genome variation. In contrast, molecular markers, such as microsatellites and especially SNPs, are widespread across the honey bee genome [20,78], and molecular assays can be designed to fully cover the 16 chromosomes [18,19,24]. If the purpose of colony analysis is to determine the degree of genetic integrity and to estimate introgression proportions with high accuracy, then molecular markers are preferred over wing markers. Otherwise, DeepWings© offers a good alternative for colony identification. By processing batches of up to 40 wings, the software averages out intra-colony variation from a large number of sampled workers, therefore enabling a more robust colony classification. Given that numerous images can be easily and rapidly processed at no cost, DeepWings© is a valuable tool for colony screening in honey bee breeding programs for conservation or other purposes that do not require or do not have a budget for molecular identification.
Acknowledgments
Authors would like to thank Szilvia Kusza for help with collecting samples in Hungary and to Sergey Gurov, Marat Khasanov, Vladimir Kugeiko, Kirill Kugeiko, Vener Sattarov, V. I. Kartavy, Sergey Igumnov and Vladimir Kirikov for preparing and sending wing images from their apiaries in Russia.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/insects13121132/s1, Table S1: Sampling locations, number of analyzed wings and colonies, and number of loci per molecular marker. Locations highlighted in bold indicate samples originating from conservation programs.; Table S2: Number (N) and probability (median, IQR) of wings classified into each subspecies per dataset., Table S3: Number (N) and probability (median, IQR) of colonies classified into each subspecies per dataset. Figure S1: Accepted and rejected wing images by DeepWings©. (a) Examples of accepted images along with quality parameters. (b) Examples of rejected images. The donut chart represents the percentage of corrupted images distributed across the different classes [79,80].
Author Contributions
Conceptualization, M.A.P.; methodology, M.A.P., C.A.Y.G.; software, P.J.R.; investigation, C.A.Y.G., M.A.P. and P.J.R.; resources, M.A.P., D.E., D.H., A.T., B.F., C.B., A.K., G.P.M. and A.O.; writing—original draft preparation, M.A.P. and C.A.Y.G.; writing—review and editing, M.A.P., C.A.Y.G., D.H., R.I., G.P.M., A.O., A.K., C.B. and B.F.; supervision, M.A.P.; project administration, M.A.P.; funding acquisition, M.A.P. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Data Availability Statement
The data presented in this study are available on request from the corresponding author.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research was funded through the project BEEHAPPY (POCI-01-0145-FEDER-029871, FCT and COMPETE/QREN/EU). Carlos A. Yadró García is supported by a research grant (POCI-01- 0145-FEDER-029871) from the Foundation for Science and Technology (FCT), Portugal. FCT provided financial support by national funds (FCT/MCTES) to CIMO (UIDB/00690/2020) and SusTEC (LA/P/0007/2021). Sampling in Poland and Hungary was financed by a grant from National Science Center, Poland (2015/19/B/NZ9/03718) to Andrzej Oleksa. Rustem Ilyasov was supported with grant of the Russian Foundation for Basic Research (E-Asia_t # 19-54-70002).
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Engel M.S. The taxonomy of recent and fossil honey bees (Hymenoptera: Apidae; Apis) J. Hymenopt. Res. 1999;8:165–196. [Google Scholar]
- 2.Meixner M.D., Leta M.A., Koeniger N., Fuchs S. The honey bees of Ethiopia represent a new subspecies of Apis mellifera—Apis mellifera simensis n. ssp. Apidologie. 2011;42:425–437. doi: 10.1007/s13592-011-0007-y. [DOI] [Google Scholar]
- 3.Sheppard W.S., Meixner M.D. Apis mellifera pomonella, a new honey bee subspecies from Central Asia. Apidologie. 2003;34:367–375. doi: 10.1051/apido:2003037. [DOI] [Google Scholar]
- 4.Chen C., Liu Z., Pan Q., Chen X., Wang H., Guo H., Liu S., Lu H., Tian S., Li R. Genomic analyses reveal demographic history and temperate adaptation of the newly discovered honey bee subspecies Apis mellifera sinisxinyuan n. ssp. Mol. Biol. Evol. 2016;33:1337–1348. doi: 10.1093/molbev/msw017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ruttner F. Biogeography and Taxonomy of Honeybees. Springer Verlag; Berlin, Germany: 1988. [Google Scholar]
- 6.Sheppard W. A history of the introduction of honey bee races into the United States. Part 1. Am. Bee J. 1989;129:617–619. [Google Scholar]
- 7.Sheppard W. A history of the introduction of honey bee races into the United States. Part 2. Am. Bee J. 1989;129:664–667. [Google Scholar]
- 8.De La Rúa P., Galián J., Serrano J., Moritz R.F. Genetic structure and distinctness of Apis mellifera L. populations from the Canary Islands. Mol. Ecol. 2001;10:1733–1742. doi: 10.1046/j.1365-294X.2001.01303.x. [DOI] [PubMed] [Google Scholar]
- 9.Jensen A.B., Palmer K.A., Boomsma J.J., Pedersen B.V. Varying degrees of Apis mellifera ligustica introgression in protected populations of the black honeybee, Apis mellifera mellifera, in northwest Europe. Mol. Ecol. 2005;14:93–106. doi: 10.1111/j.1365-294X.2004.02399.x. [DOI] [PubMed] [Google Scholar]
- 10.Soland-Reckeweg G., Heckel G., Neumann P., Fluri P., Excoffier L. Gene flow in admixed populations and implications for the conservation of the Western honeybee, Apis mellifera. J. Insect Conserv. 2009;13:317–328. doi: 10.1007/s10841-008-9175-0. [DOI] [Google Scholar]
- 11.Oleksa A., Chybicki I., Tofilski A., Burczyk J. Nuclear and mitochondrial patterns of introgression into native dark bees (Apis mellifera mellifera) in Poland. J. Apic. Res. 2011;50:116–129. doi: 10.3896/IBRA.1.50.2.03. [DOI] [Google Scholar]
- 12.Ellis J.S., Soland-Reckeweg G., Buswell V.G., Huml J.V., Brown A., Knight M.E. Introgression in native populations of Apis mellifera mellifera L: Implications for conservation. J. Insect Conserv. 2018;22:377–390. doi: 10.1007/s10841-018-0067-7. [DOI] [Google Scholar]
- 13.Pinto M.A., Henriques D., Chávez-Galarza J., Kryger P., Garnery L., van der Zee R., Dahle B., Soland-Reckeweg G., De la Rúa P., Dall’Olio R. Genetic integrity of the Dark European honey bee (Apis mellifera mellifera) from protected populations: A genome-wide assessment using SNPs and mtDNA sequence data. J. Apic. Res. 2014;53:269–278. doi: 10.3896/IBRA.1.53.2.08. [DOI] [Google Scholar]
- 14.Groeneveld L.F., Kirkerud L.A., Dahle B., Sunding M., Flobakk M., Kjos M., Henriques D., Pinto M.A., Berg P. Conservation of the dark bee (Apis mellifera mellifera): Estimating C-lineage introgression in Nordic breeding stocks. Acta Agric. Scand. Sect. A Anim. Sci. 2020;69:157–168. doi: 10.1080/09064702.2020.1770327. [DOI] [Google Scholar]
- 15.De la Rúa P., Jaffé R., Dall’Olio R., Muñoz I., Serrano J. Biodiversity, conservation and current threats to European honeybees. Apidologie. 2009;40:263–284. doi: 10.1051/apido/2009027. [DOI] [Google Scholar]
- 16.Garnery L., Solignac M., Celebrano G., Cornuet J.-M. A simple test using restricted PCR-amplified mitochondrial DNA to study the genetic structure of Apis mellifera L. Experientia. 1993;49:1016–1021. doi: 10.1007/BF02125651. [DOI] [Google Scholar]
- 17.Momeni J., Parejo M., Nielsen R.O., Langa J., Montes I., Papoutsis L., Farajzadeh L., Bendixen C., Căuia E., Charrière J.-D. Authoritative subspecies diagnosis tool for European honey bees based on ancestry informative SNPs. BMC Genom. 2021;22:1–12. doi: 10.1186/s12864-021-07379-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Henriques D., Browne K.A., Barnett M.W., Parejo M., Kryger P., Freeman T.C., Muñoz I., Garnery L., Highet F., Jonhston J.S. High sample throughput genotyping for estimating C-lineage introgression in the dark honeybee: An accurate and cost-effective SNP-based tool. Sci. Rep. 2018;8:1–14. doi: 10.1038/s41598-018-26932-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Henriques D., Parejo M., Vignal A., Wragg D., Wallberg A., Webster M.T., Pinto M.A. Developing reduced SNP assays from whole-genome sequence data to estimate introgression in an organism with complex genetic patterns, the Iberian honeybee (Apis mellifera iberiensis) Evol. Appl. 2018;11:1270–1282. doi: 10.1111/eva.12623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Solignac M., Vautrin D., Loiseau A., Mougel F., Baudry E., Estoup A., Garnery L., Haberl M., Cornuet J.M. Five hundred and fifty microsatellite markers for the study of the honeybee (Apis mellifera L.) genome. Mol. Ecol. Notes. 2003;3:307–311. doi: 10.1046/j.1471-8286.2003.00436.x. [DOI] [Google Scholar]
- 21.Muñoz I., Henriques D., Johnston J.S., Chávez-Galarza J., Kryger P., Pinto M.A. Reduced SNP panels for genetic identification and introgression analysis in the dark honey bee (Apis mellifera mellifera) PLoS ONE. 2015;10:e0124365. doi: 10.1371/journal.pone.0124365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Chapman N.C., Harpur B.A., Lim J., Rinderer T.E., Allsopp M.H., Zayed A., Oldroyd B.P. A SNP test to identify Africanized honeybees via proportion of ‘African’ancestry. Mol. Ecol. Resour. 2015;15:1346–1355. doi: 10.1111/1755-0998.12411. [DOI] [PubMed] [Google Scholar]
- 23.Meixner M.D., Pinto M.A., Bouga M., Kryger P., Ivanova E., Fuchs S. Standard methods for characterising subspecies and ecotypes of Apis mellifera. J. Apic. Res. 2013;52:1–28. doi: 10.3896/IBRA.1.52.4.05. [DOI] [Google Scholar]
- 24.Muñoz I., Henriques D., Jara L., Johnston J.S., Chávez-Galarza J., De La Rúa P., Pinto M.A. SNPs selected by information content outperform randomly selected microsatellite loci for delineating genetic identification and introgression in the endangered dark European honeybee (Apis mellifera mellifera) Mol. Ecol. Resour. 2017;17:783–795. doi: 10.1111/1755-0998.12637. [DOI] [PubMed] [Google Scholar]
- 25.Parejo M., Wragg D., Gauthier L., Vignal A., Neumann P., Neuditschko M. Using whole-genome sequence information to foster conservation efforts for the European Dark Honey Bee, Apis mellifera mellifera. Front. Ecol. Evol. 2016;4:140. doi: 10.3389/fevo.2016.00140. [DOI] [Google Scholar]
- 26.Whitfield C.W., Behura S.K., Berlocher S.H., Clark A.G., Johnston J.S., Sheppard W.S., Smith D.R., Suarez A.V., Weaver D., Tsutsui N.D. Thrice out of Africa: Ancient and recent expansions of the honey bee, Apis mellifera. Science. 2006;314:642–645. doi: 10.1126/science.1132772. [DOI] [PubMed] [Google Scholar]
- 27.Chávez-Galarza J., Garnery L., Henriques D., Neves C.J., Loucif-Ayad W., Jonhston J.S., Pinto M.A. Mitochondrial DNA variation of Apis mellifera iberiensis: Further insights from a large-scale study using sequence data of the tRNAleu-cox2 intergenic region. Apidologie. 2017;48:533–544. doi: 10.1007/s13592-017-0498-2. [DOI] [Google Scholar]
- 28.Chávez-Galarza J., Henriques D., Johnston J.S., Carneiro M., Rufino J., Patton J.C., Pinto M.A. Revisiting the Iberian honey bee (Apis mellifera iberiensis) contact zone: Maternal and genome-wide nuclear variations provide support for secondary contact from historical refugia. Mol. Ecol. 2015;24:2973–2992. doi: 10.1111/mec.13223. [DOI] [PubMed] [Google Scholar]
- 29.Muñoz I., Dall’Olio R., Lodesani M., De la Rúa P. Population genetic structure of coastal Croatian honeybees (Apis mellifera carnica) Apidologie. 2009;40:617–626. doi: 10.1051/apido/2009041. [DOI] [Google Scholar]
- 30.Franck P., Garnery L., Celebrano G., Solignac M., Cornuet J.M. Hybrid origins of honeybees from Italy (Apis mellifera ligustica) and Sicily (A. m. sicula) Mol. Ecol. 2000;9:907–921. doi: 10.1046/j.1365-294x.2000.00945.x. [DOI] [PubMed] [Google Scholar]
- 31.Franck P., Garnery L., Solignac M., Cornuet J.M. The origin of west European subspecies of honeybees (Apis mellifera): New insights from microsatellite and mitochondrial data. Evolution. 1998;52:1119–1134. doi: 10.1111/j.1558-5646.1998.tb01839.x. [DOI] [PubMed] [Google Scholar]
- 32.Uzunov A., Meixner M.D., Kiprijanovska H., Andonov S., Gregorc A., Ivanova E., Bouga M., Dobi P., Büchler R., Francis R. Genetic structure of Apis mellifera macedonica in the Balkan Peninsula based on microsatellite DNA polymorphism. J. Apic. Res. 2014;53:288–295. doi: 10.3896/IBRA.1.53.2.10. [DOI] [Google Scholar]
- 33.Péntek-Zakar E., Oleksa A., Borowik T., Kusza S. Population structure of honey bees in the Carpathian Basin (Hungary) confirms introgression from surrounding subspecies. Ecol. Evol. 2015;5:5456–5467. doi: 10.1002/ece3.1781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bouga M., Alaux C., Bienkowska M., Büchler R., Carreck N.L., Cauia E., Chlebo R., Dahle B., Dall’Olio R., De la Rúa P. A review of methods for discrimination of honey bee populations as applied to European beekeeping. J. Apic. Res. 2011;50:51–84. doi: 10.3896/IBRA.1.50.1.06. [DOI] [Google Scholar]
- 35.Tofilski A. Automatic Measurement of Honeybee Wings. In: MacLeod N., editor. Automated Taxon Identification in Systematics. CRC Press; Boca Raton, FL, USA: 2007. pp. 289–298. [Google Scholar]
- 36.Nawrocka A., Kandemir İ., Fuchs S., Tofilski A. Computer software for identification of honey bee subspecies and evolutionary lineages. Apidologie. 2018;49:172–184. doi: 10.1007/s13592-017-0538-y. [DOI] [Google Scholar]
- 37.Barour C., Baylac M. Geometric morphometric discrimination of the three African honeybee subspecies Apis mellifera intermissa, A. m. sahariensis and A. m. capensis (Hymenoptera, Apidae): Fore wing and hind wing landmark configurations. J. Hymenopt. Res. 2016;52:61–70. doi: 10.3897/jhr.52.8787. [DOI] [Google Scholar]
- 38.Henriques D., Chávez-Galarza J., Teixeira J.S., Ferreira H., Neves C.J., Francoy T.M., Pinto M.A. Wing geometric morphometrics of workers and drones and single nucleotide polymorphisms provide similar genetic structure in the Iberian honey bee (Apis mellifera iberiensis) Insects. 2020;11:89. doi: 10.3390/insects11020089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Oleksa A., Tofilski A. Wing geometric morphometrics and microsatellite analysis provide similar discrimination of honey bee subspecies. Apidologie. 2015;46:49–60. doi: 10.1007/s13592-014-0300-7. [DOI] [Google Scholar]
- 40.Aglagane A., Tofilski A., Er-Rguibi O., Laghzaoui E.-M., Kimdil L., El Mouden E.H., Fuchs S., Oleksa A., Aamiri A., Aourir M. Geographical Variation of Honey Bee (Apis mellifera L. 1758) Populations in South-Eastern Morocco: A Geometric Morphometric Analysis. Insects. 2022;13:288. doi: 10.3390/insects13030288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tofilski A., Căuia E., Siceanu A., Vișan G.O., Căuia D. Historical Changes in Honey Bee Wing Venation in Romania. Insects. 2021;12:542. doi: 10.3390/insects12060542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Tofilski A. Using geometric morphometrics and standard morphometry to discriminate three honeybee subspecies. Apidologie. 2008;39:558–563. doi: 10.1051/apido:2008037. [DOI] [Google Scholar]
- 43.Bookstein F.L. Morphometric Tools for Landmark Data: Geometry and Biology. Cambridge University Press; Cambridge, UK: 1997. p. 435. [Google Scholar]
- 44.Rodrigues P.J., Gomes W., Pinto M.A. DeepWings©: Automatic Wing Geometric Morphometrics Classification of Honey Bee (Apis mellifera) Subspecies Using Deep Learning for Detecting Landmarks. Big Data Cogn. Comput. 2022;6:70. doi: 10.3390/bdcc6030070. [DOI] [Google Scholar]
- 45.Pritchard J.K., Stephens M., Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–959. doi: 10.1093/genetics/155.2.945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Alexander D.H., Novembre J., Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–1664. doi: 10.1101/gr.094052.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ferreira H., Henriques D., Neves C.J., Machado C.A., Azevedo J.C., Francoy T.M., Pinto M.A. Historical and contemporaneous human-mediated processes left a strong genetic signature on honey bee populations from the Macaronesian archipelago of the Azores. Apidologie. 2020;51:316–328. doi: 10.1007/s13592-019-00720-w. [DOI] [Google Scholar]
- 48.Wallberg A., Han F., Wellhagen G., Dahle B., Kawata M., Haddad N., Simões Z.L.P., Allsopp M.H., Kandemir I., De la Rúa P. A worldwide survey of genome sequence variation provides insight into the evolutionary history of the honeybee Apis mellifera. Nat. Genet. 2014;46:1081–1088. doi: 10.1038/ng.3077. [DOI] [PubMed] [Google Scholar]
- 49.Meixner M.D., Worobik M., Wilde J., Fuchs S., Koeniger N. Apis mellifera mellifera in eastern Europe–morphometric variation and determination of its range limits. Apidologie. 2007;38:191–197. doi: 10.1051/apido:2006068. [DOI] [Google Scholar]
- 50.Requier F., Garnery L., Kohl P.L., Njovu H.K., Pirk C.W., Crewe R.M., Steffan-Dewenter I. The conservation of native honey bees is crucial. Trends Ecol. Evol. 2019;34:789–798. doi: 10.1016/j.tree.2019.04.008. [DOI] [PubMed] [Google Scholar]
- 51.Bouga M., Harizanis P.C., Kilias G., Alahiotis S. Genetic divergence and phylogenetic relationships of honey bee Apis mellifera (Hymenoptera: Apidae) populations from Greece and Cyprus using PCR–RFLP analysis of three mtDNA segments. Apidologie. 2005;36:335–344. doi: 10.1051/apido:2005021. [DOI] [Google Scholar]
- 52.Ivanova E., Staykova T., Bouga M. Allozyme variability in honey bee populations from some mountainous regions in the southwest of Bulgaria. J. Apic. Res. 2007;46:3–7. doi: 10.1080/00218839.2007.11101359. [DOI] [Google Scholar]
- 53.Nedić N., Francis R.M., Stanisavljević L., Pihler I., Kezić N., Bendixen C., Kryger P. Detecting population admixture in honey bees of Serbia. J. Apic. Res. 2014;53:303–313. doi: 10.3896/IBRA.1.53.2.12. [DOI] [Google Scholar]
- 54.Henriques D., Lopes A.R., Chejanovsky N., Dalmon A., Higes M., Jabal-Uriel C., Le Conte Y., Reyes-Carreño M., Soroker V., Martín-Hernández R. Mitochondrial and nuclear diversity of colonies of varying origins: Contrasting patterns inferred from the intergenic tRNAleu-cox2 region and immune SNPs. J. Apic. Res. 2021;61:305–308. doi: 10.1080/00218839.2021.2010940. [DOI] [Google Scholar]
- 55.Chen C., Parejo M., Momeni J., Langa J., Nielsen R.O., Shi W., CONTRIBUTORS S.W.D., Vingborg R., Kryger P., Bouga M. Population Structure and Diversity in European Honey Bees (Apis mellifera L.)—An Empirical Comparison of Pool and Individual Whole-Genome Sequencing. Genes. 2022;13:182. doi: 10.3390/genes13020182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Keller E.M., Harris I., Cross P. Identifying suitable queen rearing sites of Apis mellifera mellifera at a regional scale using morphometrics. J. Apic. Res. 2014;53:279–287. doi: 10.3896/IBRA.1.53.2.09. [DOI] [Google Scholar]
- 57.Skonieczna Ł. Conservation of Apis mellifera mellifera in Poland. Биoмика. 2016;8:61–64. [Google Scholar]
- 58.Oleksa A., Kusza S., Tofilski A. Mitochondrial DNA suggests the introduction of honeybees of African ancestry to East-Central Europe. Insects. 2021;12:410. doi: 10.3390/insects12050410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Kaskinova M.D., Gaifullina L.R., Saltyoka E.S., Poskryakov A.V., Nikolenko A.G. Dynamics of the Genetic Structure of Apis mellifera Populations in the Southern Urals. Russ. J. Genet. 2022;58:36–41. doi: 10.1134/S1022795422010045. [DOI] [Google Scholar]
- 60.Miguel I., Iriondo M., Garnery L., Sheppard W.S., Estonba A. Gene flow within the M evolutionary lineage of Apis mellifera: Role of the Pyrenees, isolation by distance and post-glacial re-colonization routes in the western Europe. Apidologie. 2007;38:141–155. doi: 10.1051/apido:2007007. [DOI] [Google Scholar]
- 61.Cánovas F., De la Rúa P., Serrano J., Galián J. Geographical patterns of mitochondrial DNA variation in Apis mellifera iberiensis (Hymenoptera: Apidae) J. Zool. Syst. Evol. Res. 2008;46:24–30. doi: 10.1111/j.1439-0469.2007.00435.x. [DOI] [Google Scholar]
- 62.Canto E. [(accessed on 2 November 2022)];Arquivo dos Açores—Volume 1. 1878 Volume 1 Available online: https://portulanclarin.net/repository/browse/archivo-dos-acores-dir-ernesto-do-canto-1a-serie-ponta-delgada-vol-1-12/e87bcf2e2b2c11ea843902420a000004f9e866e2ae1347fea56119669130a535/ [Google Scholar]
- 63.Bieńkowska M., Splitt A., Węgrzynowicz P., Maciorowski R. The Buzz Changes within Time: Native Apis mellifera mellifera Honeybee Subspecies Less and Less Popular among Polish Beekeepers Since 1980. Agriculture. 2021;11:652. doi: 10.3390/agriculture11070652. [DOI] [Google Scholar]
- 64.Coroian C.O., Muñoz I., Schlüns E.A., Paniti-Teleky O.R., Erler S., Furdui E.M., Mărghitaş L.A., Dezmirean D.S., Schlüns H., De La Rua P. Climate rather than geography separates two European honeybee subspecies. Mol. Ecol. 2014;23:2353–2361. doi: 10.1111/mec.12731. [DOI] [PubMed] [Google Scholar]
- 65.Muñoz I., De la Rúa P. Wide genetic diversity in Old World honey bees threaten by introgression. Apidologie. 2021;52:200–217. doi: 10.1007/s13592-020-00810-0. [DOI] [Google Scholar]
- 66.Božič J., Kordiš D., Križaj I., Leonardi A., Močnik R., Nakrst M., Podgoršek P., Prešern J., BAJEC S.S., ZORC M. Novel aspects in characterisation of Carniolan honey bee (Apis mellifera carnica, Pollmann 1879) Acta Agric. Slov. 2016;5:18–27. [Google Scholar]
- 67.Mărghitaş L.A., Coroian C., Dezmirean D., Stan L., Furdui E. Genetic Diversity of Honeybees from Moldova (Romania) Based on mtDNA Analysis. Bull. UASVM Anim. Sci. Biotechnol. 2010;67:396–402. [Google Scholar]
- 68.Buescu E., Gurau M.R., Danes D. Identification Of The Honeybee Subspecies From Some Romanian Counties Using A Semiautomatic System For Analyzing Wings; Proceedings of the CBU International Conference Proceedings; Prague, Czech Republic. 21–23 March 2018; pp. 1124–1128. [Google Scholar]
- 69.Browne K.A., Hassett J., Geary M., Moore E., Henriques D., Soland-Reckeweg G., Ferrari R., Mac Loughlin E., O’Brien E., O’Driscoll S. Investigation of free-living honey bee colonies in Ireland. J. Apic. Res. 2020;60:229–240. doi: 10.1080/00218839.2020.1837530. [DOI] [Google Scholar]
- 70.Hassett J., Browne K.A., McCormack G.P., Moore E., Society N.I.H.B., Soland G., Geary M. A significant pure population of the dark European honey bee (Apis mellifera mellifera) remains in Ireland. J. Apic. Res. 2018;57:337–350. doi: 10.1080/00218839.2018.1433949. [DOI] [Google Scholar]
- 71.Garnery L. Rapport d´ Expertise 2018. Analyses Génétiques de la Population d´Abeilles Melliféres de I´Ile de Groix. Laboratoires Evolution Génomes Comportement et Ecologie; Gif-sur-Yvette, France: 2018. [Google Scholar]
- 72.Ilyasov R., Poskryakov A., Petukhov A., Nikolenko A. Molecular genetic analysis of five extant reserves of black honeybee Apis melifera melifera in the Urals and the Volga region. Russ. J. Genet. 2016;52:828–839. doi: 10.1134/S1022795416060053. [DOI] [PubMed] [Google Scholar]
- 73.Ilyasov R.A., Lee M.-L., Yunusbaev U., Nikolenko A., Kwon H.-W. Estimation of C-derived introgression into A. m. mellifera colonies in the Russian Urals using microsatellite genotyping. Genes Genom. 2020;42:987–996. doi: 10.1007/s13258-020-00966-0. [DOI] [PubMed] [Google Scholar]
- 74.Garnery L., Franck P., Baudry E., Vautrin D., Cornuet J.-M., Solignac M. Genetic diversity of the west European honey bee (Apis mellifera mellifera and A. m. iberica) I. Mitochondrial DNA. Genet. Sel. Evol. 1998;30:S31–S47. doi: 10.1186/1297-9686-30-S1-S31. [DOI] [Google Scholar]
- 75.De la Rúa Tarín P., Radloff S., Hepburn R., Serrano J. Do molecular markers support morphometric and pheromone analyses? A preliminary case study in Apis Mellifera populations of Morocco. Arch. De Zootec. 2007;56:33–42. [Google Scholar]
- 76.Irati M., Baylac M., Iriondo M., Manzano C., Garnery L., Estonba A. Both geometric morphometric and microsatellite data consistently support the differentiation of the Apis mellifera M evolutionary branch. Apidologie. 2011;42:150–161. [Google Scholar]
- 77.Moritz R.F. The limitations of biometric control on pure race breeding in Apis mellifera. J. Apic. Res. 1991;30:54–59. doi: 10.1080/00218839.1991.11101234. [DOI] [Google Scholar]
- 78.Weinstock G.M., Robinson G.E., Gibbs R.A., Weinstock G.M., Weinstock G.M., Robinson G.E., Worley K.C., Evans J.D., Maleszka R., Robertson H.M., et al. Insights into social insects from the genome of the honeybee Apis mellifera. Nature. 2006;443:931–949. doi: 10.1038/nature05260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Chávez-Galarza J., Henriques D., Johnston J.S., Azevedo J.C., Patton J.C., Muñoz I., De la Rúa P., Pinto M.A. Signatures of selection in the Iberian honey bee (Apis mellifera iberiensis) revealed by a genome scan analysis of single nucleotide polymorphisms. Mol. Ecol. 2013;22:5890–5907. doi: 10.1111/mec.12537. [DOI] [PubMed] [Google Scholar]
- 80.Henriques D., Lopes A.R., Pinto M.A. Centro de Investigaçao de Montanha-Instituto Politécnico de Bragança, Bragança, Portugal. Manuscript in preparation .
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data presented in this study are available on request from the corresponding author.




