Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2026 Jan 29.
Published in final edited form as: J R Soc Interface. 2026 Jan 28;23(234):rsif.2025.0225. doi: 10.1098/rsif.2025.0225

Directionality range in Emlen funnels

Ilias Patmanidis 1,2, Bo Leberecht 3, Martin Fränzle 4,5, David Lentink 6,*, Ilia A Solov’yov 1,7,8,*, Henrik Mouritsen 3,7,*
PMCID: PMC7618669  EMSID: EMS210630  PMID: 41592731

Abstract

Our understanding of bird orientation guided by magnetic and visual cues is primarily based on Emlen funnel experiments. Migration-motivated birds jump in the direction they want to fly, and their feet leave marks on paper lining the funnel, which yields the preferred direction. Despite the low-signal-to-noise ratio, this paradigm has proven instrumental for studying magnetoreception in birds. However, the high noise limits the questions that can be answered and there is no data-informed guideline for selecting sample sizes that have a high likelihood to be conclusive. Furthermore, differences in experimental design traditions limit comparison and reproducibility across studies, slowing down discovery. We performed a large meta-analysis across double-blind magnetic orientation studies with Emlen funnels performed at Oldenburg to statistically characterize Emlen funnel data and determine minimal sampling requirements for conclusive experimental design. The analysis confirms that pre-selecting migration-motivated animals before the real experiments start improves statistical power by reducing noise. We also highlight mathematical limitations of the widely used directionality measure “r”, due to lacking sample-size bias correction, and present realistic ranges for expected bird directedness in Emlen funnels. Combined, these results provide critical design and analysis guidelines for statistically informative magnetic orientation experiments.

1. Introduction

Magnetoreception, the ability to detect and respond to the geo-magnetic field, plays a crucial role in bird migration and avian navigation [16]. While the exact physico-chemical mechanisms of magnetoreception are not fully understood, its use is demonstrated by ample behavioural evidence [68]. One of the currently researched hypotheses assumes the photo-activation of light-sensitive cryptochrome 4 proteins in the avian retina and the consequent formation of radical pairs that are sensitive to the Earth’s magnetic field[5, 913]. The magnetic field sensitivity of the radical pairs is expected to be transmitted to the brain via the thalamofugal visual pathway[2, 1418]. The radical pair formation causes a cascade of events starting with changes in the cryprochrome’s electron spin dynamics which ultimately affects the bird’s orientation behaviour[12]. Alternative hypotheses suggest either magnetic particles (e.g. magnetite or maghemite) acting like microscopic compass needles [19, 20] or sensation of magnetic induction in the semicircular canals [21, 22]. The changes in the orientation behaviour based on either of these hypothesised magnetic perception mechanisms has been studied in Emlen funnels [23].

Behavioural experiments in Emlen funnels[2326] are a key paradigm for studying bird orientation behaviour and understanding the magnetic sensory mechanism(s) of birds. In these experiments, a bird is placed into a funnel-shaped cage lined with scratch-sensitive paper on its walls (an improvement upon the first inkpad-and-paper-based setup) [25]. Since the birds, on average, tend to jump towards their preferred migratory direction, their intended orientational direction and their ability to sense magnetic fields (or other environmental cues) is evaluated by analysing the scratches left by the birds’ feet on the scratch-sensitive paper, Figure 1A and S1 in the Supporting Information (SI). By blocking visual orientation cues, birds can be tested while they only rely on their magnetoreception capabilities to determine their preferred direction[16, 2730]. Furthermore, Emlen funnels provide a controlled environment, in which potential disturbances to their sensory systems can be tested such as the effects of radio frequency electromagnetic noise (also called electro-smog) on the birds’ magnetic compass orientation capabilities in the context of the radical pair mechanism of avian magnetoreception[2936].

Figure 1.

Figure 1

A: Schematic representation of the workflow in Emlen funnel experiments. B: Schematic representation of our data analysis workflow of the compiled dataset used in this work. The birds are categorized based on the number of experiments/scratched papers (Ntests, number of tests per individual) that they participated in and their associated individual directedness (rindividual from the Rayleigh test). All birds with larger values than the set Ntests and rindividual cut-offs were grouped to-gether and the deviation from uniformity was determined using the P value from the Rayleigh test. For each such group, its directedness (r) and mean orientation were calculated. Effectively the groups with high cut-offs include the most consistently oriented birds and the birds with more tests.

While Emlen funnels have provided a lot of very important knowledge about the sensory systems and mechanisms birds used for navigation, they also have some methodological weaknesses. In nature, birds do not migrate every day: they mix stopover time for resting, recovering and refuelling with migration days/nights [37]. One important consequence of this is that birds are not motivated to migrate every time that they are put in an Emlen funnel. Consequently, on some test nights, the birds tested in Emlen funnels are expected to be oriented in random directions (non-motivated bird nights), while on other nights, they are expected to be oriented mainly in the mean migratory direction (motivated bird nights). The challenge is that there is no way to determine a priori whether a given bird is motivated to migrate on a given night. Consequently, the behavioural recordings in Emlen funnels tend to contain noisy data whose interpretation is not straightforward, especially when the sample sizes are too low. The way most researchers using Emlen funnels deal with this noise is to test the birds enough times, so that the motivated, oriented nights are likely to be reflected in the mean direction of many tests of a given bird. Hence, different protocols for conducting Emlen funnel experiments and analyzing the behavioural data have been proposed [16, 2429, 38, 39].

The present exploratory study combines data from a large collection of Emlen funnel experiments and aims to quantify general trends that govern the consistency of bird orientation in the Emlen paradigm. We combined data sets from magnetic-cues-only studies performed by researchers from the University of Oldenburg using very similar protocols for all studies. Our results provide realistic ranges for the expected magnetic orientation of the birds in Emlen funnels and highlight the statistical limitations of commonly used analyses. Critically, our results offer data-informed experimental insights for designing and analyzing future magnetic orientation experiments in Emlen funnels. Apart from the fundamental questions regarding avian navigation and migration, understanding and parametrizing the underlying orientation mechanisms is also important for the development of biomimetic navigation systems that mimic the sensory abilities of birds[4042]. Ultimately, similar navigation principles can be utilized to design novel technologies and algorithms for the navigation and orientation of artificial robotic devices.

2. Methods

Data set

We analysed data from Emlen funnel studies dating from 2009 to 2023 [30, 32, 3436, 4346]. We gathered the raw digital data of all the studies from 2009 to 2023 and merged them to all conform to the same formatting [30, 32, 3436, 4346]. The entries were spell checked and verified by comparing them to the lab books and spreadsheets of the respective migratory seasons to ensure that the analysis of each scratch paper was assigned correctly (for 191 entries parts of the scratch paper analysis could not be retrieved). While all birds had numbered colour rings, occasionally combinations of number and colour were re-used or changed. To track the reoccurring and changing number-colour-combinations, all individual birds were distinctly labelled by comparing their numbered colour rings to arrival and departure dates in the animal keeping and catching logs. To get the most realistic picture of the noise in Emlen funnel data, our dataset includes data that were collected during the conduct of these studies, but in which the control conditions did not result in significant group orientation. In these cases, the data could not be clearly interpreted and, therefore, remained unpublished. Lastly, the data was filtered to only contain orientational values for control conditions of European robins and Eurasian blackcaps, i.e. removing entries evaluated as inactive, randomly oriented, or bimodally oriented.

In total, we collected 27,445 curated entries for the preferred bird orientation based on scratched papers from Emlen funnels, Figure 2. This is the largest laboratory-based data set that has so far been compiled and analysed in detail to study the magnetic orientation of migratory birds and the statistics of Emlen funnel data. The studies were performed on two migratory species, European robins and Eurasian blackcaps, for two different magnetic field conditions, normal magnetic field (NMF) and changed magnetic field (CMF). In the CMF condition, the horizontal component of the magnetic field was rotated 120° counter-clockwise. In addition to species and magnetic conditions, the data set included information on the bird activity, the concentration of scratches and the date of each measurement. Bird activity and concentration of scratches are categorical descriptors that report the performance of birds in the Emlen funnels. Low activity values suggest that the birds did not leave many scratches on the papers, and low concentration values suggest that the scratches were scattered in different directions. On the other hand, high activity and concentration values indicate that the birds scratched the papers many times in a specific direction.

Figure 2. Flow diagram for the selection and categorization of bird data into eight groups.

Figure 2

The names of each group describe its particular permutation: European robin (prefix “er-”) vs. Eurasian blackcap (prefix “bc-”); normal magnetic field (NMF) vs. changed magnetic field (CMF); spring (suffix “-s”) vs. autumn (suffix “-a”) migration season. For example, erNMFs corresponds to entries for European robins in NMF during spring, whereas bcCMFa corresponds to entries for Eurasian blackcaps in CMF during autumn.

Among the bird entries, there were several missing values for the activity and the concentration of scratches (191 entries, 0.7% of the total number of entries). These entries were completely excluded from our analysis to ensure consistency. The original experiments have been conducted during the bird’s biannual migratory seasons in spring and autumn, so we pooled entries for each migratory season as February-June (spring) and August-December (autumn), respectively. The compiled data set of Emlen funnel scratched papers included entries for birds that participated in pre-screening (experiments to evaluate the migration motivation of individuals) and actual tests (of pre-screened, motivated birds).

In the beginning of each of the migratory season, pre-screening tests take place to determine (i) which birds are motivated to perform orientation behaviour in the funnels and (ii) which birds use their magnetic sense to orient inside the funnels. Various parameters can affect the behaviour of birds and ultimately the experimental results. For example, there might be individuals inside the bird groups that are non-migratory [47], or due to time restrictions, some tests may have been conducted outside of the time frame when birds experience migration restlessness [48, 49]. Based on the pre-screening data, the birds classified as active and oriented are selected for the actual experiments. The selection of active and oriented birds is usually based on the consistency of activity and the directedness metric that is recorded for each bird. In general, pre-screening tests are necessary to ensure that only motivated birds participate in the actual tests.

The data set that we compiled included entries for birds that participated in the actual experiments (motivated birds) and entries for birds that participated in the pre-screening tests, but never made it to the final experiments (unmotivated birds). In order to reduce any possible selection bias, our meta-analysis was based on the individual bird directedness (rindividual) and the number of tests (Ntests) regardless of the individual’s activity or direction, Figure 1B. In theory, motivated birds would have a tendency towards a specific direction, and this tendency should be reflected on their individual directedness. To ensure that we had representative statistical data for each individual, only birds with 3 or more entries were considered for the analysis (as was custom in recent studies[35, 36]). The final number of entries that was used in our analysis was 26,326 entries (95.9% of the total number of entries), Figure 2.

For our meta-analysis design, the data set was categorized into 8 groups based on bird species (European robin and Eurasian blackcap), magnetic field condition (NMF and CMF) and season (spring and autumn), Figure 2. This classification enabled direct comparison of mean orientation between the eight groups of birds as a function of species, condition, and season. In this analysis, we ignored the minute changes of the earth’s magnetic field over time at Oldenburg, as we assumed that any potential influence cannot be observed in birds’ behaviour in Emlen funnels at present due to the extensive noise in the data. Accordingly, entries from different years have been grouped together.

Analysis

The Rayleigh test is the most common statistical method to test hypotheses for circular data and to evaluate deviations from uniformity [50, 51]. There are two variables that are usually reported when a Rayleigh test is performed. First, the p-value (P) which is the probability criterion for accepting or rejecting the null hypothesis (Ho) that a sample comes from a uniform distribution. Second, the r-value (r) which is the length of the mean vector for a set of normalized directional vectors. The r-value ranges from 0 to 1, where 0 indicates a random distribution of the entries and 1 suggests that all entries fall on the same spot. Among other parameters, r is often used as a directedness criterion (rindividual) for determining which birds are consistently motivated and able to perceive the earth’s magnetic field [30, 32, 3436, 4346, 52]. A typical rindividual cut-off is set at 0.1[30, 32, 4346] or 0.2[3436, 52]. In these cases, birds that demonstrated rindividual larger than these cut-off values and provided more than a specified number of oriented tests (Ntests) were used for the published statistics. This measure is usually taken to avoid that poorly performing birds (e.g. with few oriented tests or with random orientation rindividual < 0.2 or 0.1) affect the overall group orientation by contributing a more or less random value. Apart from the rindividual rating, the P value has also been used as a selection criterion for individual birds.[5355] This practice should be avoided. Even though the Rayleigh test is commonly used to analyse orientation data from Emlen funnel experiments, the underlying assumptions for using the test are not always met. For example, a key issue is that the scratches made by each individual bird cannot be assumed to be independent. [38] For this reason, rindividual is more appropriate for analysing the bird scratches (than P), because calculating the sum of the mean orientation vector does not rely on the assumption that the data are independent. Hence, we used this metric to evaluate (i) the directedness of each bird (rindividual) and (ii) the overall directedness (r) for every species, condition and season group in the data set. In general, the Rayleigh test captures deviations from uniformity reasonably well, especially when the data is concentrated in one direction, but it should only be used when the data meet the necessary assumptions. [51]

Power analysis is a statistical method used to determine the statistical power of a hypothesis test, which is the probability of correctly rejecting the null hypothesis when it is false. Power analysis is often overlooked during experimental design due to the practical limitations of conducting the experiments, but it should not be ignored when interpreting the statistical analysis outcome. To address this, we performed a power analysis to estimate the minimum sample size (N) that is required to detect an actual effect when performing hypothesis testing using Emlen funnel data. Using appropriate sample sizes reduces the risk of false negative predictions (β, Type II error), and increases the statistical confidence in our results. For performing a power analysis, the desired confidence level (α) and statistical power need to be defined. Traditionally, α is set to 0.05, which is the chance of detecting an effect if there is no such effect (Type I error), and the statistical power (1 - β) is set at 0.8, which means that the null hypothesis is correctly rejected in 80% of the cases. The reasons and limitations of this tradition are explained in the statistical literature. [5658]

3. Results

Statistical analysis

First, we assessed the performance of the Rayleigh test for different known distributions to determine the optimal sample size for orientation analysis. The von Mises distribution, see Eq. (1), is used in circular statistics as a correspondence to the normal distribution in linear statistics, making it the most commonly used distribution model for circular data. We used different values for the spread (k -value) of the von Mises distribution to generate different distributions and perform controlled sample analyses. Smaller k values indicate broader distributions. This simulation approach allows us to thoroughly assess the relationship between the data sampling strategy and statistical analysis outcome, Figure 3A.

f(xμ,κ)=exp(κcos(xμ))2πI0(κ) (1)

Figure 3. Results from performing the Rayleigh test on different von Mises distributions.

Figure 3

A: Von Mises distributions with different spread (k), see Eq. (1). Each line corresponds to a different von Mises distribution. The legend includes k values with the respective standard deviations (σ) from normal distributions. B: P values from the Rayleigh test as a function of sample size (N) for different von Mises distribution spread (k). The lines indicate the mean value and the transparent bars represent standard deviation after sampling the von Mises distribution 10,000 times for each combination of k and N. The black dashed line highlights the traditional significance level (α=0.05). C: r values from the Rayleigh test as a function of the sample size for different von Mises distributions. D: Power analysis for rejecting the null hypothesis of uniformity for different von Mises distributions. Estimates are based on 10,000 samples for each combination of k and N. The significance level α was set to the traditional level of 0.05. The black horizontal dashed line highlights the associated statistical power threshold of 0.8.

where μ is the position of the mean and I0(κ) is a modified Bessel function (of the first kind of order 0).

To explore the performance of the Rayleigh test, we generated data sets by sampling values (N) from different von Mises distributions (by using different k), and performed the Rayleigh test for each case. In other words, we have tested the directedness of this circular distribution as a function on its spread (k) for different sample sizes (N) to see if they would pass the traditional significance level of α=0.05. This controlled simulation shows that even for small sample sizes (N=5 or N=10), highly concentrated distributions (k>3) usually give P values smaller than the traditional significance level (α=0.05), Figure 3B. In contrast, very broad distributions (k<1) do not reach the traditional significance level with such small sample sizes. For N=20, most theoretical distributions present a realistic capability of falling below the traditional significance level (α=0.05).

For small sample sizes, we find that the Rayleigh concentration parameter (r) tends to be elevated, Figure 3C. This trend is even more evident in broader von Mises distributions (k<2), and it is present in all sampled distributions. Furthermore, the variation is larger for smaller sample sizes, in particular for broader von Mises distributions (k<2). The effects of sample size on r has already been reported by Batschelet[50]. In fact, it is a standard result in circular statistics that the formula generally applied for empirically estimating the directedness from samples constitutes a biased estimator, with Kutil[59] providing a perfect bias correction for estimating r2 and a near-perfect bias correction for estimating r from samples.1

The trends of the generally used (biased) empirical estimator are displayed in Figure 3C, which shows that distributions for all concentration parameters k eventually (and logically) converge to a r-value asymptote, given a large enough sample size. Notably, for concentrated distributions (k>2), the proximity to this asymptote is reached at considerably lower sample sizes than for less concentrated distributions (k>2). Orientation experiments usually focus on the deviation of the group orientation from a random distribution (k =0). If these theoretical trends are taken as an indication for real experiments, group sample sizes around N = 20 should allow for a differentiation between a random distribution (k =0) and broad distributions (k =1).

The power analysis confirms the aforementioned effects, Figure 3D. Similarly, it also shows that sample sizes around 5 do not provide enough statistical power, when the spread of the data starts getting large (k<2). Sample sizes around 15 significantly improve the statistical power, but the results show that ideally larger sample sizes should be used to obtain enough statistical power. The relationship between sample sizes and statistical power, the features of various simulated data and the performance of different statistical tests in the analysis of circular data has been explored in the literature. [50, 6063]

It is important to emphasize that the statistical analysis was performed on data drawn from von Mises distributions, thus the ideal sample sizes apply only for these data. The ideal sample sizes for the von Mises distributions should be used more as a reference, rather than as strict guidelines. Actual experiments tend to produce much noisier data, and larger sample sizes are more likely to capture the real trends, but they can be hard to obtain in reasonable time frames. Furthermore, increasing the sample sizes and the number of tests can raise logistical issues. Regardless, 3R ethical considerations ask researchers to design experiments with sufficient power to be conclusive when the data for calculating power is available. The meta-analysis that we present here enables this.

Emlen funnels data analysis

The combined data set included in total 26,326 Emlen funnel scratched paper entries from 2,734 individual birds. The first goal was to obtain an estimate of values for the r parameter and compare them to ideal von Mises distributions.

In order to analyse the data, we categorised the birds as a function of the number of tests (Ntests; the number of Emlen funnel scratched papers) that were performed for each bird and the individual directedness (rindividual) of each bird, Figure 4. These two parameters, the number of tests and the directedness of each bird, were used to apply the cut-off grouping. Figure 4 explains the cut-off process in the group of European robins tested in normal magnetic field conditions in spring (erNMFs). Each point on these plots represents a group of birds that includes birds with higher values than the Ntests and rindividual cut-offs (red regions in Figure 4A). By increasing the cut-off step-by-step, groups with a higher directedness and a greater number of tests are effectively pooled together. By reducing the cut-off, the bird group sizes increase as birds with lower number of tests and directionality are included in the analysis. Once the groups have been determined, the Rayleigh test is applied to each group, and the directedness (r) and P value are calculated for each point on the plot, Figure 4B. Then, groups of birds with the same sample sizes (black contour lines) that presented lower P values than the significance level (α < 0.05) were used for calculating the mean directedness (r ) for each sample size, Figure 4C. The range of values for Ntests was 3 to 30, and for rindividual, it was 0 to 1, respectively.

Figure 4. Description of the Emlen data analysis for European robins in the normal magnetic field during spring (erNMFs) group.

Figure 4

A: Number of birds as a function of directedness (rindividual) and the number of Emlen funnel tests (Ntests; performed on a single individual) after applying the cut-off grouping process. The black contour lines indicate the size of each group or, in other words, the number of birds (N = 5, 15, 25 and 50) that met the cut-off criteria in each point of the plot. The red regions highlight the features of birds included in points i and ii. Specifically, i and ii include 15 and 25 birds, respectively. B: Results from the Rayleigh test, r as a function of rindividual and Ntests and P value boundaries. The red contour line highlights the cases for which the P of each Rayleigh test is lower than 0.05. C: Bird groups that showed statistically significant deviations from uniformity (P < 0.05). The groups within the red region were used for calculating the mean directedness (r ).

Four main conclusions could be formulated based on the meta-analysis of the existing Emlen funnel data:

  1. The data in the meta-analysis has a high level of noise, as would be expected, given the noisy origin of the data for these behavioural experiments. Groups with a few birds (N<5) have high group concentration parameter (r ), but r decreases fast as the group sizes increase. The lower left corner of the plot in Figure 4A, which includes most birds in each group, shows generally small r values. The main reason for the low r values is the inclusion of pre-screening birds that never reached the actual experiments.

  2. Large sample sizes (N>50; see the bold contour line in Figure 4B and 4C) present statistically significant deviations from uniformity regardless of the noise. It is sensible that as the sample size increases, small deviations from uniformity are captured easier by the Rayleigh test. This effect was also observed in the analysis of the P value for different von Mises distributions, Figure 3B.

  3. Individual birds with high directedness (rindividual) did not always give high r and small P values in the following Rayleigh tests. In these cases, a high rindividual can be explained as an artefact from the small number of tests for the individual birds (Ntests<5 and rindividual>0.75, Figure 3C), which systematically yields an overestimation of the directedness due to the bias in the empirical estimator generally employed.

  4. There are statistically significant deviations from uniformity all along the spectrum of Ntests and rindividual cut-offs. While the example of erNMFs presented a well-defined region with significant deviations from uniformity (area within the red dashed contour lines where rindividual>0.25 and Ntests>5, see Figure 4B and 4C), other analysed groups displayed more spread statistical significant regions and patchier ranges in the cut-off parameters. Plots for all the eight analysis groups are included in the SI, see Figure S2.

The next analysis involves a comparison of the groups that presented statistical significant deviations from uniformity based on the Rayleigh tests and had the same size (number of birds). Ideally, the sample sizes should be the same when comparing different Rayleigh tests, so that all observations have the same weight [50]. This is not always possible due to practical and logistics issues during the conduct of the behavioural experiments, but the present large (meta-analysis) data set allowed such calculation for different sample sizes. Accordingly, we have calculated the group concentration parameter / directedness (r ) and the mean orientation of the directed birds for each group for different consistent sample sizes, see Figure 5 and 6, respectively. The same analysis was performed with different statistical significance levels (α = 0.1, 0.05 and 0.01) for determining deviations from uniformity, in an effort to assess the robustness of the analysis. The overall trends are maintained regardless of the significance levels, see Figure S3 in the SI.

Figure 5. Concentration parameter/directedness (r ) calculated by performing the Rayleigh test on the different bird groups as a function of the number of birds (N).

Figure 5

A: Mean r value for the 8 bird groups based on the cut-off grouping scheme and the P value from the Rayleigh tests. Each solid red (European robin) and black (Eurasian blackcap) line represents a different species or magnetic field permutation (NMF or CMF) during spring experiments, while the respective dashed lines correspond to autumn experiments. B: Mean r value for each bird species and comparison with von Mises distributions. C: Mean r value for motivated birds (birds that have passed the pre-selection tests). The r value was calculated by sampling randomly bird orientations within each group. The mean r was measured by repeating the sampling process 1,000 times for different N. The dots represent the reported r from the respective Emlen funnel experiments included in our sample. D: Comparison between the mean r values with von Mises distributions.

Figure 6. Mean bird orientations for each group considered in this study.

Figure 6

A: Density distribution of the mean bird orientation. The transparent red and black slices indicate the expected mean orientation of the European robins and Eurasian blackcaps based on known migration behaviour in the wild. This is shown for each season and magnetic field condition. The direction of the magnetic North in each case is highlighted with a red symbol on the outer circle. The density has been normalised for each bird group and condition, separately. B: Superposition of the bird preferred directions on the map to relate their capture and testing location in Oldenburg to their global migration habitat. The preferred direction of European robins and Eurasian blackcaps are shown in red and black, respectively.

The group directedness (r ) follows a decaying trend similar to the von Mises distributions, but the shape is markedly different, see Figure 5B. Sample sizes (N) around 5 give high r values (~0.8-0.9), however r is almost half when the sample size is increased to 15 or 20. Since the data were pooled from published studies, the r values of the motivated birds (birds that have passed the pre-selection tests) can be calculated compared with the r values from the cut-off grouping process, see Figure 5C and 5D. All r value lines obtained from motivated birds are more similar to the von Mises distributions, comparison shown in Figure 5D. However, the asymptotes of these lines are quite different among different bird groups, and they range from 0.35 to 0.7. Furthermore, the reported r values do not seem to follow the same trends, especially for Eurasian blackcaps, see Figure 5C. These differences could be attributed to selection based on a minimum required rindividual during the pre-tests or the small size of the pooled data that affects the mean r value calculation. It is clear from both analyses that the asymptotes at large sample sizes are close to the von Mises distributions with a large spread of data (k = 1), which is in line with the intrinsic noise of Emlen funnel data. The mean r with its standard deviation for each bird are shown in the SI, see Figure S4 and S5.

As for the mean orientation, there are groups of birds whose mean orientation is roughly the same between species and follows the seasonally appropriate direction (e.g. erNMFs or bcNMFs), but there are also groups that present large spread and deviations (e.g. erCMFs or bcCMFa), see Figure 6. The analysis of the mean bird orientations shows that in most cases the bird groups on average followed the seasonally appropriate orientation directions, see Figure 6B. However, the selection of directed birds based on solely the Ntests and rindividual can lead to artefacts. For example, the presence of birds in the bcCMFa group that were tested many times (large Ntests value), but eventually presented low directionality (small rindividual value) tend to dominate in the tests of statistical significance, see Figure S6 in the SI. This effect is more evident when the mean orientation is plotted as a function of the sample size (N), see Figure S7 in the SI.

In order to mathematically model how r varies as a function of the bird group sample size (N), we performed a curve fitting analysis, see Figure 7. An exponential function was chosen to represent the mean of the r profiles, and the least square method was used to obtain the best fitting curves for the r profiles. The fitted curves were compared to the published r values from the respective publications to evaluate the performance of our approach for determining directionality, see Figure 7. The superimposition of the modelled r values with the published data suggests that our approach captured the main trends well. The r profiles for European robins and Eurasian blackcaps are compared with the respective published r values of the oriented birds from the studies that we included in our meta-analysis [30, 32, 3436, 4346]. Two similar functions were obtained for the r profiles of the European robins and Eurasian blackcaps, Eq. (2) and (3), respectively.

rER=0.72e0.08N+0.34 (2)
rBC=0.78e0.08N+0.28 (3)

Figure 7.

Figure 7

Fitted mean r profiles as a function of the sample size and comparison with the published r values (triangles) from Emlen funnel experiments from the studies covered by our meta-analysis, European robins (A) and Eurasian blackcaps (B). The mean r values (dots) for each bird species has been fitted to an exponential function, Eq. (2) and (3).

From our fitted mean r profiles (see Figure 7) and the concentration parameter calculations (see Figure 5B and 5D), European robins and Eurasian blackcaps can both be expected to produce an r around 0.5 (assumed for illustrative simplicity in the following example). A range of sample sizes between 15-25 results in 95% angular confidence intervals (CI) of approximately ±44° to ±33° around the group mean orientation, respectively (see Figure 5.2.1 in Batschelet[50]). With a given r, we can calculate the angular variance s2 (s2 = 2 * (1 − r); Eq. 2.3.1 in Batschelet[50]) and the angular deviation (s(degrees)=(180×s2)/π; Eq. 2.3.3 in Batschelet[50]) around the group mean orientation. For our suggested sample size range with r = 0.5, the angular deviation would be ±57.3°. As these calculations are solely depending on r, they are identical over our suggested range (but they would of course vary for real data). These exemplary values can inform the planning of orientation experiments where two conditions should result in statistically significant differences.

Last, the constructed data set has been analysed without using Ntests and rindividual as cut-off values. Instead, the birds were grouped based on the Ntests and rindividual values. The concentration parameter (r ), statistical significant deviation from uniformity (P ), sample size (N) and mean orientation of these groups are included in the SI, see Figure S8. Again, we conclude that the group concentration parameter r is larger for the small sample sizes (N<5), similar to observations by Batschelet[50]. The probability criterion P is below the statistical significance level when rindividual>0.2 and 5<Ntests<15. In these regions, the mean orientation is usually close to the expected seasonally appropriate direction. Last, there are several outliers in mean orientation, which are mostly groups with small number of tests or small sample sizes.

4. Discussion

In this work, we have combined and analysed a large data set of behavioural experiments conducted with European robins and Eurasian blackcaps jumping in Emlen funnels under different magnetic conditions during spring and autumn migration. The jumping behaviour was recorded on sensitive paper that the birds scratched with their feet. This meta-analysis enabled us to calculate the average directionality range with respect to the magnetic field for each bird species during spring and autumn migration. Given that magnetic orientation experiments are laborious and challenging to design statistically, due to the high noise in the orientation data, we estimated the minimal sampling requirements for correctly interpreting bird orientation behaviour in Emlen funnels using the Rayleigh circular statistical analysis adopted by the field.

Our analysis highlights the relationship between the concentration parameter (r ) and the number of birds tested, the sample size (N). Testing a small number of birds is prone to give extremely large r values due to the bias inherent to the empirical estimator. Mathematically, we expect that observed empirical estimates r2^ for the r2 values develop as

r2^=1N+N1Nr2,

as shown by Kutil[59]. By solving this formula for r2, it is straightforward to devise a perfect bias correction and thus obtain an unbiased estimator for r2. Unfortunately, the same is not true for estimating r itself, for which no perfect bias corrections are known, although there are near-unbiased estimators for r, see Kutil[59]. In general, r is the main variable that is reported in the literature, so our goal was to show what would be a reasonable r range as a function of the sample size (N) for a large data set. Standard deviations and confidence intervals are usually calculated for a single group with a specific sample size by bootstrapping. r2 with the bias correction could be a better “concentration parameter”, because it removes the inherent sample size bias, but it has never been reported in Emlen funnel literature. Consequently, the benefit for the community is currently limited. The same goes for the z variable that can be obtained from the Rayleigh test. Studying the impact of using different metrics for directedness could be explored in the future, but it is beyond the scope of the current study. We suggest that future work should therefore consider adopting r2, together with its bias-corrected estimator, rather than r as a directedness measure. A comparison between r and r2 for samples from different von Mises distributions is shown in Figure S9 in the SI.

A successful strategy for reducing noise in orientation data is to remove non-migration motivated birds that show little activity or are not oriented in a consistent direction. In the directed bird groups selected via such pretests, we find that r shows the same trend in both species and demonstrates that the birds are indeed oriented. The same applies to the activity and concentration scores of birds in different conditions, Figure S10 in the SI. However, in both species the trends in r deviate from how r varies according to the widely adopted von Mises distributions, showing the underlying assumptions of the Rayleigh test are not strictly met in bird magnetic orientation behaviour. The trends in both bird groups are similar to the published values, which helps confirm that our approach captured the differences between directed and non-directed birds as reported before. However, we did note that the mean orientation does not always follow the expected seasonally appropriate migration orientation trend. This raises new research questions about why birds tested in Emlen funnels sometimes orient in a mean direction different from their expected orientation direction in the wild. These more fundamental questions cannot be resolved with a minimal cut-off requirement for the number of tests needed to correctly determine if birds are oriented using the Rayleigh test. Defining better generalizable pretest criteria, ideally grounded in the mathematics of the statistical model used, for discriminating motivated and unmotivated birds could maybe advance the field’s ability to analyze Emlen funnel data.

According to our statistical analyses and meta-analysis, ideal sample sizes for future Emlen funnel studies should range at least between 15-25 to maintain a small probability of statistical error for a species, regardless of the underlying data distribution. Unfortunately, it is extremely difficult to quantify a priori the number of motivated birds (or the fraction of motivated/unmotivated birds). There are several parameters that affect the motivation of birds to migrate and the definition of motivated birds can be arbitrary. However, having estimates for the minimum sample sizes, and sample sizes that meet the requirements for statistically significant results is very important during the design of the experiments. In order to provide better estimates for the necessary number of tests, we have looked into the number of motivated birds (rindividual>0.2) and the total number of tested birds for each season and condition, and we calculated the mean ratio over the years (2009-2023), see Table S1 in the SI. This analysis shows that the mean number of motivated birds is quite similar between European Robins and Eurasian Blackcaps regardless of the season. For a typical rindividual cut-off equal to 0.2, it suggests that testing 30-35 birds would give an average of 21-24 motivated birds (30-35 * 70%). However, the choice of the rindividual cut-off significantly affects this percentage. If we increase the cut-off to 0.3, the percentage goes to 54% and with a cut-off of 0.4 to 36%. Even though this analysis is crude (and the choice of the rindividual cut-off is arbitrary), it highlights the difficulty of determining the number of motivated birds at the beginning of a study.

Sample sizes of motivated birds between 15-25 can provide good estimates for bird directedness, the mean bird orientation and the angular deviation. In the context of magnetic compass orientation, this could be the difference in the group orientation in the NMF and CMF conditions. The above estimated 95% CI suggests that the expected difference in group orientation should be >88° apart (twice the 95% CI for N = 15; respectively 66° for N = 25), so that the 95% CIs do not overlap. The CMF should therefore be turned by at least 88°. However, considering the noisy nature of behavioural data, one should allow some room for variation. To accomplish such a “buffer”, we recommend to use twice the angular deviation as a suitable value to turn the magnetic field, in our example 120° (114.6° rounded to the next 10°).

A typical example where this recommendation was applied was included in our analyses [30]. There, the effect of electromagnetic noise on the magnetic compass of European robins was investigated. We only included data from control conditions without any additional test conditions in our analyses, i.e. data from the NMF and CMF conditions in spring and autumn of 2011. In spring, European robins under NMF conditions oriented towards the North, the appropriate spring migratory direction (Figure 3b in Engels et al.[30]). The group of birds responded to the CMF condition with a 101° counter clockwise adjustment of their mean orientation (Figure 3c[30]). The 95% CIs were well separated and had a gap of 25°. In the autumn of the same year, the group of birds oriented West-Southwest in the NMF condition and responded with a 151° counter clockwise turn to the CMF condition (Figure 4e[30]). Here, the gap between the 95% CIs was even larger with 82°. Although the adjustment to the 120° counter clockwise turned magnetic field (CMF) was not precisely 120° (spring: 101°; autumn: 151°), the separation between the 95% CIs was large enough to facilitate a clear distinction between the group orientations (spring: 25°; autumn: 82°). Notably, all four described conditions had a similar number of birds tested and comparable group r-values. The difference between NMF and CMF in the spring of 2011 highlights that the inclusion of a “buffer” in such experiments is highly recommended, as the 95% CIs otherwise likely would have overlapped (e.g. with a turn of the CMF by just 90°, instead of 120°).

There are, however, exceptions with atypically high r or numbers of individuals to detect smaller angular differences. In a well-known example, Helbig compared the orientation behaviour of a population of Eurasian blackcaps migrating on a south-western route with a population on a southeastern route [64]. The number of tested birds (Ntests = 15-20) were on the lower end of our suggested sample size range, but highly consistent in their group orientation. Unsurprisingly, the angular deviation and CI around the respective means did not overlap. The crossbred offspring of these populations displayed a preferred southern direction. The CI around the mean of the offspring group did not overlap with either the parental population. This clear differentiation vanished as the migratory season progressed and caused the orientations of parents and offspring to overlap. In a follow-up study, a parental population of Southwest-migrating blackcaps was crossbred with a population migrating west-ward, resulting in an offspring population orienting in a direction in-between the parental group directions [65]. It should be noted that the group r within the tested groups was much lower than in the previous study[64], but with larger group sample sizes undergoing a comparable number of tests per bird (Ntests = 13-18). One would expect that the 46° difference between the means of the parental populations would not facilitate a distinction between the two. However, while the angular deviations of approx. 54° was around the group means’ overlap, the CI of the parental populations did not, thus allowing a clear distinction. Both of these examples highlight that a low sample size with high r, based on sufficiently high numbers of individual tests (Ntests), can yield reliable results for a small expected difference[64, 65]. Furthermore, narrower ranges of differentiation can be achieved, given a larger sample size, despite an overall lower degree of consistency in the r of each group[64, 65]. A 120° difference in expected directions remains our recommendation for typical studies. However, many tests with each individual and a high number of tested birds can in some cases enable smaller angular differences to be detected.

Overall, our study constitutes the largest meta-analysis of laboratory-based data on the magnetic orientation of migratory birds. Our results provide a realistic range of values for the expected directness of birds in Emlen funnels that could be used as a reference for future studies.

Supplementary Material

Supporting Information

Acknowledgements

The authors would like to thank the Volkswagen Foundation (Lichtenberg Professorship to I.A.S.), the Deutsche Forschungsgemeinschaft (SFB 1372: Magnetoreception and Navigation in Vertebrates, no. 395940726 to I.A.S. and H.M.; EXC 3051: NaviSense, no. 533653176 to M.F., D.L., I.A.S. and H.M.; TRR386 Hyperpolarization in molecular systems HYP*MOL, no 514664767 to I.A.S., and FR 2715/6-1 PreCePT), and the Ministry for Science and Culture of Lower Saxony (Simulations Meet Experiments on the Nanoscale: Opening up the Quantum World to Artificial Intelligence (SMART) and Dynamik auf der Nanoskala: Von kohärenten Elemen-tarprozessen zur Funktionalität (DyNano)), the European Research Council (under the European Union’s Horizon 2020 research and innovation programme, grant agreement no. 810002, Synergy Grant: ‘QuantumBirds’, awarded to H.M.); Computational resources for the simulations were provided by the CARL Cluster at the Carl-von-Ossietzky University Oldenburg, which is supported by the DFG and the ministry for science and culture of Lower Saxony. The authors gratefully acknowledge the computing time made available to them on the high-performance computers HLRN-IV at GWDG at the NHR Centers NHR@Göttingen. These Centers are jointly supported by the Federal Ministry of Education and Research and the state governments participating in the NHR.

Footnotes

1

It may seem counterintuitive that a perfect bias correction for estimating r2 from samples is possible, yet not for estimating r itself. The reason is that for a random variable x, the expected value E(x2) of its square in general is different from the squared expected value (E(x))2 such that y generally fails to be a good estimate for x, i.e. yE(x) holds, even if yE(x2).

Availability

The data sets used in our study are included in the SI and they are publicly available at https://dare.uol.de/dataset.xhtml?persistentId=doi:10.57782/X2BBXC.

References

  • [1].Pedersen JB, Nielsen C, Solov’yov IA. Multiscale description of avian migration: from chemical compass to behaviour modeling. Scientific reports. 2016;6:36709. doi: 10.1038/srep36709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Solov’yov IA, Mouritsen H, Schulten K. Acuity of a cryptochrome and vision-based magnetoreception system in birds. Biophysical journal. 2010;99:40–49. doi: 10.1016/j.bpj.2010.03.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Ritz T, Adem S, Schulten K. A model for photoreceptor-based magnetoreception in birds. Biophysical journal. 2000;78:707–718. doi: 10.1016/S0006-3495(00)76629-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Hore PJ, Mouritsen H. The radical-pair mechanism of magnetoreception. Annual review of biophysics. 2016;45:299–344. doi: 10.1146/annurev-biophys-032116-094545. [DOI] [PubMed] [Google Scholar]
  • [5].Wong SY, et al. Navigation of migratory songbirds: a quantum magnetic compass sensor. Neuroforum. 2021;27:141–150. [Google Scholar]
  • [6].Mouritsen H. Sturkie’s avian physiology. Elsevier; 2022. Magnetoreception in birds and its use for long-distance migration; pp. 233–256. [Google Scholar]
  • [7].Wiltschko R, Wiltschko W. Magnetic orientation in animals. Berlin, Germany: 1995. [Google Scholar]
  • [8].Schneider WT, Holland RA, Lindecke O. Over 50 years of behavioural evidence on the magnetic sense in animals: what has been learnt and how? The European Physical Journal Special Topics. 2023;232:269–278. [Google Scholar]
  • [9].Rodgers CT, Hore PJ. Chemical magnetoreception in birds: the radical pair mechanism. Proceedings of the National Academy of Sciences. 2009;106:353–360. doi: 10.1073/pnas.0711968106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Wong SY, Wei Y, Mouritsen H, Solov’yov IA, Hore P. Cryptochrome magnetoreception: four tryptophans could be better than three. Journal of the Royal Society Interface. 2021;18:20210601. doi: 10.1098/rsif.2021.0601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Xu J, et al. Magnetic sensitivity of cryptochrome 4 from a migratory songbird. Nature. 2021;594:535–540. doi: 10.1038/s41586-021-03618-9. [DOI] [PubMed] [Google Scholar]
  • [12].Görtemaker K, et al. Direct interaction of avian cryptochrome 4 with a cone specific g-protein. Cells. 2022;11:2043. doi: 10.3390/cells11132043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Chetverikova R, Dautaj G, Schwigon L, Dedek K, Mouritsen H. Double cones in the avian retina form an oriented mosaic which might facilitate magnetoreception and/or polarized light sensing. Journal of the Royal Society Interface. 2022;19:20210877. doi: 10.1098/rsif.2021.0877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Mouritsen H, Feenders G, Liedvogel M, Wada K, Jarvis ED. Night-vision brain area in migratory songbirds. Proceedings of the National Academy of Sciences. 2005;102:8339–8344. doi: 10.1073/pnas.0409575102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Heyers D, Manns M, Luksch H, Güntürkün O, Mouritsen H. A visual pathway links brain structures active during magnetic compass orientation in migratory birds. PLoS One. 2007;2:e937. doi: 10.1371/journal.pone.0000937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Zapka M, et al. Visual but not trigeminal mediation of magnetic compass information in a migratory bird. Nature. 2009;461:1274–1277. doi: 10.1038/nature08528. [DOI] [PubMed] [Google Scholar]
  • [17].Zapka M, Heyers D, Liedvogel M, Jarvis ED, Mouritsen H. Night-time neuronal activation of cluster n in a day-and night-migrating songbird. European Journal of Neuroscience. 2010;32:619–624. doi: 10.1111/j.1460-9568.2010.07311.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Heyers D, et al. Morphology, biochemistry and connectivity of cluster n and the hippocampal formation in a migratory bird. Brain Structure and Function. 2022;227:2731–2749. doi: 10.1007/s00429-022-02566-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Kirschvink JL, Walker MM, Diebel CE. Magnetite-based magnetoreception. Current opinion in neurobiology. 2001;11:462–467. doi: 10.1016/s0959-4388(00)00235-x. [DOI] [PubMed] [Google Scholar]
  • [20].Uebe R, Schüler D. Magnetosome biogenesis in magnetotactic bacteria. Nature Reviews Microbiology. 2016;14:621–637. doi: 10.1038/nrmicro.2016.99. [DOI] [PubMed] [Google Scholar]
  • [21].Winklhofer M. Magnetoreception: a dynamo in the inner ear of pigeons. Current Biology. 2019;29:R1224–R1226. doi: 10.1016/j.cub.2019.10.022. [DOI] [PubMed] [Google Scholar]
  • [22].Nimpf S, et al. A putative mechanism for magnetoreception by electromagnetic induction in the pigeon inner ear. Current Biology. 2019;29:4052–4059. doi: 10.1016/j.cub.2019.09.048. [DOI] [PubMed] [Google Scholar]
  • [23].Emlen ST, Emlen JT. A technique for recording migratory orientation of captive birds. The Auk. 1966;83:361–367. [Google Scholar]
  • [24].Mouritsen H, Larsen ON. Migrating songbirds tested in computer-controlled emlen funnels use stellar cues for a time-independent compass. Journal of Experimental Biology. 2001;204:3855–3865. doi: 10.1242/jeb.204.22.3855. [DOI] [PubMed] [Google Scholar]
  • [25].Mouritsen H, Feenders G, Hegemann A, Liedvogel M. Thermal paper can replace typewriter correction paper in emlen funnels. Journal of Ornithology. 2009;150:713–715. [Google Scholar]
  • [26].Bianco G, et al. Emlen funnel experiments revisited: methods update for studying compass orientation in songbirds. Ecology and Evolution. 2016;6:6930–6942. doi: 10.1002/ece3.2383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Wiltzschko W, Wiltzschko R. Magnetic compass of european robins. Science. 1972;176:62–64. doi: 10.1126/science.176.4030.62. [DOI] [PubMed] [Google Scholar]
  • [28].Mouritsen H. Redstarts, phoenicurus phoenicurus, can orient in a true-zero magnetic field. Animal behaviour. 1998;55:1311–1324. doi: 10.1006/anbe.1997.0696. [DOI] [PubMed] [Google Scholar]
  • [29].Ritz T, Thalau P, Phillips JB, Wiltschko R, Wiltschko W. Resonance effects indicate a radical-pair mechanism for avian magnetic compass. Nature. 2004;429:177–180. doi: 10.1038/nature02534. [DOI] [PubMed] [Google Scholar]
  • [30].Engels S, et al. Anthropogenic electromagnetic noise disrupts magnetic compass orientation in a migratory bird. Nature. 2014;509:353–356. doi: 10.1038/nature13290. [DOI] [PubMed] [Google Scholar]
  • [31].Ritz T, et al. Magnetic compass of birds is based on a molecule with optimal directional sensitivity. Biophysical journal. 2009;96:3451–3457. doi: 10.1016/j.bpj.2008.11.072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Schwarze S, et al. Weak broadband electromagnetic fields are more disruptive to magnetic compass orientation in a night-migratory songbird (erithacus rubecula) than strong narrow-band fields. Frontiers in behavioral neuroscience. 2016;10:55. doi: 10.3389/fnbeh.2016.00055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [33].Pakhomov A, et al. Very weak oscillating magnetic field disrupts the magnetic compass of songbird migrants. jr soc interface. 2017;14:20170364. doi: 10.1098/rsif.2017.0364. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Kobylkov D, et al. Electromagnetic 0.1–100 khz noise does not disrupt orientation in a night-migrating songbird implying a spin coherence lifetime of less than 10 μs. Journal of the Royal Society Interface. 2019;16:20190716. doi: 10.1098/rsif.2019.0716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Leberecht B, et al. Broadband 75–85 mhz radiofrequency fields disrupt magnetic compass orientation in night-migratory songbirds consistent with a flavin-based radical pair magnetoreceptor. Journal of Comparative Physiology A. 2022;208:97–106. doi: 10.1007/s00359-021-01537-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Leberecht B, et al. Upper bound for broadband radiofrequency field disruption of magnetic compass orientation in night-migratory songbirds. Proceedings of the National Academy of Sciences. 2023;120:e2301153120. doi: 10.1073/pnas.2301153120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].Schmaljohann H, Eikenaar C, Sapir N. Understanding the ecological and evolutionary function of stopover in migrating birds. Biological Reviews. 2022;97:1231–1252. doi: 10.1111/brv.12839. [DOI] [PubMed] [Google Scholar]
  • [38].Cherry JD, Able KP. An alternative method for the analysis of emlen funnel data. The Auk. 1986;103:225–227. [Google Scholar]
  • [39].Muheim R, Henshaw I, Sjöberg S, Deutschlander ME. Birdoritrack: a new video-tracking program for orientation research with migratory birds. Journal of Field Ornithology. 2014;85:91–105. [Google Scholar]
  • [40].Franz MO, Mallot HA. Biomimetic robot navigation. Robotics and autonomous Systems. 2000;30:133–153. [Google Scholar]
  • [41].Milford M, Schulz R. Principles of goal-directed spatial robot navigation in biomimetic models. Philosophical Transactions of the Royal Society B: Biological Sciences. 2014;369:20130484. doi: 10.1098/rstb.2013.0484. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Gao Z, Shi Q, Fukuda T, Li C, Huang Q. An overview of biomimetic robots with animal behaviors. Neurocomputing. 2019;332:339–350. [Google Scholar]
  • [43].Hein CM, Engels S, Kishkinev D, Mouritsen H. Robins have a magnetic compass in both eyes. Nature. 2011;471:E1. doi: 10.1038/nature09875. [DOI] [PubMed] [Google Scholar]
  • [44].Engels S, Hein CM, Lefeldt N, Prior H, Mouritsen H. Night-migratory songbirds possess a magnetic compass in both eyes. 2012 doi: 10.1371/journal.pone.0043271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].Lefeldt N, Dreyer D, Schneider N-L, Steenken F, Mouritsen H. Migratory blackcaps tested in emlen funnels can orient at 85 degrees but not at 88 degrees magnetic inclination. Journal of Experimental Biology. 2015;218:206–211. doi: 10.1242/jeb.107235. [DOI] [PubMed] [Google Scholar]
  • [46].Schwarze S, et al. Migratory blackcaps can use their magnetic compass at 5 degrees inclination, but are completely random at 0 degrees inclination. Scientific Reports. 2016;6:33805. doi: 10.1038/srep33805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Adriaensen F, Dhondt AA. Population dynamics and partial migration of the european robin (erithacus rubecula) in different habitats. The Journal of Animal Ecology. 1990:1077–1090. [Google Scholar]
  • [48].Berthold P, Fiedler W, Querner U. Migratory restlessness or zugunruhe in birds—a description based on video recordings under infrared illumination. Journal für Ornithologie. 2000;141:285–299. [Google Scholar]
  • [49].Åkesson S, Ilieva M, Bianco G. Flexibility and control of circadian activity, migratory restlessness and fueling in two songbird migrants. Frontiers in Ecology and Evolution. 2021;9:666176 [Google Scholar]
  • [50].Batschelet E. Mathematics in biology. Academic Press; 1981. Circular Statistics in Biology. URL https://books.google.de/books?id=RJlYwAEACAAJ. [Google Scholar]
  • [51].Landler L, Ruxton GD, Malkemper EP. Circular data in biology: advice for effectively implementing statistical procedures. Behavioral ecology and sociobiology. 2018;72:1–10. doi: 10.1007/s00265-018-2538-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [52].Wynn J, et al. Naive songbirds show seasonally appropriate spring orientation in the laboratory despite having never completed first migration. Biology Letters. 2023;19:20220478 [Google Scholar]
  • [53].Kishkinev D, et al. Experienced migratory songbirds do not display goal-ward orientation after release following a cross-continental displacement: an automated telemetry study. Scientific Reports. 2016;6:37326. doi: 10.1038/srep37326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [54].Chernetsov N, Pakhomov A, Davydov A, Cellarius F, Mouritsen H. No evidence for the use of magnetic declination for migratory navigation in two songbird species. PLoS One. 2020;15:e0232136. doi: 10.1371/journal.pone.0232136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [55].Pakhomov A, Prokshina A, Cellarius F, Mouritsen H, Chernetsov N. Access to the sky near the horizon and stars does not play a crucial role in compass calibration of european songbird migrants. Journal of Experimental Biology. 2022;225:jeb243631. doi: 10.1242/jeb.243631. [DOI] [PubMed] [Google Scholar]
  • [56].Myors B, Murphy KR, Wolach A. Statistical power analysis: A simple and general model for traditional and modern hypothesis tests. Routledge; 2010. [Google Scholar]
  • [57].Cohen J. Statistical power analysis for the behavioral sciences. Routledge; 2013. [Google Scholar]
  • [58].Serdar CC, Cihan M, Yücel D, Serdar MA. Sample size, power and effect size revisited: simplified and practical approaches in pre-clinical, clinical and laboratory studies. Biochemia medica. 2021;31:27–53. doi: 10.11613/BM.2021.010502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [59].Kutil R. Biased and unbiased estimation of the circular mean resultant length and its variance. Statistics. 2012;46:549–561. [Google Scholar]
  • [60].Humphreys RK, Ruxton GD. Consequences of grouped data for testing for departure from circular uniformity. Behavioral Ecology and Sociobiology. 2017;71:1–7. doi: 10.1007/s00265-017-2393-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [61].Landler L, Ruxton GD, Malkemper EP. The hermans–rasson test as a powerful alternative to the rayleigh test for circular statistics in biology. BMC ecology. 2019;19:1–8. doi: 10.1186/s12898-019-0246-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Landler L, Ruxton GD, Malkemper EP. Model selection versus traditional hypothesis testing in circular statistics: a simulation study. Biology open. 2020;9:bio049866. doi: 10.1242/bio.049866. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [63].Landler L, Ruxton GD, Malkemper EP. The multivariate analysis of variance as a powerful approach for circular data. Movement Ecology. 2022;10:21. doi: 10.1186/s40462-022-00323-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [64].Helbig AJ. Inheritance of migratory direction in a bird species: a cross-breeding experiment with se-and sw-migrating blackcaps (sylvia atricapilla) Behavioral Ecology and Sociobiology. 1991;28:9–12. [Google Scholar]
  • [65].Helbig AJ, Berthold P, Mohr G, Querner U. Inheritance of a novel migratory direction in central european blackcaps. Naturwissenschaften. 1994;81:184–186. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Data Availability Statement

The data sets used in our study are included in the SI and they are publicly available at https://dare.uol.de/dataset.xhtml?persistentId=doi:10.57782/X2BBXC.

RESOURCES