Abstract
Spatial patterns of stable isotopes in animal tissues or “isoscapes” can be used to investigate animal origins in a range of ecological and forensic investigations. Here, we developed a feather hydrogen isotope (δ2Hf) isoscape for Brazil based on 192 samples of feathers from the family Thraupidae from scientific collections. Raw values of δ2Hf ranged from -107.3 to +5.0‰, with higher values at the Caatinga biome (northeast Brazil) and lower values at the Amazon and Pantanal. A Random Forest (RF) method was used to model the spatial surface, using a range of environmental data as auxiliary variables. The RF model indicated a negative relationship between δ2Hf and Mean Annual Precipitation, Precipitation in the Warmest Quarter, and Annual Temperature Range and positive relationships for amount-weighted February-April precipitation δ2H (δ2Hp(Feb-April)) and Mean Annual Solar Radiation. Modelled δ2Hf values ranged from -85.7 to -13.6‰. Ours is the first δ2Hf isoscape for Brazil that can greatly assist our understanding of both ecological and biogeochemical processes controlling spatial variation in δ2H for this region. This isoscape can be used with caution, due to its poor predictive power (as found in other tropical regions) and can benefit from new sample input, new GNIP data, ecological and physiological studies, and keratin standard material better encompassing the range in feather samples from Brazil. So, we encourage new attempts to build more precise feather H isoscapes, as well as isoscapes based on other elements.
Introduction
The world is rapidly losing biodiversity [1] due to human activities that include habitat loss, direct persecution and climate change [2, 3], an epoch defined as the Anthropocene. One factor of great importance is the increased illegal trade in wild species, especially birds [4]. Unfortunately, this activity is tremendously difficult to control and it is usually impossible to define the provenance of seized material. However, the development of isotopic techniques to infer origins of birds and other animals through the isotopic measurement of feathers or other tissues shows considerable promise as a forensic tool to assist with investigations of illegally traded birds and other wildlife [5]. Such tools are urgently needed in Brazil because this country holds huge biodiversity and deals with high illegal wildlife trade [6]. A first step in establishing such isotopic tools is the creation of predicted tissue-specific isotopic patterns or isoscapes to form the basis of provenance tracking.
Hydrogen and oxygen have well established isotopic patterns across continents [7] as demonstrated by δ2H and δ18O models from precipitation based primarily on the >60-year dataset from the International Atomic Energy Agency (IAEA), the Global Network of Isotopes in Precipitation (GNIP) [8]. Hydrogen in metabolically inactive tissues like feather keratin is fixed from drinking water and diet, with subsequent isotopic changes associated with metabolism. Feather δ2H is influenced, then, by diet composition, body size, and spatial environmental gradients in food web δ2H [9]. Because H in food webs is ultimately derived from environmental waters used by plants, the GNIP-based precipitation isoscapes provide a means of deriving tissue isoscapes of consumers providing appropriate linkages between the tissue of interest and environmental waters (reviewed in [5]). The power of such predictive tools will depend on how well environmental isoscapes are known for a particular region and how precisely tissue δ2H values can be linked to them.
Recent development of isotopic techniques to infer origins of birds and other animals have included more refined regional isoscapes [10–14] and used advanced statistical and artificial intelligence methodologies for isoscape building [15, 16]. Feather δ2H isoscapes are available today, ranging from continental models [17, 18] to country focused models [19, 20]. In addition, for birds, such isoscapes have been developed for specific foraging or migratory guilds [21]. For several animal taxa, tissue-specific isoscapes have been successfully built and used to determine the probabilistic origin of migratory species [5] such as monarch butterflies (Danaus plexippus [22]), dragonflies [23], and several birds [24, 25], including Neotropical migrants [21, 26–28].
Forensic applications based on stable isotope measurements are numerous [29–31]. In Brazil, isotopic forensic investigations have contributed to the understanding of the origin of seized marijuana [32, 33] and the patterns of human diet [34]. Sena-Souza et al. [35] presents an overview of isoscapes in the Brazilian context, exploring potential applications for understanding biogeochemical cycle mechanisms and processes in Brazil, as well as the potential application to solve crimes and track drug and illegal animal trade. However, few forensic studies have formally used large-scale isoscapes to predict animal origins.
Here, we established a feather δ2H isoscape for Brazil based on existing knowledge of precipitation-based δ2H models, environmental variables, and known-origin feathers from museum collections. Our goal was to derive a useful forensic tool that best fits the precipitation and feather isotope data based on a machine learning approach.
Materials and methods
Feather collection
Feathers were sampled from non-migratory passerine birds held in ten scientific ornithological collections in Brazil, and also from field campaigns between 2016–2018 in different Brazilian National Parks, as part of a Brazilian isotope forensic project (Edital CAPES 25/2014 –Pró-Forenses). Our dataset comprised 192 samples distributed among 129 locations in Brazil and included 49 species. We chose species belonging to six most representative sub-families of Thraupidae due to the wide range of distribution of this family in Brazil and its prominent use in the illegal bird trade [36]. We also included two specimens from family Fringillidae (genus Euphonia) and two from family Cardinalidae (genus Cyanoloxia). Taxonomic classification follows the Brazilian Ornithological Records Committee [37].
We prioritized specimens collected since 2013 and the use of flight feathers. Body and tail feathers were included only when flight feathers were not allowed from collections. Wing primaries or secondaries are more associated with a specific molting period, but, if grown simultaneously, body and flight feathers are expected to have similar δ2Hf values [38]. Due to the regional scarcity of samples collected within our target period, we also included a few feathers collected before 2013 (but after 2006). Sample information (species, year, locality, coordinates) is available as S1 Table.
Most birds in Brazil molt their remiges and rectrices following reproduction, showing little or no overlap between reproduction and molt, especially in Thraupidae and generally in frugivores [39, 40]. The reproductive season is mostly concentrated at the end of the dry season and the beginning of the rainy season; the molt is expected to occur in the middle to the end of the rainy season [40]. Molt duration seems to be longer in tropical regions, where it can take 100–230 days [41, 42], while lasting 42–105 days in temperate regions.
The procedures described here were approved by the Animal Use and Ethics Committee–University of Brasília (55712/2016). Field collection permits were done under the Chico Mendes Institute for Biodiversity Conservation’s license (SISBIO 8745–1). Feathers were transported from Brazil to Canada for stable isotopes analysis under CITES license (144180 and 144541). All Genetic Patrimony Access was also registered under the SISGEN system (#A4C6D05).
Isotopic analysis of feathers
Feathers were steam-dried at 100°C for 20 min to meet requirements for international transportation to Canada. Before isotopic analysis, feathers were soaked in a 2:1 chloroform:methanol solution overnight, drained and air-dried under a fume hood. Distal feather vane was cut, weighed (~ 0.35 mg) into silver capsules, crushed, and placed into a Uni Prep carousel (Eurovector, Milan, Ital) at 60°C. The carousel was evacuated, flushed, and held under pressurized helium flow and samples were combusted in a Eurovector 3000 elemental analyzer using a glassy carbon reactor held at 1350°C. Resultant H2 gas was analyzed on a coupled Thermo Delta V Plus (Thermo, Bremen, Germany) continuous-flow isotope-ratio mass spectrometer at the University of Western Ontario LSIS-AFAR laboratory. Stable-hydrogen isotope results are reported for the non-exchangeable H fraction using the comparative equilibration method of Wassenaar and Hobson [43] based on within run (n = 5 each) measurements of CBS (-197‰) and KHS (-54.1‰) keratin standards. All results are reported in standard delta notation relative to the VSMOW (Vienna Standard Mean Ocean Water) standard scale. Based on within-run replicate measurements of standards, we estimate our measurement precision to be ~ ± 2‰.
Exploratory analysis
Aiming to understand how sample characteristics affected our dataset, we ran exploratory analyses evaluating whether δ2Hf are affected by year of sample collection, feather type (body, tail or wing), biome where sample was collected, sub-family to which the species belong, latitude, and longitude (S1 Table).
The whole dataset was tested for normality and the variables year of collection, feather type, biome, and sub-family were independently tested through analysis of variance followed by a Tukey’s HSD (multiple comparisons of means). Latitude and longitude were tested through simple linear models. Additionally, year of collection was also tested through a linear model, where model residuals were also explored.
Subsequentially, we also ran a linear model selection (LM), with all variables, to explore variables’ importance to explain the measured δ2Hf values. We used “dredge” and “model average” functions to summarize the best models, ranking them by increasing Akaike’s Information Criteria (AICc) and considering models within ΔAIC < 2 as competitive. To run this analysis we used the package ‘MuMIn’ [44] available in software R [45].
The R code used and the results for each analysis are available as S1 File. Figures were edited using Inkscape free software [46].
Isoscape modeling
Our Brazilian δ2Hf isoscape was modeled using a Random Forest analysis approach. First, using the “Recursive Feature Elimination” method, we selected environmental variables that most contributed to the prediction of δ2Hf values. Then, using the selected variables, we ran the Random Forest analysis to predict the best-fitting isoscape.
The R script used here was adapted from Bataille et al. [16] and Sena-Souza et al. [15]. Statistical analyses were performed in R version 4.1.3 [45] as well as RStudio interface [47]. Packages used were ‘raster’ [48], ‘sf’ [49], ‘dismo’ [50], ‘caret’ [51, 52], ‘rgdal’ [53], and ‘randomForest’ [54]. Plots and figures were built using package ‘ggplot2’ [55] and ‘pdp’ [56] in R, but also Inkscape [46] and Qgis [57] free software.
Environmental variables
We used data for temperature, precipitation, solar radiation, wind speed, water vapor pressure and altitude from WorldClim database [58]; and data for humidity and potential evapotranspiration, for the period of 1901–2019, was used from Climatic Research Unit (CRU) database [59].
The precipitation and the maximum and minimum temperature were calculated for the period of 2007–2018, and the bioclimatic variables for this period were calculated using package ‘dismo’[50]. The R script used here is available as S2 File. The mean annual solar radiation, wind speed and water vapor pressure were used for the period of 1970–2000, as this is the only period available in WorldClim database and recent tests confirm the reliability of this approach as recent proxies [60].
We also used data from Waterisotopes database [61], corresponding to isotopic composition of monthly amount-weighted precipitation (δ2Hp) [62]. The correlation between δ2Hp and δ2Hf in Brazil (r2 = 0.167) was low when compared to the same correlation in North America (r2 = 0.83; [63]). In temperate regions, this correlation is usually higher when applied to the ‘growing season’ values, representing the isotopic precipitation values in the months when temperature is greater than 0º C. As temperatures in Brazil are rarely below 0º C, we investigated a timeframe that better represented H incorporation into food webs relevant to feather growth. For this reason, we selected the timeframe from February to April (amount-weighted February-April precipitation δ2H; δ2Hp(Feb-April)), which gave us the best fit for δ2Hf observed data (r2 = 0.252). The idea of joining the more representative months is consistent with the idea of WorldClim bioclimatic variables, where year-quarters (3 subsequent months) are used as units of analysis. The criteria and R code used to define this timeframe is available as S4 Fig and S4 Table in S3 File. The R code used to extract the bioclimatic values for sample points, all the raster images, and the values for each variable are available as S4 File.
Variable selection
Before applying a Random Forest analysis, we used a Recursive Feature Elimination (RFE) method to select a reduced number of variables most relevant in predicting our δ2Hf values from our known-origin feathers. This approach reduces the collinearity between variables which would result in overfitting [64]. Here, we used function “rfFuncs”, method “cv”, and 10-fold cross-validation as outer sampling method. We started with a subset including all possible variables. The best combination of variables was considered for the model with the lowest Root Mean Squared Error (RMSE) after cross-validation.
Random Forest
Random Forest is a machine-learning model algorithm based on decision trees that are merged into a single prediction, reducing noise and increasing accuracy [65]. Here, the RF model was run using a training subset corresponding to 80% of samples for the ensemble learning and a holdout subset corresponding to 20% to test the model [51]. The model randomly selects a set of variables to be used in each tree split or node, and this parameter (mtry) was defined as mtry = 3, based on out-of-bag error and avoiding mtry = 1 due to overfitting (see S4 File for details). The importance of each covariate to the final model was assessed by the percentage increase in the Mean Standard Error (%incMSE), indicating how much of the model accuracy is lost when that variable is removed. Each variable’s influence on the δ2Hf was assessed using partial dependence plots (package ‘pdp’ [56] and ‘ggplot2’ [55]).
Model validation
Model performance was evaluated through linear regressions between observed δ2Hf values and predicted δ2Hf values, for the following datasets: (A) training dataset after 10-fold cross-validation and (B) testing dataset (20% of data). For each linear regression we compared the Mean Absolute Error (MAE = mean of model residuals2), the Root Mean Squared Error (RMSE = the root squared of MAE), and the r-squared (r2).
Spatial prediction
Spatial predictions were conducted using the function “predict” available in the ‘raster’ package [48]. The final isoscape was obtained from a mean of the spatial predictions from 20 RF models using the same 80% dataset previously defined. The R code and raw results are available as S4 File.
Results
Values of δ2Hf ranged from -107.3‰ to +5.0‰ (N = 192; Fig 1). The higher values were found in northeast region (Caatinga biome) and the lower values were found in the northwest region (Amazon and Pantanal biomes).
Fig 1. Raw observed δ2Hf values (‰ VSMOW).
Biome limits and names are indicated (Biome limits from Instituto Brasileiro de Geografia e Estatística–IBGE—https://www.ibge.gov.br/geociencias/downloads-geociencias.html).
Exploratory analysis
Using individual exploration of each variable through analysis of variance and linear models, the Caatinga biome had higher δ2Hf values than all other biomes (Tukey’s HSD: p < 0.001 for all comparisons), followed by Atlantic Forest values that were also higher than values found in all other biomes (Tukey’s HSD: p < 0.01 for all comparisons) (Fig 2A and S1 File). Samples from Amazon, Cerrado, Pantanal and Pampas had lower values, but they were not significantly different from each other. Feather types presented no significant differences for δ2Hf values (Fig 2B and S1 File), although such differences in values for wing and body feathers were close to significance (Tukey’s HSD: p = 0.052). Overall, most subfamilies did not differ in δ2Hf (Fig 2C and S1 File), but subfamily Diglossinae had lower δ2Hf values than the subset including ‘other’ less represented subfamilies, and the subfamilies Tachyphoninae and Thraupinae (Tukey’s HSD: p < 0.001 for all comparisons). The most sampled years (2013–2018) were homogeneous in δ2Hf but 2009 had higher values than years 2013–2019 (Fig 2D and S1 File). The years 2007–2012 represented only 17 samples (11% of total samples). Due to this difference, variable year was further explored with an additional linear model and exploration of residuals. The distribution of residuals was normal (Shapiro Test: W = 0.99, p = 0.20) and residuals from year 2009 were not out of the residual range. For this reason, samples from 2009 were kept in the following isoscape modeling. Latitude and longitude were positively related to δ2Hf values, but only longitude was significantly related to δ2Hf values (Tukey’s HSD: p < 0.001; Fig 2E and 2F and S1 File), with coastal longitudes corresponding to higher values of δ2Hf.
Fig 2. Effect of variables on the raw observed δ2Hf values (‰ VSMOW).

(A) Biomes where the sample was collected; (B) Feather type; (C) Bird’s subfamilies; (D) Year when sample was collected; (E) Longitude; and (F) Latitude. The p values refer to individual analyses of variance followed by Tukey’s HSD or to linear models (latitude and longitude).
According to our exploratory model selection, all included covariables were important to explain our δ2Hf values. Variable biome, subfamily, and longitude, were included in all models with ΔAIC < 2 (Table 1). After an average of models with ΔAIC < 2, the final conditional average model indicates biome Caatinga (p = 0.004), feather from wing (p = 0.03), and longitude (p = 0.003) as significantly influencing our δ2Hf values (S1 File).
Table 1. Exploratory model selection results for observed δ2Hf values.
Showing the more competitive models and their degrees of freedom, AICc, ∆AIC and weight.
| LM full model: δ2Hf ~ Year + Feather type + Longitude + Latitude + Biome + Subfamily | ||||
|---|---|---|---|---|
| Selected models | df | AICc | ∆AIC | Weight |
| Biome + Long + Subfamily + Feather type | 16 | 1561.6 | 0.00 | 0.230 |
| Biome + Long + Subfamily + Feather type + Lat | 17 | 1562.3 | 0.63 | 0.168 |
| Biome + Long + Subfamily | 14 | 1562.5 | 0.85 | 0.150 |
| Biome + Long + Subfamily + Year | 15 | 1563.1 | 1.52 | 0.108 |
| Biome + Long + Subfamily + Feather type + Year | 17 | 1563.5 | 1.91 | 0.088 |
Only models with ∆AIC < 2 were considered competitive.
Isoscape modeling
Recursive feature elimination
Through the RFE method, we selected the environmental variables predicting the δ2Hf values from our known-origin feathers. The model presenting the lowest RMSE after cross-validation included the following five environmental variables: Annual Temperature Range (Bio7), Mean Annual Precipitation (Bio12), Precipitation at the Warmest Quarter (Bio18), Mean Annual Solar Radiation, and δ2Hp(Feb-April).
Random Forest
Mean Annual Precipitation was the most important predictor for our known-origin feather δ2Hf values, and its removal from the model caused an ~20% increase in the Mean Standard Error (MSE). The δ2Hp(Feb-April) was the second most-important variable (~15%), followed by Precipitation at the Warmest Quarter (~13%), Mean Annual Solar Radiation (~7%), and Annual Temperature Range (~5%).
Each variable had a different influence on δ2Hf values. We observed each of these relationships through the partial plots (Fig 3). Mean Annual Precipitation (MAP) varied from 383–3221 mm and most of our samples were concentrated in areas holding 1000–2000 mm of rain (annual mean). MAP was negatively related to δ2Hf (Fig 3A), as sites with MAP lower than 1000 mm had higher δ2Hf values while sites with MAP higher than 2000 mm had lower values. The isotopic precipitation values (δ2Hp(Feb-April)) varied from -5.2 to -59.4‰ and our samples were more evenly distributed for this environmental variable. The δ2Hp(Feb-April) was positively related to the observed δ2Hf values (Fig 3B).
Fig 3. Influence of selected variables to feather δ2H prediction.
Partial plots showing the influence of the five selected variables to δ2Hf (‰ VSMOW) prediction using the Random Forest algorithm and each associated linear regression, with corresponding adjusted r2 values (upper smaller plot). (A) Mean Annual Precipitation; (B) δ2Hp mean values from Feb-April; (C) Precipitation at the Warmest Quarter; (D) Mean Annual Solar Radiation; (E) Annual Temperature Range.
The Precipitation at the Warmest Quarter (PWQ) varied from 50.8–762.1 mm and our samples were concentrated in areas with 200–450 mm of rain at the warmest period of the year. The PWQ was negatively related to δ2Hf (Fig 3C), as sites with higher PWQ had lower δ2Hf values. The Mean Annual Solar Radiation (MASR) varied from 0.81–3.78 kJ/m2/day and our samples were concentrated in areas with 1–2 kJ/m2/day. The MARS was positively related to δ2Hf (Fig 3D), as sites with higher solar radiation incidence had higher δ2Hf values. Finally, the Annual Temperature Range (ATR) varied from 8–24 ºC and was negatively related to δ2Hf values (Fig 3E).
Model validation
After 10-fold cross-validation, the predicted δ2Hf values explained 25.6% (F1,778 = 270.1; p < 0.001) of the variance in observed values, with a MAE of 231.21 and a RMSE of 15.21 (Fig 4A). For model-predicted δ2Hf and the holdout testing data (n = 36), the predicted values explained 90.5% (F1,34 = 337,3; p < 0.001) of the variation, with a MAE of 34.73 and a RMSE of 5.89 (Fig 4B). For both validations, slope was significantly different from 1 and intercept was different from 0 (p < 0.01; See S4 File for more information).
Fig 4. Observed vs. predicted δ2Hf values.

Scatter plots of observed vs. predicted δ2Hf (‰ VSMOW) values. (A) training dataset after 10-fold cross-validation (MAE = 231.21, RMSE = 15.21); and (B) holdout testing dataset (MAE = 34.73, RMSE = 5.89). Regression equation and r2 also provided. The gray line corresponds to slope 1 and intercept 0.
Spatial prediction
After running the 20 RF models, the final mean spatial prediction (isoscape) represented a δ2Hf range between -85.7 and -13.6‰ (Fig 5).
Fig 5. Isoscape.
Final spatial prediction of δ2Hf (‰ VSMOW) for Brazil after 20 Random Forest models, based on Mean Annual Precipitation, Precipitation at the Warmest Quarter, Mean Annual Solar Radiation, δ2Hp from Feb-April, and Annual Temperature Range. Biome limits from Instituto Brasileiro de Geografia e Estatística–IBGE; Spatial variables from WorldClim database.
Discussion
Here we developed the first feather hydrogen isoscape for Brazil based on an extensive analysis of available climatic data using a machine learning approach. Besides its novelty, this is the first such attempt in the Brazilian territory and we acknowledge the need for further improvements. The isoscape presented here has a low predictive power when compared to isoscapes developed for temperate regions. Possible reasons for such low predictability are (1) the poor modelled precipitation isoscape for Brazil due to low GNIP station coverage, (2) the complex climate and hydrology of Brazil, (3) the relatively low sample sizes of birds when compared to other works in temperate zones, (4) the complexities related to species ecology, migration and molt strategies, and (5) the poor ability to evaluate over which period of integration of H from precipitation supports local food webs.
The GNIP coverage in Brazil accounted for 15 to 20 stations between the 1980’s and 1990’s. However, those stations were deactivated and only one or two stations operated between 2008 and 2013 [66, 67]. Since 2018, an international initiative and national task force has installed 10 stations and plans to install 12 more [68]. This initiative will certainly provide a more reliable database for future studies. It is important to mention that even with these 22 stations in operation, our dataset still comprises an important number of independent locations (129), adding important isotopic information to the Brazilian context.
Brazil is a continental country, holding a land surface of 8.5 million km2 and with a complex climatic and hydrological structure. The main air masses influencing the climate are the Equatorial Atlantic, the Tropical Atlantic, and the Polar Atlantic air masses [69]. All of this complexity brings a high spatial and seasonal variability to the isotopic composition of meteoric precipitation [66]. Although complex, the effect of isotopic variability in precipitation might be reduced if biomes or geographical regions are analyzed separately (ex. [19, 70]).
When avian sample size is compared to studies conducted in North America (n = 461 samples in [19]; and n = 544 for calibration and n = 269 for validation in [18]), our sample size is relatively low and this might be responsible for part of the reduced predictability. Main reasons for this low sample size are the poor availability of collected specimens over the country for recent years and the low willingness of curators from some Brazilian scientific collections to provide samples. Scientific collections (especially museums) hold a very important set of taxonomical information, but these collected birds could serve additional uses as we have demonstrated through the provision of a small feather sample per individual. In this way, Brazilian science urges curators of scientific collections to be more flexible in making materials available to the scientific community (see Avisample.net [71]). Another observed caveat that can also be considered is support for the establishment of long-term avian banding stations in Brazil, in order to both provide feather samples and more information about migratory patterns. Nevertheless, we also support any fieldwork involving bird capture that could include feather collection in its activities (under the appropriated licenses). If these feathers are well identified (species name, precise coordinates) and deposited in scientific collections, they can be a great source of information for future isoscape development.
Although ornithological knowledge has grown in Brazil in the last decades, there is still a huge lack of basic ecological information for several species, including information related to molt strategies that are important here. This caveat also reflects on the state of physiological knowledge, resulting in a poor ability to evaluate over which period H from precipitation is integrated into local food webs. That is, it is currently unclear how birds integrate water into feather in tropical environments and in the different biomes found in Brazil. This knowledge can be established experimentally or further investigated by the collection and analysis of feathers close to the new GNIP stations in Brazil, providing new insights to unravel this relationship.
Another important caveat is the absence of a keratin standard material for higher δ2H values. Here we used the two recognized standards (CBS: -197.00 ± 0.9‰ and KHS: -54.1 ± 1.1‰; [72]) for keratinous tissues, but our sample values went beyond the available more positive standard values, raising the need for the development of a new standard to be used on future researches aiming to explore areas with less negative isotopic values. Our highest values came from the Northeast region, where several domestic animals, such as goats and donkeys, have keratinized tissue, and could be good candidates to provide reference material. Nonetheless, since the calibration relationship is expected to be linear over a broad range, we assume this effect was minor.
Exploratory analysis
While previous global models strongly account for latitudinal variation, the geospatial structure of δ2Hf modelled for Brazil varied longitudinally, with values decreasing from east to west resulting in feathers enriched in 2H on the northeastern coast and more depleted westward. This reflects the atmospheric circulation over Brazil that predominantly brings rainfall from the east coast to the interior [69].
Feather type, avian subfamily, and year of collection introduced variability to our δ2Hf dataset. Wing feathers (i.e. primaries, secondaries) are preferred over body and tail feathers as they are less likely to be lost and replaced over periods that do not represent the annual molt. As expected, body feathers presented higher variability than tail and wing feathers. The most distinguishable subfamily in our sample was Diglossinae, which comprises a highly trafficked species (Sicalis flaveola). The observed differences among years were not taken here as a concern, since they can be associated with the region from which those samples came from. The 2008, 2011 and 2012 samples were from Atlantic Forest, while the 2009 samples were from Caatinga (with expected higher δ2Hf values). As Caatinga is represented by additional 23 samples, the higher δ2Hf values were likely not associated with a specific year, but to the region of origin.
The isotopic variability resulting from different subfamilies and feather types should be considered. We recommend that future researches should focus initially on one species or genera and on wing feathers, to reduce the possible associated noise.
Biomes, bioclimatic variables, and air masses
The dry northeastern region (Caatinga) presented the highest δ2Hf values and part of this effect can be explained by the proximity to the coast. This region has the lowest values of Mean Annual Precipitation (MAP; <1,000 mm) and high Mean Annual Solar Radiation (MASR; 1.8–1.9 kJ/m2/day). It receives the Equatorial Atlantic air mass, which is mostly hot and humid [69], but the region also presents orographic barriers that prevent rainfall from the Atlantic Ocean entering the region, contributing to the overall aridity. The high values found are consistent with patterns found for migratory Barn Swallows (Hirundo rustica), where breeding individuals from Mississippi presented mean δ2Hf equal to -36.4‰ and were associated to non-breeding grounds in northeast South-America [26]. This contrast may be particularly useful in animal trafficking applications, as the region is usually a considerable source of illegally captured animals [73].
The northwestern region (Amazon) presented a 20‰ range in δ2Hf values with an isotopic gradient observed from east to west. The region presents the highest MAP (1500–3500 mm) and a low Annual Temperature Range (ATR; 10º C in the north and 15º C in the south to the east). This region is affected by the Equatorial Continental air mass, which is hot and humid, and transports humidity to the central part of the country [66, 69]. Isotopic homogeneity was previously observed in modeled precipitation isoscapes and have been reported as an effect of the strong and relatively uniform evapotranspiration from the Amazon forest [28, 74]. Terzer-Wassmuth et al. [70] also observed a significant influence of outgoing longwave radiation on Amazonian basin precipitation δ2H values. However, our findings suggest that this region may not be so homogeneous, but due to the lack of sampling on these remote landings, we suggest that more effort should be dedicated to isotopic investigations in this region. However, our findings suggest the possibility of using a more structured isoscape based on tissue δ2H as a geographical tracer of animals from the region.
The central region (Cerrado) is primarily represented by savanna. It is formed by a very heterogeneous formation, where grasslands and gallery forests occur in juxtaposition. This heterogeneity was not reflected in our δ2Hf values, but we did find a depletion in 2Hf from northeast to southwest. This pattern may occur due to the atmospheric circulation of the rains in the central region of Brazil, being more affected by Equatorial air masses in the northern part, while the southern part is more affected by Tropical (drier) air masses [69]. Additionally, precipitation coming from the Amazon influences the water circulation towards the southeast. As a result, the δ2H values of precipitation in the west of the Cerrado are more depleted in 2H.
The central western region (Pantanal) is located west of the Cerrado biome. It is a seasonally flooded steppe savanna, affected by the Equatorial Continental air mass coming from the Amazon and by the Polar Atlantic air mass [69]. The MAP is similar to Cerrado (1,000–1,500 mm), however, the region receives lower MASR (1.5–1.6 kJ/m2/day) than northern Cerrado and Caatinga. This region is more depleted in 2H in its eastern portion, similar to the adjacent Cerrado portion. However, the western portion presented a similar pattern to the δ2Hp(Feb-April) surface, with higher values than the eastern portion.
The Atlantic Forest is present along the Brazilian coast extending to the interior of the country in the south. This region had more latitudinal variation in feather δ2H, with higher values in the northern portion and lower in the southern. The pattern aligns with the MAP values, the δ2Hp(Feb-April) surface, the PWQ and the MARS values, with southern portion holding higher precipitation amounts, lower δ2Hp values and lower solar radiation. This biome is affected by the Tropical Atlantic air mass in the coast and by the Polar Atlantic mass in the south [69]. The presence of a polar air mass, associated with higher altitudes can be responsible for the higher 2H depletion in the region.
The southernmost region (Pampas) is a biome dominated by grasslands. It has the highest ATR (20 ºC) and intermediate values for the other bioclimatic variables. The region is affected by the Polar Atlantic air mass [69] and presented low δ2Hf values consistent with the most southern and coldest region in Brazil.
Model validation and uncertainty
Our model presented a spatial pattern consistent with the expected result for Brazil. However, users should pay attention to some weaknesses of the model regarding the spatial pattern of the samples and the model’s performance in explaining the variance of observed data. We are aware that the testing dataset should be spatially independent of the training dataset [75]. The high performance from the holdout testing data (R2 = 0.9) may indicate a strong spatial dependency between the testing and training dataset. That is expected because our data comes from museums and is highly related to bird capture hotspots. To minimize this limitation, we also used the 10-fold cross-validation with five repetitions to access the model performance. We believe it is the most reliable and conservative method to assess the model performance.
As shown by model validation using the training dataset after 10-fold cross-validation, our model explained 25.6% of δ2Hf variation, significantly departing from the 1:1 correspondence line. As shown in Methods, the initial relationship between δ2Hp from waterisotope database and observed δ2Hf in Brazil explained 16.7% of the variation in δ2Hf, while the relationship found for waterisotope database selected quarter (February to April) explained 25.2% of the variation (S4 File). This result indicates that our model presented a very low improvement when compared to a simple linear regression using only the δ2Hp(Feb-April) variable, and that improvements on GNIP network in Brazil can likely bring great benefits for the development of future tissue-specific hydrogen isoscapes.
The isoscape presented here can help direct future sampling for new isoscapes and for origin assignments, based on designed sampling strategies. According to Contina et al. (2021) [76], an end-point strategy might help to reduce sampling effort and the uncertainties due to model extrapolation. This strategy is based on training data from the maximum and minimum latitudinal or isotopic values. Nevertheless, those authors indicate that the best strategy can be variable according to the research questions and the study constrains, and should be carefully evaluated.
Concluding remarks
The isoscape developed and presented here was designed to test the possibility of using isotopic tracing techniques to assist in academic and enforcement applications in Brazil. Although presenting a lower predictability when compared to isoscapes from Northern regions, it is aligned with low predictability found in other tropical regions [77]. The observed low variation (-85.7 to -13.6‰) for δ2Hf values over large regions of Brazil can be compensated for through the use of informed Bayesian priors [78, 79] in assignments as well as the use of stable isotopes of other elements (as δ13C, δ15N, δ18O, δ87Sr). A multiple-isotopic approach has been successfully used in other regions, improving the origin-assignment of migratory bird species [17, 24, 26, 28, 80, 81] and should also be developed for Brazil and neighboring regions.
We encourage the improvement of this isoscape through the use of a larger sample size focused on a taxon-specific approach and on flight feathers, the use of new keratin standard material for higher δ2Hf values, and the use of a more robust Brazilian surface of water δ2H.
Despite the acknowledged improvements needed and weaknesses discussed, the provided isoscape can be very useful for broader scale questions, such as identifying whether an animal seized in Brazil comes from captivity or from the wild. Other uses are also viable if the isoscape is used with caution and attention to its limited spatial precision.
Supporting information
Identification number (ID), year of collection, species name under CBRO (Brazilian Ornithological Records Committee [37]), family, subfamily, feather type (body, wing or tail), latitude and longitude, Brazilian state and biome where feathers were collected, corresponding university’s scientific collection (museum*) where feathers were deposited, and feather’s hydrogen isotopic (‰ VSMOW) raw values.
(PDF)
R code and results for Analysis of Variance followed by a Tukey’s HSD (variable year of collection, feather type, biome, and sub-family), Linear Models (variable latitude and longitude), and Model Selection with all variables.
(PDF)
R code to calculate WorldClim bioclimatic variables for a specific timeframe.
(PDF)
R code and results. Including S4 Fig (Linear regressions between δ2Hf and δ2Hp) and S4 Table (Precipitation isotopic values extracted from waterisotopes.org).
(PDF)
First step- Prepare data. R code to input raster and observations, and extract values. Including S5 Table in S4 File with extracted values; Second step- Selection of variables using Recursive Feature Elimination–RFE. R code to run RFE and plot results; Third step- Prepare data and run Random Forest (RF). R code to run RF and explore results with partial plots; Fourth step- Model validation. R code to validate the model and plot results; Fifth step- Spatial prediction. R code to apply modeled values into an isoscape, calculate method uncertainty, mean, standard deviation and coefficient of variation.
(PDF)
Acknowledgments
We thank all the curators that kindly provided samples and those that collected or sent samples, namely, Alexandre Aleixo and Lincoln Carneiro from Museo Paraense Emílio Goeldi, Mario Cohn-Haft from Instituto Nacional de Pesquisas da Amazônia (INPA), Renato Caparroz, Eliane Freitas, Gislaine Fernandes, Thiago Filadelfo, Pedro Diniz, and Miguel Marini from Universidade de Brasília (UnB), Vitor Piacentini from Universidade Federal de Mato Grosso (UFMT), Mauro Pichorim from Universidade Federal do Rio Grande do Norte (UFRN), Renato Gaban from Universidade Federal de Alagoas (UFAL), Henrique Batalha and Rilquer Mascarenhas from Universidade Federal da Bahia (UFBA), Karlla Rios from Cemafauna at the Universidade Federal do Vale do São Francisco (UNIVASF), Carla Fontana from Pontíficia Universidade Católica do Rio Grande do Sul (PUCRS), and Leandro Bugoni from Universidade Federal do Rio Grande (FURG). We thank Blanca X. Mora-Alvarez for assistance in preparing stable isotope samples, and S. Van Wilgenburg and an anonymous reviewer for their valuable contributions.
Data Availability
All relevant data are within the manuscript and its Supporting information files.
Funding Statement
The work was supported by the project number 23038.006832/2014-11- Edital CAPES 25/2014 – Pró-Forenses (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior -https://www.gov.br/capes/pt-br). RDA received a pos-doctoral scholarship from CAPES (process number 88881.357615/2019-1) and KAH received funding from the University of Western Ontario (https://www.uwo.ca/) and the Natural Sciences and Engineering Research Council (NSERC -https://www.nserc-crsng.gc.ca/index_eng.asp) of Canada (2017-04430). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Johnson CN, Balmford A, Brook BW, Buettel JC, Galetti M, Guangchun L, et al. Biodiversity losses and conservation responses in the Anthropocene. Science. 2017;356: 270–275. doi: 10.1126/science.aam9317 [DOI] [PubMed] [Google Scholar]
- 2.Fahrig L. Effect of habitat fragmentation on the extinction threshold: A synthesis. Ecol Appl. 2002;12: 346–353. 10.1890/1051-0761(2002)012[0346:EOHFOT]2.0.CO;2. [DOI] [Google Scholar]
- 3.Beever EA, Ray C, Wilkening JL, Brussard PF, Mote PW. Contemporary climate change alters the pace and drivers of extinction. Glob Chang Biol. 2011;17: 2054–2070. doi: 10.1111/j.1365-2486.2010.02389.x [DOI] [Google Scholar]
- 4.Tella JL, Hiraldo F. Illegal and legal parrot trade shows a long-term, cross-cultural preference for the most attractive species increasing their risk of extinction. PLoS One. 2014;9: e107546. doi: 10.1371/journal.pone.0107546 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hobson KA, Wassenaar LI, editors. Tracking animal migration with stable isotopes. Elsevier Academic Press; 2019.
- 6.Destro GFG, Pimentel TL, Sabaini RM, Borges RC, Barreto R. Efforts to combat wild animals trafficking in Brazil. In: Lameed GA, editor. Biodiversity Enrichment in a Diverse World. InTech; 2012. pp. 421–436. 10.5772/48351.
- 7.Bowen GJ, Cai Z, Fiorella RP, Putman AL. Isotopes in the water cycle: Regional- to global-scale patterns and applications. Annu Rev Earth Planet Sci. 2019;47: 453–479. doi: 10.1146/annurev-earth-053018-060220 [DOI] [Google Scholar]
- 8.Rozanski K, Araguás-Araguás L, Gonfiantini R. Isotopic patterns in modern global precipitation. Geophysical Monograph Series. American Geophysical Union (AGU); 1993. pp. 1–36.
- 9.Magozzi S, Zanden HBV, Wunder MB, Bowen GJ. Mechanistic model predicts tissue–environment relationships and trophic shifts in animal hydrogen and oxygen isotope ratios. Oecologia. 2019;191: 777–789. doi: 10.1007/s00442-019-04532-8 [DOI] [PubMed] [Google Scholar]
- 10.West AG, February EC, Bowen GJ. Spatial analysis of hydrogen and oxygen stable isotopes (“isoscapes”) in ground water and tap water across South Africa. J Geochemical Explor. 2014;145: 213–222. doi: 10.1016/j.gexplo.2014.06.009 [DOI] [Google Scholar]
- 11.Fiorella RP, Poulsen CJ, Pillco Zolá RS, Barnes JB, Tabor CR, Ehlers TA. Spatiotemporal variability of modern precipitation δ18O in the central Andes and implications for paleoclimate and paleoaltimetry estimates. J Geophys Res Atmos. 2015;120: 4630–4656. doi: 10.1002/2014JD022893 [DOI] [Google Scholar]
- 12.Yamanaka T, Makino Y, Wakiyama Y, Kishi K, Maruyama K, Kano M, et al. How reliable are modeled precipitation isoscapes over a high-relief mountainous region? Hydrol Res Lett. 2015;9: 118–124. doi: 10.3178/hrl.9.118 [DOI] [Google Scholar]
- 13.Zhang M, Wang S. A review of precipitation isotope studies in China: Basic pattern and hydrological process. J Geogr Sci. 2016;26: 921–938. 10.1007/s11442-016-1307-y. [DOI] [Google Scholar]
- 14.Gori Y, Stradiotti A, Camin F. Timber isoscapes. A case study in a mountain area in the Italian Alps. Heinze B, editor. PLoS One. 2018;13: e0192970. doi: 10.1371/journal.pone.0192970 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sena-Souza JP, Houlton BZ, Martinelli LA, Nardoto GB. Reconstructing continental-scale variation in soil δ15N: a machine learning approach in South America. Ecosphere. 2020;11: 1–41. doi: 10.1002/ecs2.3223 [DOI] [Google Scholar]
- 16.Bataille CP, von Holstein ICC, Laffoon JE, Willmes M, Liu XM, Davies GR. A bioavailable strontium isoscape for Western Europe: A machine learning approach. PLoS One. 2018;13: 1–27. doi: 10.1371/journal.pone.0197386 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hobson KA, Van Wilgenburg SL, Wassenaar LI, Powell RL, Still CJ, Craine JM. A multi-isotope (δ13C, δ15N, δ2H) feather isoscape to assign Afrotropical migrant birds to origins. Ecosphere. 2012;3: 1–20. doi: 10.1890/ES12-00018.1 [DOI] [Google Scholar]
- 18.Hobson KA, Van Wilgenburg SL, Wassenaar LI, Larson K. Linking hydrogen (δ2H) isotopes in feathers and precipitation: Sources of variance and consequences for assignment to isoscapes. PLoS One. 2012;7: e35137. doi: 10.1371/journal.pone.0035137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hobson KA, Van Wilgenburg SL, Larson K, Wassenaar LI. A feather hydrogen isoscape for Mexico. J Geochemical Explor. 2009;102: 167–174. doi: 10.1016/j.gexplo.2009.02.007 [DOI] [Google Scholar]
- 20.Rogers KM, Wassenaar LI, Soto DX, Bartle JA. A feather-precipitation hydrogen isoscape model for New Zealand: Implications for eco-forensics. Ecosphere. 2012;3: 1–13. doi: 10.1890/ES11-00343.1 [DOI] [Google Scholar]
- 21.Hobson KA, Van Wilgenburg SL, Faaborg J, Toms JD, Rengifo C, Sosa AL, et al. Connecting breeding and wintering grounds of Neotropical migrant songbirds using stable hydrogen isotopes: A call for an isotopic atlas of migratory connectivity. J F Ornithol. 2014;85: 237–257. doi: 10.1111/jofo.12065 [DOI] [Google Scholar]
- 22.Flockhart TDT, Wassenaar LI, Martin TG, Hobson KA, Wunder MB, Norris DR. Tracking multi-generational colonization of the breeding grounds by monarch butterflies in eastern North America. Proc R Soc B Biol Sci. 2013;280. doi: 10.1098/rspb.2013.1087 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Hobson KA, Soto DX, Paulson DR, Wassenaar LI, Matthews JH. A dragonfly (δ2H) isoscape for North America: A new tool for determining natal origins of migratory aquatic emergent insects. Methods Ecol Evol. 2012;3: 766–772. doi: 10.1111/j.2041-210X.2012.00202.x [DOI] [Google Scholar]
- 24.Veen T, Hjernquist MB, Van Wilgenburg SL, Hobson KA, Folmer E, Font L, et al. Identifying the African wintering grounds of hybrid flycatchers using a multi-isotope (δ2H, δ13C, δ 15N) assignment approach. PLoS One. 2014;9. doi: 10.1371/journal.pone.0098075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.De Ruyck C, Hobson KA, Koper N, Larson KW, Wassenaar LI. An appraisal of the use of hydrogen-isotope methods to delineate origins of migratory Saw-whet Owls in North America. Condor. 2013;115: 366–374. doi: 10.1525/cond.2013.120019 [DOI] [Google Scholar]
- 26.Garcia-Perez B, Hobson KA. A multi-isotope (δ2H, δ13C, δ15N) approach to establishing migratory connectivity of Barn Swallow (Hirundo rustica). Ecosphere. 2014;5: 1–12. doi: 10.1890/ES13-00116.1 [DOI] [Google Scholar]
- 27.Hobson KA, Wassenaar LI. Linking breeding and wintering grounds of neotropical migrant songbirds using stable hydrogen isotopic analysis of feathers. Oecologia. 1997;109: 142–148. doi: 10.1007/s004420050068 [DOI] [PubMed] [Google Scholar]
- 28.Hobson KA, Kardynal KJ. An isotope (δ34S) filter and geolocator results constrain a dual feather isoscape (δ2H, δ13C) to identify the wintering grounds of North American Barn Swallows. Auk. 2016;133: 86–98. doi: 10.1642/auk-15-149.1 [DOI] [Google Scholar]
- 29.Chesson LA, Barnette JE, Bowen GJ, Brooks JR, Casale JF, Cerling TE, et al. Applying the principles of isotope analysis in plant and animal ecology to forensic science in the Americas. Oecologia. 2018;187: 1077–1094. doi: 10.1007/s00442-018-4188-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Meier-Augenstein W. From stable isotope ecology to forensic isotope ecology—Isotopes’ tales. Forensic Sci Int. 2019;300: 89–98. doi: 10.1016/j.forsciint.2019.04.023 [DOI] [PubMed] [Google Scholar]
- 31.Cerling TE, Omondi P, Macharia AN. Diets of Kenyan elephants from stable isotopes and the origin of confiscated ivory in Kenya. Afr J Ecol. 2007;45: 614–623. doi: 10.1111/j.1365-2028.2007.00784.x [DOI] [Google Scholar]
- 32.Shibuya EK, Sarkis JES, Negrini-Neto O, Martinelli LA. Carbon and nitrogen stable isotopes as indicative of geographical origin of marijuana samples seized in the city of São Paulo (Brazil). Forensic Sci Int. 2007;167: 8–15. doi: 10.1016/j.forsciint.2006.06.002 [DOI] [PubMed] [Google Scholar]
- 33.Shibuya EK, Sarkis JES, Negrini-Neto O, Ometto JPHB. Multivariate classification based on chemical and stable isotopic profiles in sourcing the origin of marijuana samples seized in Brazil. J Braz Chem Soc. 2007;18: 205–214. doi: 10.1590/S0103-50532007000100024 [DOI] [Google Scholar]
- 34.Nardoto GB, Sena‐Souza JP, Chesson LA, Martinelli LA. Tracking geographical patterns of contemporary human diet in Brazil using stable isotopes of nail keratin. In: Parra RC, Zapico SC, Ubelaker DH, editors. Forensic Science and Humanitarian Action: Interacting with the Dead and the Living. John Wiley & Sons; 2020.
- 35.Sena-Souza J, Costa F, Nardoto G. Background and the use of isoscapes in the Brazilian context: Essential tool for isotope data interpretation and natural resource management. Rev Ambient e Agua. 2019;14: e2282. [Google Scholar]
- 36.Costa FV, Monteiro KRG. Guia de identificação de aves traficadas no Brasil. Costa FV, Monteiro KRG, editors. Florianópolis: BECONN—Produção de Conteúdo; 2016.
- 37.Piacentini V de Q, Aleixo A, Agne CE, Maurício GN, Pacheco JF, Bravo GA, et al. Annotated checklist of the birds of Brazil by the Brazilian Ornithological Records Committee. Rev Bras Ornitol. 2015;23: 91–298. [Google Scholar]
- 38.Kelly JF, Atudorei V, Sharp ZD, Finch DM. Insights into Wilson’s Warbler migration from analyses of hydrogen stable-isotope ratios. Oecologia. 2002;130: 216–221. doi: 10.1007/s004420100789 [DOI] [PubMed] [Google Scholar]
- 39.Johnson EI, Stouffer PC, Bierregaard RO Jr. The phenology of molting, breeding and their overlap in central Amazonian birds. J Avian Biol. 2012;43: 141–154. doi: 10.1111/j.1600-048X.2011.05574.x [DOI] [Google Scholar]
- 40.Marini MÂ, Durães R. Annual patterns of molt and reproductive activity of passerines in south-central Brazil. Condor. 2001;103: 767–775. doi: 10.1650/0010-5422(2001)103[0767:apomar]2.0.co;2 [DOI] [Google Scholar]
- 41.Silveira MB, Marini MÂ. Timing, duration, and intensity of molt in birds of a Neotropical Savanna in Brazil. Condor. 2012;114: 435–448. doi: 10.1525/cond.2012.110022 [DOI] [Google Scholar]
- 42.Wolfe JD, Terrill RS, Johnson EI, Powell LL, Ryder TB. Ecological and evolutionary significance of molt in lowland Neotropical landbirds. Ornithology. 2021;138: 1–13. doi: 10.1093/ornithology/ukaa073 [DOI] [Google Scholar]
- 43.Wassenaar LI, Hobson KA. Improved method for determining the stable-hydrogen isotopic composition (δD) of complex organic materials of environmental interest. Environ Sci Technol. 2000;34: 2354–2360. doi: 10.1021/es990804i [DOI] [Google Scholar]
- 44.Barton K. MuMIn: Multi-Model Inference. 2016. https://cran.r-project.org/web/packages/MuMIn/MuMIn.pdf.
- 45.R Core Development Team. A language and environment for statistical computing. Viena, Austria: R Foundation for Statistical Computing; 2022. www.R-project.org. [Google Scholar]
- 46.Inkscape Project. Inkscape. 2020. https://inkscape.org.
- 47.RStudio Team. RStudio: Integrated development for R. Boston: RStudio, PBC; 2020.
- 48.Hijmans RJ. Raster: Geographic data analysis and modeling. R Package. 2019. https://cran.r-project.org/package=raster.
- 49.Pebesma E. Simple features for R: Standardized support for spatial vector data. R J. 2018;10: 439–446. doi: 10.32614/RJ-2018-009 [DOI] [Google Scholar]
- 50.Hijmans RJ, Phillips S, Leathwick J, Elith J. dismo: Species distribution modeling. 2017. https://cran.r-project.org/package dismo.
- 51.Kuhn M. Building predictive models in R using the caret package. J Stat Softw. 2008;28: 1–26.27774042 [Google Scholar]
- 52.Kuhn M. caret: Classification and regression training. R package version 6.0–86. 2020. https://cran.r-project.org/package=caret.
- 53.Bivand R, Keitt T, Rowlingson B. rgdal: Bindings for the “Geospatial” data abstraction library. 2019. https://cran.r-project.org/package=rgdal.
- 54.Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2: 18–22. https://cran.r-project.org/doc/Rnews/. [Google Scholar]
- 55.Wickham H. ggplot2: Elegant graphics for data analysis. Springer-Verlag; New York; 2016. https://ggplot2.tidyverse.org. [Google Scholar]
- 56.Greenwell BM. pdp: An R package for constructing partial dependence plots. R J. 2017;9: 421–436. https://journal.r-project.org/archive/2017/RJ-2017-016/index.html. [Google Scholar]
- 57.Qgis Developmental Team. QGIS v.3.4.7. Geographic Information System. Open Source Geospatial Foundation Project. http://qgis.osgeo.org. 2019.
- 58.Fick SE, Hijmans RJ. WorldClim 2: new 1‐km spatial resolution climate surfaces for global land areas. Int J Climatol. 2017;37: 4302–4315. doi: 10.1002/joc.5086 [DOI] [Google Scholar]
- 59.Harris I, Osborn TJ, Jones P, Lister D. Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Sci data. 2020;7: 109. 10.6084/m9.figshare.11980500 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Harbert RS, Nixon KC. Climate reconstruction analysis using coexistence likelihood estimation (CRACLE): A method for the estimation of climate using vegetation. Am J Bot. 2015;102: 1277–1289. doi: 10.3732/ajb.1400500 [DOI] [PubMed] [Google Scholar]
- 61.Bowen GJ. Gridded maps of the isotopic composition of meteoric waters. 2020. https://wateriso.utah.edu/waterisotopes/pages/data_access/ArcGrids.html.
- 62.Bowen GJ, Revenaugh J. Interpolating the isotopic composition of modern meteoric precipitation. Water Resour Res. 2003;39. doi: 10.1029/2003WR002086 [DOI] [Google Scholar]
- 63.Wassenaar LI, Hobson KA. Stable-carbon and hydrogen isotope ratios reveal breeding origins of red-winged blackbirds. Ecol Appl. 2000;10: 911–916. doi: 10.1890/1051-0761(2000)010[0911:SCAHIR]2.0.CO;2 [DOI] [Google Scholar]
- 64.Gregorutti B, Michel B, Saint-Pierre P. Correlation and variable importance in random forests. Stat Comput. 2017;27: 659–678. doi: 10.1007/s11222-016-9646-1 [DOI] [Google Scholar]
- 65.Kuhn M, Johnson K. Applied predictive modeling. Springer; 2013.
- 66.Gastmans D, Santos V, Galhardi JA, Gromboni JF, Batista LV, Miotlinski K, et al. Controls over spatial and seasonal variations on isotopic composition of the precipitation along the central and eastern portion of Brazil. Isotopes Environ Health Stud. 2017;53: 518–538. doi: 10.1080/10256016.2017.1305376 [DOI] [PubMed] [Google Scholar]
- 67.Cota S, Peixoto C, Barreto A, Gastmans D, Santos V, Terzer S, et al. GNIP stations in Brazil: Importance, past and current developments. III International Congress on Subsurface Environment. 2013. pp. 1–4. https://aguassubterraneas.abas.org/asubterraneas/article/view/27504.
- 68.Franzini AS, Kirchheim RE, Nogueira GS, Niemayer AF, Oliveira FR, Gastmans D, et al. The new GNIP Network in Brazil: An example of sound institutional arrangements. Int Symp Isot Hydrol Adv Underst Water Cycle Process. 2019; 11–14. https://www.researchgate.net/publication/333532988_THE_NEW_GNIP_NETWORK_IN_BRAZIL_AN_EXAMPLE_OF_SOUND_INSTITUTIONAL_ARRANGEMENTS.
- 69.James PE. Air masses and fronts in South America. Geogr Rev. 1939;29: 132–134. doi: 10.2307/210071 [DOI] [Google Scholar]
- 70.Terzer-Wassmuth S, Wassenaar LI, Welker JM, Araguás-Araguás LJ, Terzer‐Wassmuth S, Wassenaar LI, et al. Improved high‐resolution global and regionalized isoscapes of δ18O, δ2H and d‐excess in precipitation. Hydrol Process. 2021;35: 285–287. doi: 10.1002/hyp.14254 [DOI] [Google Scholar]
- 71.Brlík V, Pipek P, Brandis K, Chernetsov N, Costa FJV, Herrera M. LG., et al. The reuse of avian samples: opportunities, pitfalls, and a solution. Ibis. 2021. doi: 10.1111/ibi.12997 [DOI] [Google Scholar]
- 72.Soto DX, Koehler G, Wassenaar LI, Hobson KA. Re-evaluation of the hydrogen stable isotopic composition of keratin calibration standards for wildlife and forensic science applications. Rapid Commun Mass Spectrom. 2017;31: 1193–1203. doi: 10.1002/rcm.7893 [DOI] [PubMed] [Google Scholar]
- 73.Alves RRN, Lima JRF, Araujo HFP. The live bird trade in Brazil and its conservation implications: an overview. Bird Conserv Int. 2013;23: 53–65. doi: 10.1017/S095927091200010X [DOI] [Google Scholar]
- 74.Bowen GJ, Wassenaar LI, Hobson KA. Global application of stable hydrogen and oxygen isotopes to wildlife forensics. Oecologia. 2005;143: 337–348. doi: 10.1007/s00442-004-1813-y [DOI] [PubMed] [Google Scholar]
- 75.Ploton P, Mortier F, Réjou-Méchain M, Barbier N, Picard N, Rossi V, et al. Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nat Commun. 2020;11: 1–11. doi: 10.1038/s41467-019-13993-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Magozzi S, Contina A, Wunder M, Zanden H Vander, Bowen G. A global compilation of known-origin keratin hydrogen and oxygen isotope data for wildlife and forensic research. 2020; 9674. 10.5194/egusphere-egu2020-9674. [DOI]
- 77.Gutiérrez-Expósito C, Ramírez F, Afán I, Forero MG, Hobson KA. Toward a deuterium feather isoscape for sub-saharan Africa: Progress, challenges and the path ahead. PLoS One. 2015;10: e0135938. doi: 10.1371/journal.pone.0135938 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Pekarsky S, Angert A, Haese B, Werner M, Hobson KA, Nathan R. Enriching the isotopic toolbox for migratory connectivity analysis: A new approach for migratory species breeding in remote or unexplored areas. Divers Distrib. 2015;21: 416–427. doi: 10.1111/ddi.12306 [DOI] [Google Scholar]
- 79.Ma C, Vander Zanden HB, Wunder MB, Bowen GJ. assignR: An r package for isotope-based geographic assignment. Methods Ecol Evol. 2020;11: 996–1001. doi: 10.1111/2041-210X.13426 [DOI] [Google Scholar]
- 80.Koehler G, Kardynal KJ, Hobson KA. Geographical assignment of polar bears using multi-element isoscapes. Sci Rep. 2019;9: 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Hobson KA, Møller AP, Van Wilgenburg SL. A multi-isotope (δ13C, δ15N, δ2H) approach to connecting European breeding and African wintering populations of barn swallow (Hirundo rustica). Anim Migr. 2012;1: 8–22. doi: 10.2478/ami-2012-0002 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Identification number (ID), year of collection, species name under CBRO (Brazilian Ornithological Records Committee [37]), family, subfamily, feather type (body, wing or tail), latitude and longitude, Brazilian state and biome where feathers were collected, corresponding university’s scientific collection (museum*) where feathers were deposited, and feather’s hydrogen isotopic (‰ VSMOW) raw values.
(PDF)
R code and results for Analysis of Variance followed by a Tukey’s HSD (variable year of collection, feather type, biome, and sub-family), Linear Models (variable latitude and longitude), and Model Selection with all variables.
(PDF)
R code to calculate WorldClim bioclimatic variables for a specific timeframe.
(PDF)
R code and results. Including S4 Fig (Linear regressions between δ2Hf and δ2Hp) and S4 Table (Precipitation isotopic values extracted from waterisotopes.org).
(PDF)
First step- Prepare data. R code to input raster and observations, and extract values. Including S5 Table in S4 File with extracted values; Second step- Selection of variables using Recursive Feature Elimination–RFE. R code to run RFE and plot results; Third step- Prepare data and run Random Forest (RF). R code to run RF and explore results with partial plots; Fourth step- Model validation. R code to validate the model and plot results; Fifth step- Spatial prediction. R code to apply modeled values into an isoscape, calculate method uncertainty, mean, standard deviation and coefficient of variation.
(PDF)
Data Availability Statement
All relevant data are within the manuscript and its Supporting information files.



