Skip to main content
Scientific Data logoLink to Scientific Data
. 2016 Aug 1;3:160049. doi: 10.1038/sdata.2016.49

Bat trait, genetic and pathogen data from large-scale investigations of African fruit bats, Eidolon helvum

Alison J Peel 1,2,3,a, Kate S Baker 1,2,4,5, David T S Hayman 1,2,6, Richard Suu-Ire 7,8, Andrew C Breed 9, Guy-Crispin Gembu 10, Tiziana Lembo 11, Andrés Fernández-Loras 2,12, David R Sargan 1, Anthony R Fooks 9, Andrew A Cunningham 2, James L N Wood 1
PMCID: PMC4968192  PMID: 27479120

Abstract

Bats, including African straw-coloured fruit bats (Eidolon helvum), have been highlighted as reservoirs of many recently emerged zoonotic viruses. This common, widespread and ecologically important species was the focus of longitudinal and continent-wide studies of the epidemiological and ecology of Lagos bat virus, henipaviruses and Achimota viruses. Here we present a spatial, morphological, demographic, genetic and serological dataset encompassing 2827 bats from nine countries over an 8-year period. Genetic data comprises cytochrome b mitochondrial sequences (n=608) and microsatellite genotypes from 18 loci (n=544). Tooth-cementum analyses (n=316) allowed derivation of rare age-specific serologic data for a lyssavirus, a henipavirus and two rubulaviruses. This dataset contributes a substantial volume of data on the ecology of E. helvum and its viruses and will be valuable for a wide range of studies, including viral transmission dynamic modelling in age-structured populations, investigation of seasonal reproductive asynchrony in wide-ranging species, ecological niche modelling, inference of island colonisation history, exploration of relationships between island and body size, and various spatial analyses of demographic, morphometric or serological data.

Subject terms: Population dynamics, Ecological epidemiology, Genetic variation

Background & Summary

The straw-coloured fruit bat (Eidolon helvum) is a common, widely distributed, migratory species, occurring across sub-Saharan Africa and some offshore islands (Fig. 1)1,2. Since 2007, investigations into the epidemiology and ecology of zoonotic viral infections in E. helvum have been undertaken via longitudinal sampling of wild populations in Ghana. Complementing this, between 2008–2011 and in 2014, cross-sectional sampling events were undertaken to determine the genetic population structure of E. helvum, and to assess whether the serological findings in Ghana were representative across the species’ range (Fig. 1).

Figure 1. Map showing location of E. helvum sampling locations for genetic and serological analyses.

Figure 1

Shading represents the distribution range of E. helvum. Sampling locations are numbered as in Table 1 (available online only). Adapted with permission from Mickleburgh et al.43 and Peel et al.12.

Four viruses were the focus of our serological surveys in E. helvum bats: Lagos bat virus (LBV), African henipaviruses, Achimota virus 1 (AchPV1) and Achimota virus 2 (AchPV2). Lagos bat virus is one of at least 15 known species in the Lyssavirus genus3 and has been isolated from E. helvum on multiple occasions4,5. An African henipavirus is still yet to be isolated, however a full genome sequence has been obtained (putative name: African bat henipavirus Eid_hel/GH-M74a/GHA/2009 (M74))6. Achimota viruses 1 and 2 are closely related rubulaviruses for which serological evidence suggestive of spillover to humans in Africa exists7.

The specific aims of the data collection were to:

  • Investigate whether antibodies to LBV, henipaviruses and Achimota viruses are present in E. helvum across its continental and island range, and to explore the antibody dynamics where possible.

  • Describe the genetic metapopulation structure of E. helvum using a combination of mitochondrial (mtDNA) and microsatellite markers.

  • Gather information on E. helvum distribution and seasonal patterns of reproduction.

  • Combine results from these multidisciplinary studies to make inferences about virus transmission dynamics, and ultimately make inferences on the spillover risk to human populations.

Samples in this dataset are from 2827 bats from nine countries over an 8-year period (Fig. 2, Table 1 (available online only)). Raw data comprises spatial (roost location), seasonal (timing of sampling and seasonal birth pulses), morphological (forearm length, body weight), demographic (age, sex, reproductive status, mother-offspring relationships) and identification (individually numbered thumb-band) components. Data generated includes genetic characterisation (mtDNA sequencing and microsatellite genotyping) and serological assay results (for LBV, henipaviruses and Achimota viruses).

Figure 2. Sampling intensity per month, by country.

Figure 2

Red represents high sampling intensity and the numbers of samples collected per month is recorded within each grid cell. Records in the database with unknown collection date are not represented here (nine from Annobón, seven from Bioko, 14 from Príncipe, 10 from Rio Muni and 13 from São Tomé).

Table 1. Overview of Eidolon helvum sampling locations and sample types used in this study.

  Ghana Democratic Republic of the Congo Kenya Zambia Malawi Tanzania Uganda Rio Muni Bioko Príncipe São Tomé Annobón TOTAL
Map references refer to numbers in Fig. 1.
                         
Persons involved in sample collection: DTSH (David T.S. Hayman), AJP (Alison J. Peel), KSB (Kate S. Baker), GCG (Guy Crispin Gembu), AFL (Andrés Fernández Loras), RCCDS (Ricardo Castro Cesar De Sa), IRP (Iñaki Rodriguez Prieto), ACB (Andrew C Breed), JJ (Javier Juste), IVK (Ivan Kuzmin). Table modified from Peel et al. Continent-wide panmixia of an African fruit bat facilitates transmission of potentially zoonotic viruses. Nature Communications, 2013 4, p3770
                         
Map ref 1, 2, 3 4 5, 6 7 8 9, 10 11, 12 13 14 15 16 17  
Sampling locations 5.588 N, 0.182 W 0.528 N, 25.371 E 0.007 S, 34.605 E 12.589 S, 30.246 E 15.788 S, 35.011 E 6.801 S, 39.283 E 0.303 N, 32.554 E * 3.747 N, 8.770 E 1.680 N, 7.390 E 0.239 N, 6.482 E 1.459 S, 5.645 E  
6.700 N, 1.625 W   0.043 S, 34.410 E     6.800 S, 39.285 E 0.427 N, 33.201 E   3.752 N, 8.772 E 1.590 N, 7.337 E 0.341 N, 6.563 E 1.418 S, 5.621 E  
7.647 N, 1.882 W   3.216 N, 40.119 E     6.823 S, 37.666 E 0.416 N, 33.204 E   * * 0.286 N, 6.678 E 1.460 S, 5.630 E  
                    0.048 N, 6.505 E *  
                    0.022 N, 6.517 E    
                    0.043 N, 6.544 E    
                    0.029 N, 6.532 E    
                    0.252 N, 6.671 E    
                    *    
Sampling Years 2007, 2008, 2009 2008 2007 2008 2008 2009, 2010 2009 1992 1992, 2010 1992, 2010 1992, 2010 1992, 2010, 2011, 2014  
Collected by DTSH, KSB, AJP, ACB, AFL GCG IVK5 AJP AJP AJP AJP JJ44 AJP, AFL, JJ44 AJP, AFL, JJ44 AJP, AFL, JJ44 AJP, AFL, RCCDS, IRP, ACB, JJ44  
Number of bats 1735 34 93 123 24 279 7 10 112 76 122 211 2826
Number of forearm measurements 1454 34 92 42 22 234 1 0 105 60 101 202 2347
Number of body weight measurements 1540 34 92 30 16 248 1 0 105 25 103 202 2396
Number of tooth ages 87 0 0 0 0 93 0 0 24 34 76 2 316
Number of Lagos bat virus results 792 0 0 10 12 230 5 0 105 57 96 121 1428
Number of henipavirus results 1637 0 0 12 16 248 7 0 105 62 98 123 2308
Number of Achimota virus 1 results 157 0 0 0 0 25 0 0 0 21 31 10 244
Number of Achimota virus 2 results 157 0 0 0 0 25 0 0 0 21 31 10 244
Number of cytochrome b sequences 74 21 20 23 24 50 7 10 108 70 118 83 608
Number of microsatellite genotypes 20 21 20 20 24 49 7 9 111 75 115 83 554

*Exact sampling location for some samples not available.

Multiple publications have arisen from these data, however many aspects remain unexplored. Demographic analyses have estimated birth and survival rates8,9, and explored the effect of hunting on the latter9. Variations in roost composition have suggested a fission-fusion social structure9. Serological analyses have identified: the presence of antibodies against LBV, henipaviruses and Achimota viruses in E. helvum in Ghana7,10,11 and more broadly across the species’ range, including isolated off-shore islands7,12,13; that these viruses circulate endemically in E. helvum in Ghana, with evidence of horizontal transmission7,14,15; and that E. helvum bats previously infected with LBV can have long-survival post infection16. The henipavirus dataset was used to develop a Bayesian method to determine appropriate cutoffs for serological assays17. Population genetic analyses identified that E. helvum are panmictic across their continental range, but that genetically isolated populations exist on isolated islands12.

Other publications arising from these samples, but based on analyses not included here, include the development of a universal real-time assay and a pseudotype neutralisation assay for Lyssaviruses18,19, microsatellite loci characterisation20, estimation of divergence times between Eidolon sister species21, inference of movement ecology based on stable isotope ratios22, demonstration of Ebola antibodies16,23, identification of multiple novel viruses24, and novel Bartonella species in bat flies collected from E. helvum25,26.

This dataset contributes a substantial volume of data on the ecology of E. helvum and its viruses and will be valuable for a wide range of studies. In particular, an age-specific dataset such as this is rare and valuable for wildlife, especially bats. Further analyses could include viral transmission dynamic modelling in age-structured populations, including the use of cutting-edge Bayesian approaches to address complex epidemiological questions27; time-series analyses on 5 years of wild henipavirus serological data from the same study site in Ghana (n=1486 data points), investigation of seasonal reproductive asynchrony in wide-ranging species; ecological niche modelling; inference of island colonisation history, exploration of relationships between island and body size; and various spatial analyses of demographic, morphometric or serological data. Field samples (e.g. serum, blood cells, urine, skin samples) and extracted DNA from individual bats in this dataset exist in storage and the authors are open to collaborative requests to undertake further analyses.

Methods

These methods are expanded and modified versions of descriptions in our previous publications, as cited in each section below. All associated data can be found in ‘Eidolon helvum data 2007–2014.csv’ [Data Citation 1].

Capture and Data Collection

Capture and sampling information has been described previously e.g. (refs 10,12). Sampling locations comprised 13 E. helvum roosting sites in continental Africa, and 14 in the four main islands in the Gulf of Guinea (Fig. 1, further detail in Table 1 (available online only)). In the majority of locations, data are from a single sampling event (sometimes comprising multiple sampling sessions within a one month period). Repeated sampling was conducted in Ghana (multiple sampling events per year over four years), Tanzania (one sampling event per year over two years) and Annobón (three sampling events over 4 years) as these locations were the focus of specific research studies. All fieldwork was undertaken under permits granted by national and local authorities (listed in Acknowledgements) and under ethics approval from the Zoological Society of London Ethics Committee (WLE/0489 and WLE/0467), using field protocols which followed ASM guidelines28. Bats were captured at the roost with mist nets (6–18 m; 38 mm) as they departed the roost site at dusk, or returned at dawn. Except for a proportion of bats that were euthanased for virological studies (n=238), bats were released following sampling. Additional samples and data were obtained from other research groups (n=152) and in collaboration with local hunters in São Tomé (n=102), where bats are hunted for human consumption.

Personal protective equipment (long clothing, face masks, eye protection and gloves) was worn during sample collection. Morphometric and demographic details were recorded from bats under manual restraint. Female reproductive status was assigned as non-reproductive, pregnant, or lactating, according to the descriptions provided in Table 2. The phase in the reproductive cycle (i.e. the time in months between the sampling date and the beginning of the last birthing season) was estimated based on published data and the pregnancy status of females (foetal size, assuming a true gestation period of 4 months (Mutere 1965)) or degree of juvenile development during sampling.

Table 2. Reproductive status classifications for female E. helvum.

Phase Reproductive status Abbreviation Description
Phase represents the estimated time in months since the beginning of the last birthing season.      
1–2 Lactating L Neonate attached and suckling, or milk able to be expressed from mammary glands of females
3–8 Non-reproductive NR Not detectably pregnant on abdominal palpation
9 Very early pregnant VEP Uterine bulge palpable on abdominal palpation, up to 1cm diameter (detectable from~0.5 cm diameter with careful palpation)
10 Early pregnant EP Uterine bulge palpable on abdominal palpation, 1–2 cm diameter
11 Mid pregnant MP Uterine bulge and foetus palpable on abdominal palpation, but no obvious abdominal distension of the female (2-~3.5 cm)
12 Late pregnant LP Foetus palpable on abdominal palpation, with distension of the female's abdomen so that it is wider than the base of the ribs

Age was assessed by morphological characteristics (Table 3) and all individuals were placed into one of four age classes: Neonate (N; <2mths), Juvenile (J; 2–<6 months), Sexually Immature (SI; 6–<24 months) or Adult (A; ≥24 months). For a subset of samples, the timing of sampling in relation to the birthing season permitted further classification of SI individuals into 6-month age groups SI.1, SI.2 and SI.3 (6 –<12, 12–<18, 18–<24 months, respectively). Additionally, for bats that were hunted or euthanased following capture, upper canine teeth were extracted, air dried and shipped to the USA (Matson’s laboratory, USA) for histological examination to assess the number of tooth cementum annuli present29,30. Following previous studies31, it was assumed that each observed cementum layer represented one year. Each age estimation was scored with a certainty code: A: highest certainty of reported age (51% of samples, e.g. Fig. 3a,b), B: histological evidence supported a given age result±0.5–1.5 years (46% of samples, e.g. Fig. 3c), or C: tooth or section quality was too compromised to accurately age (3% of samples).

Table 3. Age classification system used for E. helvum.

Age class
Abbr. Age (mths) Timing of presence Classification features  
Standard age classes Extended age classes        
Neonate Neonate   <2 Up to 2 months after birth pulse Suckling
Juvenile Juvenile J 2–<6 2–6 months after birth pulse Small body size
Sexually Immature SI 6 – <24   Adult size (or nearly adult size), but undeveloped nipples and genitalia compared with adults  
  Sexually Immature 1 SI.1* 6 – <12 6–12 months after birth pulse Early in this period, distinguishable from SI.3 by smaller body size, and from Adults by lack of sexual development. In the last 3 months, females expected to be pregnant, but with undeveloped nipples.
  Sexually Immature 2 SI.2 12 –<18 Up to 6 months after birth pulse Clearly distinguishable from juveniles by size and adults by lack of sexual development
  Sexually Immature 3 SI.3* 18 – <24 6–12 months after birth pulse Early in this period, distinguishable from SI.1 by larger body size, and from Adults by lack of sexual development. In the last 3 months this is less clear, however females expected to be pregnant with their first offspring (primiparous), but with undeveloped nipples.
Adult Adult A ≥ 24 All year Full body size within ranges expected for an adult. Females have developed nipples, males with developed testicles.

*SI.1 and SI.3 stages represent two different birth cohorts, overlapping in time. Where distinction between SI.1 and SI.3 was unclear during sampling and analysis of measurements, individuals were simply classified as ‘SI’ and omitted from extended age class analyses.

SI individuals become sexually active during the last 4 months of their second year, however the highly synchronous and seasonal nature of the birthing period means that it is simpler for individuals to be recognised as sexually mature only after the beginning of the birthing pulse. While pregnant SI.3 (primiparous) individuals are therefore not technically sexually ‘immature’, by classifying them as such, they can be differentiated from adult females (>2 years of age).

Figure 3. Histological sections of upper canine teeth from E. helvum for cementum age analysis (Giemsa stain).

Figure 3

Photographs and captions courtesy of Gary Matson, Matson’s Laboratory, MT, USA. Each age estimation was scored with a certainty code: A: highest certainty of reported age, B: histological evidence supported a given age result±0.5–1.5 years, or C: tooth or section quality was too compromised to accurately age. (a) Bat ID 424. Cementum age 2, certainty code A. 100X. The tooth was in excellent histological condition, as indicated by the presence of periodontal membrane and good differential staining between annuli and light cementum. (b) Bat ID 62. Cementum age 6, certainty code A. 100X. Annuli are complex, with at least two components each year. A key feature of age analysis is resolving uncertainty about whether complex annuli or individual components are being used as age indicators. (c) Bat ID 44. Cementum age 13, certainty code B (13–15 yrs). 400X. The root tip of this tooth had been broken off during extraction. Missing cementum complicates age analysis, reducing the evidence available for evaluating whether annuli observed at one point may be clearly identifiable as components of complex annuli at another point.

Genetic and blood samples were collected under manual restraint. Wing membrane biopsies (4-mm) were placed into 70% alcohol. Up to 1 ml blood was collected from the propatagial vein using a citrated 1 ml syringe and placed into a plain 1.5 ml eppendorf tube.

Molecular methods

Molecular methods have been described previously12,20. Genomic DNA was extracted from E. helvum tissues (predominantly wing membrane biopsies, but also liver and muscle samples, all stored in ethanol) using DNeasy Blood and Tissue Kits (QIAGEN Ltd., Crawley, West Sussex, UK). DNA was quantified using Quant-iT PicoGreen dsDNA kits (Molecular probes, UK), and later using a Nanodrop ND-1000 Spectrophotometer (Thermo Fisher Scientific, UK) and diluted to a standard concentration.

Twenty E. helvum loci developed in a previous study20 were quality-checked using a subset of samples. Loci E and Ae were discarded due to difficulty in scoring or high error rates and data from locus Ag were re-binned and re-scored, correcting earlier issues with allelic dropout. In total, 170 continental and 385 island samples were run as multiplex PCRs at 18 loci (TSY, FWB, MNQX, AgPK, AcAfAi, AdAh) in 10 μl PCRs, containing 4ng template DNA, 0.2 μM of each primer, and 5 μl Type-it Multiplex PCR Master Mix (QIAGEN Ltd.). Positive and negative controls were included on each plate and amplification was performed using the following conditions: 5 min at 94 °C; 30 cycles of 30 s at 95 °C, 90 s at 57 °C, and 30 s at 72 °C; then 30 s at 60 °C. Genotyping was performed by capillary electrophoresis using a Beckman CEQ 8000 (Beckman, UK). Allele sizes were scored automatically prior to manual verification. Genotyping data from 18 loci are provided in ‘Eidolon helvum data 2007–2014.csv’ [Data Citation 1]. Loci B has previously been identified as being X-linked20.

Fragments of the mitochondrial DNA cytochrome b (cytb) gene were amplified from continental samples by PCR using the generic primers L14722 (5′- CGA AGC TTG ATA TGA AAA ACC ATC GTT G)32 and H15149 (5′- AAA CTG CAG CCC CTC AGA ATG ATA TTT GTC CTC A)33 in 20 μl reactions, containing 0.1–1 ng template DNA, 0.2 μM of each primer, 0.25 mM of each dNTP, 1.5 mM MgCl2, 0.25 μl of Taq polymerase (Invitrogen), and 0.2 μl 10X reaction buffer and with the following conditions: 5 min at 94 °C; 40 cycles of 1 min at 93 °C, 1 min at 54 °C, and 2 min at 72 °C; then 7 min at 72 °C. Although these generic primers were adequate with continental samples (8% PCR failure), amplification from isolated Gulf of Guinea island samples was less successful (48% PCR failure). Shortened primers (EhM2814 (5′- GCT TGA TAT GAA AAA CCA TCG TTG) and EhM2815 (5′- CAG CCC CTC AGA ATG ATA TTT GT) resulted in successful amplification when using Microzone MegaMix-Gold reagent (Microzone Ltd, UK). PCRs were performed in 20 μl reactions, containing 2 ng template DNA, 0.25 μM of each primer, and 10 μl MegaMix-Gold, using the following conditions: 5 min at 95 °C; 33 cycles of 30 sec at 95 °C, 30 sec at 53 °C, and 45 sec at 72 °C. PCR products were checked by gel electrophoresis on 1% agarose gels, purified using Exosap-IT clean-up (USB Europe, Germany) and sequenced in both directions on an ABI 3730xl DNA Analyser, (Applied Biosystems). Paired sequences were edited and aligned using the STADEN Package v1.6 (ref. 34). Multiple sequence alignment was performed using default settings in T-COFFEE35. Sequences were checked manually and trimmed to a standard length (397 bp) in JALVIEW v2 (ref. 36). No sequence differences were detected in 38 samples sequenced using both primer pairs, so data were combined.

Data from 608 and 544 individuals is available for cytb and microsatellite analyses (at 18 loci), respectively (Table 1 (available online only)).

Serological analyses

Serological methods have been described previously7,10–14.

A modified fluorescent antibody virus neutralization (mFAVN) assay using the LBVNig56 isolate was used to detect neutralising antibodies against LBV10,37. Samples were tested in duplicate using threefold serial dilutions (representing reciprocal titres of 9, 27, 81, and 243–19,683). Human rabies immunoglobulin, LBV-positive rabbit serum, and rabies-vaccinated mouse serum were used as positive controls and negative rabbit and mouse serum were negative controls. Titres were considered positive at IC100 endpoint reciprocal dilutions >1: 9 (100% neutralisation of virus).

Henipavirus antibodies detected in African fruit bat samples using virus neutralisation assays, multiplexed microsphere assays and pseudotype assays developed to target other known henipaviruses (Hendra and Nipah viruses) and are presumed to represent cross-neutralisation or cross-reactivity12. Here, Luminex multiplexed microsphere binding assays were used to detect antibodies against henipaviruses (HeV and NiV). In these assays, purified recombinant expressed henipavirus soluble G glycoproteins38 are conjugated to internally coloured and distinguishable microspheres, allowing multiplexing. For African bat samples, stronger results were consistently observed in NiV binding assays and virus neutralisation tests13, so only NiV binding assay results are included in the dataset. Binding results are outputted as median fluorescence intensity (MFI) values of at least 100 microspheres for each virus type. In mid-December 2010, major repair work was undertaken on the Luminex machine being used for serological analysis. A subset of samples that had been analysed before the repairs were repeated to calibrate results (n=293). MFI values pre- and post-repair work were significantly different, making the use of a single cutoff inappropriate17. Two approaches were taken to designate results as seropositive or seronegative. First, a Bayesian mixture model was applied as described in17. Cutoffs for pre- and post-repair work were determined so that samples above this cutoff were ≥ 99% likely to be in the seropositive distribution (MFI=156.1 and 127.5, respectively). Second, linear regression of pre- and post-maintenance ln(MFI) values demonstrated a significant linear relationship (Fig. 4, R20.81, F-statistic: 1306 (1, 296), P<2.2e-16), and the variance decreases for higher MFI values (above the cutoff). Pre-maintenance MFI values were converted to post-maintenance values using the formula:

Figure 4. Correlation between ln(MFI) values pre- and post-repair of the Luminex machine used to run the assays, based on 293 samples.

Figure 4

The linear regression line is in red. (R20.81, F-statistic: 1306 (1, 296), P<2.2e–16).

NEW_MFI=exp (0.7795774*ln(OLD_MFI)+0.4392832).

The Bayesian mixture model was applied to this transformed and combined data using the same method. From this analysis, MFI values >94.2 were ≥ 99% likely to be in the seropositive distribution. Results from the two methods were compared and the second method resulted in the highest congruence between pre- and post-maintenance paired results (congruence in 266/298 samples versus 250/298 samples for the first method), and these data were therefore used in the final dataset. Raw MFI values are available on request.

Antibodies against Achimota viruses 1 and 2 were detected using virus neutralisation assays7, with all testing in duplicate. Samples were diluted to 1:20 and incubated with 200 TCID50 of virus for 30 min at 37 °C prior to the addition of Vero cell suspension at an MOI equivalent to 0.01. Cell monolayers were assessed for evidence of virus neutralization 7 days post infection. Where sample volume permitted, positive samples were titrated in a 2-fold dilution series from 1: 20 to 1: 160 and retested using the same protocol.

Data Records

The data are contained in a single comma-separated file (.csv format), entitled ‘Eidolon helvum data 2007–2014’ (Data Citation 1). Each row below the header represents an individual bat (n=2,827), and the columns (n=68) contain sample identifier information, demographic and morphometric data, and results of genetic and serological assays. Full descriptions of the column titles are included in the Table 4 (available online only).

Table 4. Descriptor codes for data file.

Column header name Description
Sample Unique sample identifier, generally consisting of two-letter country identifier, and sample number
SamplingEventID Sampling event identifier, consisting of two-letter country identifier, three-letter location identifier (see LocationID) and the date of sampling in the format yymmdd
Samplers Initials of researchers who undertook sample collection. ACB (Andrew C Breed), AFL (Andrés Fernández-Loras), AJP (Alison J Peel), DTSH (David T S Hayman), GCG (Guy-Crispin Gembu), IRP (Iñaki Rodriguez-Prieto), IVK (Ivan Kuzmin), KSB (Kate S Baker), TL (Tiziana Lembo), JJ (Javier Juste), DJL (DJ Long).
Samplingdate Sampling date yyyy-mm-dd
Birthdate Estimated date for start of the seasonal birth pulse for that location.
Mths.since.birthing Months (to one decimal place) from the date of sampling to the beginning of the previous birth pulse
Cont_island Whether sampling location is on the African continent or on an island
Country Country of sampling location.To facilitate analysis, islands are recorded under their island name. The islands São Tomé and Príncipe are part of the country ‘São Tomé and Príncipe’ and Bioko, Rio Muni and Annobón are part of the country ‘Equatorial Guinea’
Region Sampling region/city
Location Specific sampling location
Latitude Latitude in decimal notation
Longitude Longitude in decimal notation
Bat.wt Bat weight in grams, measured to the nearest gram
Sex Bat sex. F (Female), M (Male), NA (not recorded)
Age Age classification, assessed by morphological characteristics. N (Neonate; <2mths), J (Juvenile; 2 – <6 months), SI (Sexually Immature; 6 – <24 months) or A (Adult; ≥24 months).
Age.3 Age classification, assessed by morphological characteristics and timing of sampling relative to the birth pulse. For a subset of samples, the timing of sampling in relation to the birthing season permitted further classification of SI individuals into 6-month age groups SI.1, SI.2 and SI.3 (6 – <12, 12 – <18, 18 – <24 months, respectively).
Teeth.Age Age in years from histological examination of tooth cementum annuli (0–15)
TeethCC Teeth certainty score. Each age estimation was scored with a certainty code: A: highest certainty of reported age, B: histological evidence supported a given age result, or C: tooth or section quality was too compromised to accurately age.
Teeth.Age.Range The possible age range based on teeth age and teeth certainty score.
Teeth.Age.mths Teeth Age converted into months using time since estimated beginning of previous birth pulse (0–188)
Age.mths Age in months, from ‘known age from teeth’ plus from individuals <2 years for which an age in months could be calculated from Age.3 classifications (0–188)
Repro.status Reproductive status of adult female bats, and primiparous SI bats. NR (non-reproductive), P (pregnant), VEP (pregnant), EP (pregnant), MP (pregnant), LP (pregnant), L (lactating) or U (unknown) according to the descriptions provided in Table 2. Males and neonate, juvenile and other sexually immature females are recorded as NA.
Mother.ID Sample ID of the mother if sample was a neonate or foetus (if known)
Offspring.ID Sample ID of the neonate or foetus if sample was a pregnant or lactating bat (if known)
Forearm Forearm length in millimetres, measured to one decimal place
Band.no. ID number on metal thumb bands attached to some bats
LBV.mFAVN Result from Lagos Bat Virus mFAVN serological assay. 0 (seronegative), 1 (seropositive), NA (no result)
Henipavirus Result from henipavirus Luminex binding assay, using a cutoff so that samples above the cutoff are 99% likely to be seropositive. Data is from the Nipah virus assay, but is presumed to represent cross-reactivity to African henipaviruses rather than the presence of Nipah virus itself. 0 (seronegative), 1 (seropositive), NA (no result)
AchPV1 Result from Achimota virus 1 virus neutralisation assay. 0 (seronegative), 1 (seropositive), NA (no result)
AchPV2 Result from Achimota virus 1 virus neutralisation assay. 0 (seronegative), 1 (seropositive), NA (no result)
GeneticsID Alternative sample ID used in genetic analyses
Cytb Cytochrome b sequence
GenBankAccession Accession number for cytochrome b sequences deposited in genbank (PopSets 426207275 and 723591518)
T.1/T.2 Microsatellite allele sizes at loci T
S.1/S.2 Microsatellite allele sizes at loci S
F.1/F.2 Microsatellite allele sizes at loci F
W.1/W.2 Microsatellite allele sizes at loci W
N.1/N.2 Microsatellite allele sizes at loci N
Q.1/Q.2 Microsatellite allele sizes at loci Q
X.1/X.2 Microsatellite allele sizes at loci X
P.1/P.2 Microsatellite allele sizes at loci P
K.1/K.2 Microsatellite allele sizes at loci K
Ac.1/Ac.2 Microsatellite allele sizes at loci Ac
Af.1/Af.2 Microsatellite allele sizes at loci Af
Ai.1/Ai.2 Microsatellite allele sizes at loci Ai
Ad.1/Ad.2 Microsatellite allele sizes at loci Ad
Y.1/Y.2 Microsatellite allele sizes at loci Y
Ag.1/Ag.2 Microsatellite allele sizes at loci Ag
Ah.1/Ah.2 Microsatellite allele sizes at loci Ah
B.1/B.2 Microsatellite allele sizes at loci B
M.1/M.2 Microsatellite allele sizes at loci M

Technical Validation

Molecular analyses

Recommendations for minimisation and assessment of errors that may occur during the sampling, DNA extraction, amplification, sequencing, genotyping and data analysis processes were followed where possible39 (Table 5).

Table 5. Measures adopted to minimise and allow assessment of errors which may occur during the sampling, DNA extraction, amplification, sequencing, genotyping and data analysis processes (adapted from Bonin et al. 2004; Table 4).

MEASURES ADOPTED TO REDUCE AND DETECT ERRORS
Sampling
•Standard protocols followed for collection and labelling samples
•Sampling information logged on paper sampling sheets, and retained as backup
•Sampling data transferred to a single sampling database
DNA extraction
•Inclusion of negative controls to monitor contamination
•Consistency in extraction protocol
•Extraction performed in a different room to downstream analyses
DNA amplification
•Inclusion of negative control to monitor contamination
•Inclusion of positive control as a reference sample
•Consistency in amplification protocol
•Replicate amplifications - by main user, and also another user (blinded)
Sequencing
•Inclusion of negative control to monitor contamination
•Inclusion of positive control as a reference sample
•Consistency in sequencing protocol
•Sequencing in both forward and reverse directions
•Replicate sequencing - including multiple samples individually repeated, and a single sample repeated many times
Sequencing Analyses
•Alignment of forward and reverse sequences, with automated and manual checking of inconsistencies
•Alignment against GenBank database
•Multiple alignment - manual checking of all polymorphisms
Genotyping
•Perform pilot study to assess suitability of new loci for larger study
•Automated scoring, with manual checking
•Inclusion of positive control as a reference sample
•Discard low-quality samples
•Inclusion of a locus invariant in 99% of samples to monitor inter-assay variability of CEQ allele migration
•Cross-reading of datasets, and repeat scoring of overall dataset if required
Genotyping Analyses
•Discard poor markers from pilot study
•Quantify overall genotyping error rate and assess acceptability
•Consider genotyping errors as a possible cause of Hardy-Weinberg or linkage disequilibrium

As previously described20, microsatellite loci were tested for evidence of departure from Hardy–Weinberg equilibrium (HWE) and genotypic disequilibrium using FSTAT 2.9 (ref. 40), with appropriate Bonferroni corrections for multiple testing. All loci were analysed in MICRO-CHECKER41 to test for null alleles, stuttering and large allelic dropout as a cause of departure from HWE. Additionally, since Locus M displayed extremely low polymorphism (99.1% of individuals were homozygous for a particular allele), this locus was included in all PCR plates as a positive control and to determine inter-assay variability in allele fragment length. Error rates for microsatellite loci are reported in Peel et al.20 Inter-assay genotyping variability, measured by the variation in fragment length of the dominant allele of locus M on each plate, was low (range 134.32−134.66) across 27 runs and two control samples. Loci Y proved difficult to confidently bin due to alleles of single nucleotide difference and was therefore not included in the dataset.

Error rates for cytb analyses were assessed by replicate extractions (performed on 2.4% of samples), replicate PCR and sequencing reactions (performed on 8–14% of extracted samples), and by inclusion of positive and negative controls for all extractions and PCRs. Poor quality mtDNA sequence traces were excluded. Background PCR and sequencing error rates of the new E. helvum cytb primers EhM2814 and EhM2815 were assessed by running 70 replicates of a single sample. PCR and sequencing error rates were calculated at the base-pair level. Sequencing error rate was negligible (0–0.01%) across samples repeated in duplicate, and no substitutions were observed in the 70 replicate sequences obtained from a single sample (Table 6).

Table 6. Mitochondrial DNA PCR repeat rates and sequencing error rates.

  cytb original primers cytb new primers
Number of samples 207 438
Number of samples repeat PCR and sequenced 28 1
Number of duplicates run per sample 1 70
Total number of repeats 28 70
% of sequences repeated 13.5% 16.0%
Number of base pairs in sequence 418 397
Number of repeated sequences 28 70
Number of base pairs compared 11704 27790
Number of sequence errors (mismatches) 1 0
% PCR/sequence error (No of mismatches/no bp compared) 0.01% 0%
% samples extracted in duplicate, cytb PCR and sequenced 2.4%  
Number of errors (mismatches) 0  
% error 0.0%  

Serological analyses

All serological assays included positive and negative controls. Samples were tested in duplicate (LBV and Achimota viruses) or with 100 replicates (henipaviruses). Further validation procedures for multiplexed microsphere binding assays are presented as part of the methods, above.

Usage Notes

Users of these data are advised that importing the.csv data file (Data Citation 1) into Microsoft Excel can result in formatting errors, particularly with the column ‘Teeth.Age.Range’. Rather than opening the file with Excel (by double-clicking, for example), it is suggested that users instead select ‘File>Import>csv file >Delimited’, then select the ‘Teeth.Age.Range’ column and set the column data format as ‘Text’. Alternatively, importing and processing the data into the software ‘R’42 may be preferable.

Additional Information

How to cite: Peel, A. J. et al. Bat trait, genetic and pathogen data from large-scale investigations of African fruit bats, Eidolon helvum. Sci. Data 3:160049 doi: 10.1038/sdata.2016.49 (2016).

Supplementary Material

sdata201649-isa1.zip (25.3KB, zip)

Acknowledgments

The authors thank the governments of Ghana, Tanzania, Malawi, Zambia, Uganda, Equatorial Guinea, and São Tomé and Príncipe for facilitating this research. For their invaluable support in planning and implementing the field work, we also thank the Wildlife Division of the Forestry Commission, and the Veterinary Services Directorate, Ghana; Zambian Wildlife Authority; Kasanka Trust; Malawi Ministry of Tourism, Wildlife and Culture, Malawi Ministry of Agriculture, Irrigation and Food Security; former Tanzania Ministry of Livestock and Fisheries Development (now Ministry of Agriculture, Livestock and Fisheries); Tanzania Wildlife Research Institute; Sokoine University of Agriculture, Tanzania; Makerere University, Uganda; Equatorial Guinea Ministerio de Agricultura y Bosques; Equatorial Guinea Instituto del Desarrollo Forestal y Gestión de las Áreas Protegidas; Universidad Nacional de Guinea Ecuatorial; São Tomé and Príncipe Ministério de Agricultura, Desenvolvimento Rural e Pesca; Ecosystèmes Forestiers d'Afrique Centrale; Associação Monte Pico, São Tomé and Príncipe; Iñaki Rodriguez-Prieto; Alex Torrance; Lucrecia Bilé Osa Ahara; Heidi Ruffler; Ricardo Castro César de Sá; Ricardo Faustino de Lima; Mariana Carvalho and Meyir Ziekah. Additional genetic samples from the Gulf of Guinea islands and Rio Muni were supplied by Javier Juste (Museo Nacional de Ciencias Naturales, CSIC) and DJ Long (Californian Academy of Sciences). Additional sera were supplied by Ivan Kuzmin (previously Centers for Disease Control and Prevention). Christopher Broder (Uniformed Services University of the Health Sciences) supplied HeV and NiV soluble G glycoprotein for the henipavirus serological assays. The authors are indebted to Linfa Wang, Gary Crameri, Jennifer Barr and others at the CSIRO Australian Animal Health Laboratory, Geelong, Australia and Daniel Horton and David Seldon at the Animal Health and Veterinary Laboratories Agency (now Animal and Plant Health Agency), UK for providing invaluable support, constructive discussions and technical assistance throughout the project.

Funding for this study was provided by the Cambridge Infectious Diseases Consortium (grant VT0105) (A.J.P, D.T.S.H., K.S.B.), The Charles Slater Trust (A.J.P.), Zebra Foundation for Veterinary Zoological Education (A.J.P.), Isaac Newton Trust (A.J.P.), the Wellcome Trust (D.T.S.H., K.S.B.), a David H. Smith postdoctoral fellowship (D.T.S.H.), the RAPIDD program of the Science and Technology Directorate, Department of Homeland Security, Fogarty International Center, National Institutes of Health (D.T.S.H., J.L.N.W., A.R.F.), The Alborada Trust (J.L.N.W.), a European Union FP7 project ANTIGONE (Anticipating Global Onset of Novel Epidemics 278976) (A.C.B., J.L.N.W., A.A.C., A.R.F.), Royal Society Wolfson Research Merit Award (A.A.C.), Lincoln Park Zoo, Chicago, US (T.L.) and a Wellcome Trust Value in People award through the University of Glasgow (T.L.). KSB is in receipt of a Wellcome Trust Postdoctoral Training fellowship for clinicians (106690/Z/14/Z) and is supported by Wellcome Trust grant number 098051.

Footnotes

The authors declare no competing financial interests.

Data Citations

  1. Peel A. 2016. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.2fp34

References

  1. DeFrees S. L. & Wilson D. E. Eidolon helvum. Mammalian Species 312, 1–5 (1988). [Google Scholar]
  2. Bergmans W. Taxonomy and biogeography of African fruit bats (Mammalia, Megachiroptera). 3. The genera Scotonycteris Matschie, 1894, Casinycteris Thomas, 1910, Pteropus Brisson, 1762, Eidolon Rafinesque, 1815. Beaufortia 40, 111–177 (1990). [Google Scholar]
  3. Kuzmin I. V. in Current Laboratory Techniques in Rabies Diagnosis, Research and Prevention (eds Rupprecht C. & Nagarajan T. 1) 3–21 (Elsevier, 2014). [Google Scholar]
  4. Boulger L. & Porterfield J. Isolation of a virus from Nigerian fruit bats. Transactions of the Royal Society of Tropical Medicine and Hygiene 52, 421–424 (1958). [DOI] [PubMed] [Google Scholar]
  5. Kuzmin I. V. et al. Lagos bat virus in Kenya. J Clin Microbiol 46, 1451–1461 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Drexler J. F., Corman V. & Müller M. Bats host major mammalian paramyxoviruses. Nature Communications 3, 1–12 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Baker K. S. et al. Novel, Potentially Zoonotic Paramyxoviruses from the African Straw-Colored Fruit Bat Eidolon helvum. Journal of Virology 87, 1348–1358 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Hayman D. T. S. et al. Demography of straw-colored fruit bats in Ghana. J. Mammal 93, 1393–1404 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Hayman D. T. S. & Peel A. J. Can survival analyses detect hunting pressure in a highly connected species? Lessons from straw-coloured fruit bats. Biological Conservation 200, 131–139 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Hayman D. T. S. et al. Antibodies against Lagos bat virus in megachiroptera from West Africa. Emerg Infect Dis 14, 926–928 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Hayman D. T. S. et al. Evidence of henipavirus infection in West African fruit bats. PLoS ONE 3, e2739 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Peel A. J. et al. Continent-wide panmixia of an African fruit bat facilitates transmission of potentially zoonotic viruses. Nature Communications 4, 3770 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Peel A. J. et al. Henipavirus neutralising antibodies in an isolated island population of African fruit bats. PLoS ONE 7, e30346 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Hayman D. T. S. et al. Endemic Lagos bat virus infection in Eidolon helvum. Epidemiol Infect 140, 2163–2171 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Peel A. J. The epidemiology of Lagos bat virus and henipaviruses in straw-coloured fruit bats (Eidolon helvum). 1–327 (University of Cambridge, 2012). at http://www.dspace.cam.ac.uk/handle/1810/244571.
  16. Hayman D. T. S. et al. Long-term survival of an urban fruit bat seropositive for Ebola and Lagos bat viruses. PLoS ONE 5, e11978 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Peel A. J. et al. Use of cross-reactive serological assays for detecting novel pathogens in wildlife: assessing an appropriate cutoff for henipavirus assays in African bats. Journal of Virological Methods 193, 295–303 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hayman D. T. S. et al. A universal real-time assay for the detection of Lyssaviruses. Journal of Virological Methods 177, 87–93 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Wright E. et al. Virus neutralising activity of African fruit bat (Eidolon helvum) sera against emerging lyssaviruses. Virology 408, 183–189 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Peel A. J., Rossiter S. J., Wood J. L. N., Cunningham A. A. & Sargan D. R. Characterization of microsatellite loci in the straw-colored fruit bat, Eidolon helvum (Pteropodidae). Conserv Genet Resour 2, 279–282 (2010). [Google Scholar]
  21. Shi J. J. et al. A deep divergence time between sister species of Eidolon (Pteropodidae) with evidence for widespread panmixia. Acta Chiropterologica 16, 279–292 (2014). [Google Scholar]
  22. Ossa G., Kramer-Schadt S., Peel A. J., Scharf A. K. & Voigt C. C. The movement ecology of the straw-colored fruit bat, Eidolon helvum, in sub-Saharan Africa assessed by stable isotope ratios. PLoS ONE 7, e45729 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hayman D. T. S. et al. Ebola Virus Antibodies in Fruit Bats, Ghana, West Africa. Emerg Infect Dis 18, 1207–1209 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Baker K. S. et al. Metagenomic study of the viruses of African straw-coloured fruit bats: detection of a chiropteran poxvirus and isolation of a novel adenovirus. Virology 441, 95–106 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Billeter S. A. et al. Bartonella species in bat flies (Diptera: Nycteribiidae) from western Africa. Parasitology 139, 324–329 (2012). [DOI] [PubMed] [Google Scholar]
  26. Bai Y., Hayman D. T. S., McKee C. D. & Kosoy M. Y. Classification of Bartonella Strains Associated with Straw-Colored Fruit Bats (Eidolon helvum) across Africa Using a Multi-locus Sequence Typing Platform. PLoS Negl Trop Dis 9, e0003478 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Viana M. et al. Dynamics of a morbillivirus at the domestic-wildlife interface: Canine distemper virus in domestic dogs and lions. PNAS 112, 1464–1469 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Sikes R. S. & Gannon W.L. The Animal Care and Use Committee of the American Society of Mammalogists. Guidelines of the American Society of Mammalogists for the use of wild mammals in research. J Mammal 92, 235–253 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Matson G. et al. A laboratory manual for cementum age determination of Alaska brown bear first premolar teeth (Matson's Laboratory, 1993). [Google Scholar]
  30. Bodkin J. L., Ames J. A., Jameson R. J., Johnson A. M. & Matson G. M. Estimating age of sea otters with cementum layers in the first premolar. The Journal of Wildlife Management 61, 967–973 (1997). [Google Scholar]
  31. Divljan A., Parry-Jones K. & Wardle G. M. Age determination in the grey-headed flying fox. J Wildlife Manage 70, 607–611 (2006). [Google Scholar]
  32. Juste J. et al. Phylogeography of African Fruitbats (Megachiroptera). Mol Phylogenet Evol 13, 596–604 (1999). [DOI] [PubMed] [Google Scholar]
  33. Kocher T. et al. Dynamics of Mitochondrial DNA Evolution in Animals: Amplification and Sequencing with Conserved Primers. PNAS 86, 6196–6200 (1989). [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Staden R., Judge D., Bonfield J., Krawetz S., Womble D. in Introduction to Bioinformatics. A Theoretucal and Practical Approach (eds Krawetz S. A. & Womble D. D.) 760 (Human Press, 2003). [Google Scholar]
  35. Notredame C., Higgins D. & Heringa J. T-coffee: a novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302, 205–217 (2000). [DOI] [PubMed] [Google Scholar]
  36. Waterhouse A., Procter J., Martin D., Clamp M. & Barton G. Jalview Version 2 - a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Brookes S. M. et al. European bat lyssavirus in Scottish bats. Emerg Infect Dis 11, 572–578 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Bossart K. N. et al. Neutralization assays for differential henipavirus serology using Bio-Plex Protein Array Systems. Journal of Virological Methods 142, 29–40 (2007). [DOI] [PubMed] [Google Scholar]
  39. Bonin A. et al. How to track and assess genotyping errors in population genetics studies. Mol Ecol 13, 3261–3273 (2004). [DOI] [PubMed] [Google Scholar]
  40. Goudet J. FSTAT (Version 1.2): A computer program to calculate F-statistics. J Hered 86, 485–486 (1995). [Google Scholar]
  41. van Oosterhout C., Hutchinson W., Wills D. & Shipley P. Micro-Checker: software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Notes 4, 535–538 (2004). [Google Scholar]
  42. R Core Team. R: A language and environment for statistical computing. Version 3.0 (2014) at http://www.R-project.org/.
  43. Mickleburgh S., Hutson A. M., Bergmans W., Fahr J. & Racey P. Eidolon helvum, In:IUCN 2008. 2008 IUCN Red List of Threatened Species. Available from www.iucnredlist.org Downloaded on 6 February 2011. (2008). [Google Scholar]
  44. Juste J., Ibanez C. & Machordom A.. Morphological and allozyme variation of Eidolon helvum (Mammalia: Megachiroptera) in the islands of the Gulf of Guinea. Biol J Linn Soc 71, 359–378, (2000). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Peel A. 2016. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.2fp34

Supplementary Materials

sdata201649-isa1.zip (25.3KB, zip)

Articles from Scientific Data are provided here courtesy of Nature Publishing Group

RESOURCES