Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2004 Sep 28;2(10):e312. doi: 10.1371/journal.pbio.0020312

Identification of Birds through DNA Barcodes

Paul D N Hebert 1,, Mark Y Stoeckle 2, Tyler S Zemlak 1, Charles M Francis 3
PMCID: PMC518999  PMID: 15455034

Abstract

Short DNA sequences from a standardized region of the genome provide a DNA barcode for identifying species. Compiling a public library of DNA barcodes linked to named specimens could provide a new master key for identifying species, one whose power will rise with increased taxon coverage and with faster, cheaper sequencing. Recent work suggests that sequence diversity in a 648-bp region of the mitochondrial gene, cytochrome c oxidase I (COI), might serve as a DNA barcode for the identification of animal species. This study tested the effectiveness of a COI barcode in discriminating bird species, one of the largest and best-studied vertebrate groups. We determined COI barcodes for 260 species of North American birds and found that distinguishing species was generally straightforward. All species had a different COI barcode(s), and the differences between closely related species were, on average, 18 times higher than the differences within species. Our results identified four probable new species of North American birds, suggesting that a global survey will lead to the recognition of many additional bird species. The finding of large COI sequence differences between, as compared to small differences within, species confirms the effectiveness of COI barcodes for the identification of bird species. This result plus those from other groups of animals imply that a standard screening threshold of sequence difference (10× average intraspecific difference) could speed the discovery of new animal species. The growing evidence for the effectiveness of DNA barcodes as a basis for species identification supports an international exercise that has recently begun to assemble a comprehensive library of COI sequences linked to named specimens.


Can a single gene be used to identify all different animal species? An analysis of 255 North American birds shows that cytochrome c oxidase I might have this potential

Introduction

The use of nucleotide sequence differences in a single gene to investigate evolutionary relationships was first widely applied by Carl Woese (Woese and Fox 1977). He recognized that sequence differences in a conserved gene, ribosomal RNA, could be used to infer phylogenetic relationships. Sequence comparisons of rRNA from many different organisms led initially to recognition of the Archaea, and subsequently to a redrawing of the tree of life. More recently, the polymerase chain reaction has allowed sequence diversity in any gene to be examined. Genes that evolve slowly, like rRNA, often do not differ among closely related organisms, but they are indispensable in recovering ancient relationships, providing insights as far back as the origin of cellular life (Woese 2000). On the other hand, genes that evolve rapidly may overwrite the traces of ancient affinities, but regularly reveal divergences between closely related species.

Mitochondrial DNA (mtDNA) has been widely employed in phylogenetic studies of animals because it evolves much more rapidly than nuclear DNA, resulting in the accumulation of differences between closely related species (Brown et al. 1979; Moore 1995; Mindell et al. 1997). In fact, the rapid pace of sequence change in mtDNA results in differences between populations that have only been separated for brief periods of time. John Avise was the first to recognize that sequence divergences in mtDNA provide a record of evolutionary history within species, thereby linking population genetics and systematics and establishing the field of phylogeography (Avise et al. 1987). Avise and others also found that sister species usually show pronounced mtDNA divergences, and more generally that “biotic entities registered in mtDNA genealogies…and traditional taxonomic assignments tend to converge” (Avise and Walker 1999). Although many species show phylogeographic subdivisions, these usually coalesce into single lineages “at distances much shorter than the internodal branch lengths of the species tree” (Moore 1995). In other words, sequence divergences are much larger among species than within species, and thus mtDNA genealogies generally capture the biological discontinuities recognized by taxonomists as species. Taking advantage of this fact, taxonomic revisions at the species level now regularly include analysis of mtDNA divergences. For example, many newly recognized species of birds have been defined, in part, on the basis of divergences in their mtDNA (e.g., Avise and Zink 1988; Gill and Slikas 1992; Murray et al. 1994; AOU 1998; Banks et al. 2000, 2002, 2003).

The general concordance of mtDNA trees with species trees implies that, rather than analyzing DNA from morphologically identified specimens, it could be used the other way around, namely to identify specimens by analyzing their DNA. Past applications of DNA-based species identification range from reconstructing food webs by identifying fragments in stomachs (Symondson 2002) to recognizing products prepared from protected species (Palumbi and Cipriano 1998) and resolving complexes of mosquitoes that transmit malaria and dengue fever (Phuc et al. 2003). Despite such demonstrations, the lack of a lingua franca has limited the use of DNA as a general tool for species identifications.

If a short region of mtDNA that consistently differentiated species could be found and accepted as a standard, a library of sequences linked to vouchered specimens would make this sequence an identifier for species, a “DNA barcode” (Hebert et al. 2003a). Recent work suggests that a 648-bp region of the mitochondrial gene, cytochrome c oxidase I (COI), might serve as a DNA barcode for the identification of animal species. This gene region is easily recovered and it provides good resolution, as evidenced by the fact that deep sequence divergences were the rule between 13,000 closely related pairs of animal species (Hebert et al. 2003b). The present study extends these earlier investigations by testing the correspondence between species boundaries signaled by COI barcodes and those established by prior taxonomic work. Such tests require the analysis of groups that have been studied intensively enough to create a firm system of binomials; birds satisfy this requirement. Although GenBank holds many bird sequences, these derive from varied gene regions while a test of species identification requires comparisons of sequences from a standard gene region across species. Accordingly, the barcode region of COI was sequenced in 260 of the 667 bird species that breed in North America (AOU 1998).

Results

All 260 bird species had a different COI sequence(s); none was shared between species. COI sequences in the 130 species represented by two or more individuals were either identical or most similar to other sequences of the same species. Furthermore, with a few interesting exceptions discussed below, COI sequence differences between closely related species were far higher than differences within species (18-fold higher; average Kimura-2-parameter [K2P] differences between and within species, 7.93% and 0.43%, respectively) (Figure 1).

Figure 1. Comparison of Nucleotide Sequence Differences in COI among 260 Species of North American Birds.

Figure 1

Pairwise comparisons between 437 COI sequences are separated into three categories: differences between individuals in the same species, differences between individuals in the same genus (not including intraspecific differences), and differences between individuals in the same family (not including intraspecific or intrageneric differences).

In most cases the neighbor-joining (NJ) tree showed shallow intraspecific and deep interspecific divergences (Figure 2). However, in four exceptional cases, there were deep divergences within a species (Tringa solitaria, Solitary Sandpiper; Sturnella magna, Eastern Meadowlark; Cisthorus palustris, Marsh Wren; and Vireo gilvus, Warbling Vireo). COI sequences in each of these polytypic species separated into pairs of divergent clusters in the NJ tree. The intraspecific K2P distances in these exceptional species were 3.7%–7.2%, 9- to 17-fold higher than the average distance (Figures 2, 3, and S1).

Figure 2. NJ Tree of COI Sequences from 30 Species in Family Scolapacidae (Sandpipers and Kin).

Figure 2

The divergent pair of clustered sequences of Tringa solitaria is highlighted. An asterisk indicates a COI sequence from GenBank.

Figure 3. NJ Tree of COI Sequences from 260 Species of North American Birds.

Figure 3

Intraspecific divergences were sampled in 130 species; these are marked in blue. Four species showed deep intraspecific divergence: (a) Sturnella magna, (b) Cistothorus palustris, (c) Vireo gilvus, and (d) Tringa solitaria. Higher-order classifications in families (gray) and orders (gold) are highlighted, and are labeled on the left and right of the figure, respectively. Gold numerals indicate the two species that appear as paraphyletic lineages at the family level: (1) Oenanthe oenanthe and (2) Hirundo rustica.

Setting aside these polytypic species, the average intraspecific distance was very low, 0.27%, and the maximum average intraspecific difference was only 1.24%. Most congeneric species pairs showed divergences well above this value, but 13 species in four genera had interspecific distances that were below 1.25%. They included Larus argentatus, L. canus, L. delawarensis, L. glaucoides, L. hyperboreus, L. marinus, and L. thayeri (Herring Gull, Mew Gull, Ring-billed Gull, Iceland Gull, Glaucous Gull, Great Black-Backed Gull, and Thayer's Gull); Haematopus bachmani and H. palliatus (Black Oystercatcher and American Oystercatcher); Corvus brachyrhynchos and C. caurinus (American Crow and Northwestern Crow); and Anas platyrhynchos and A. rubripes (Mallard and American Black Duck) (Figure S1).

Although species were the focus of this study, we noted that the NJ tree of COI sequences generally matched avian classifications at higher levels, with most genera, families, and orders appearing as nested monophyletic lineages concordant with current taxonomy (Figures 3 and S1).

Discussion

The simplest test of species identification by DNA barcode is whether any sequences are found in two species; none was in this study. Although sequences were not shared by species, sequence variation did occur in some species. Thus the second test is whether the differences within species are much less than those among species. In this study we found that COI differences among most of the 260 North American bird species far exceeded those within species.

In order to conservatively test the effectiveness of COI barcodes as an identification tool, our sample must not have underestimated variability within species or have overestimated it among species. Our measures of intraspecific variation could be underestimates if members of a species show sequence divergence across their distribution that our study failed to adequately register. The two to three representatives of the 130 species used to examine this issue were collected from sites that were, on average, approximately 1,080 km apart, suggesting adequate representation of genetic diversity across their ranges. However, to further investigate this issue, we compared sequence differences within species to geographic distances between the collection points for their specimens and found these were unrelated (Figure 4). Based on these results, high levels of intraspecific divergence in COI in North American birds appear uncommon, given that we analyzed 130 different species in a variety of orders. Our findings are supported by a review of 34 mostly North American birds which showed a similarly low average maximum intraspecific K2P divergence of mtDNA of 0.7% (Moore 1995). Similarly, Weibel and Moore (2002) reported an average intraspecific divergence of 0.24% in their study of COI variation in woodpeckers. We conclude that our investigation has not underestimated intraspecific variation in any systematic fashion.

Figure 4. Genetic Difference versus Geographic Distance.

Figure 4

For each same-species pair of specimens, the geographic distance between where specimens were collected is plotted against their COI divergence (K2P).

On the other hand, our discovery of four polytypic species within a sample of 130 makes it likely there are other North American birds with divergent populations that may represent hidden species. Recent studies have identified marked mtDNA divergences within North American populations of Common Ravens (Omland et al. 2000), Fox Sparrows (Zink and Weckstein 2003), and Curve-billed Thrashers (Zink and Blackwell-Rago 2000), leading to proposals to split each into two or more species. Species with Holarctic distributions are particularly good candidates for unrecognized species, and recent DNA and morphological investigations have led taxonomists to split several such species into two, including Wilson's and Common Snipes, American and Eurasian Three-toed Woodpeckers, and American and Water Pipits (Zink et al. 1995, 2002; Miller 1996; AOU 1998; Banks et al. 2000, 2002, 2003). Widespread application of COI barcodes across the global ranges of birds will undoubtedly lead to the recognition of further hidden species.

Any critical test of the effectiveness of barcodes must also consider the possibility that our study has overestimated variability among species. We therefore looked at species individually, comparing their minimum distance to a congener with the maximum divergence within each species. This analysis included a number of well-recognized sibling species, including Calidris mauri and C. pusilla, Fraternicula arctica and F. corniculata, and Empidonax traillii and E. virescens. There were sufficient data to perform this analysis on three of the four polytypic species and on 70 of the 126 remaining species (Figure 5). The average maximum K2P divergence within these 70 species was 0.29%, while the average minimum distance to a congener was 7.05% (24-fold higher), values comparable to those for the entire data set. Prior studies that looked exclusively at sister species of birds found an average K2P mtDNA distance of 5.1% in 35 pairs (Klicka and Zink 1997) and 3.5% in 47 pairs (Johns and Avise 1998). More generally, 98% of sister species pairs of vertebrates were observed to have K2P mtDNA divergences greater than 2% (Johns and Avise 1998). Thus it appears that a COI barcode will enable the separation of most sister species of birds.

Figure 5. Intraspecific Compared to Interspecific COI Distances (K2P) for Individual Species.

Figure 5

For each species in which this comparison was possible (n = 73), maximum intraspecific variation is compared to minimum interspecific congeneric difference. For illustration purposes shown here, 2.0% is chosen as a cutoff between usual values for intra- and interspecific variation. This divides the graph into four quadrants that represent different categories of species: (I) Intraspecific distance, <2%; interspecific distance, >2%: concordant with current taxonomy; (II) Intraspecific distance, >2%; interspecific distance, >2%: probable composite species (i.e., candidate for taxonomic split); (III) Intraspecific distance, <2%; interspecific distance, <2%: recent divergence, hybridization, or synonymy; (IV) Intraspecific distance, >2%; interspecific distance, <2%: probable misidentification of specimen.

There is a possibility that the North American bird fauna is not representative of the global situation. The recent and extensive glaciations in North America may have decreased within-species variability by inducing bottlenecks in population size or may have increased variation between species by pruning many sister taxa (Avise and Walker 1998; Mila et al. 2000). This issue can only be resolved by evaluating the efficacy of barcodes in tropical and southern temperate faunas to ascertain if our results are general. We note that recent mtDNA studies in these settings have found both multiple sibling species in what were thought to be single species (Ryan and Bloomer 1999) and geographically structured variation suggesting the presence of cryptic species (Hackett and Rosenberg 1990; Bates et al. 1999).

The diagnosis of species is particularly difficult when they are young. Moreover, hybridization is often common when the ranges of recently arisen species overlap, further complicating identifications. Such newly emerged species are sometimes referred to as superspecies (Mayr and Short 1970), or species complexes, to indicate their close genetic similarity. For example, the white-headed gulls are thought to have diverged very recently, some less than 10,000 years ago (Crochet et al. 2002, 2003), and hybridization is common among many of them. It is thus not surprising that their COI barcodes and other gene loci are very similar. DNA barcodes can help to define the limits of such recently emerged species, but more gene loci need to be surveyed and more work is required to determine which analytical methods can best deduce species boundaries in such cases. The NJ method used here has the advantage of speed, and performs strongly when sequence divergences are low, so it is generally appropriate for recovering intra- and interspecies phylogeny. However, a library of COI barcodes linked to named specimens will provide the large data sets needed to test the efficacy of varied tree-building methods (for review, see Holder and Lewis 2003).

Even between species that diverged long ago, hybridization will lead to shared or very similar sequences at COI and other gene loci. Because mitochondrial DNA is maternally inherited, a COI barcode will assign F1 hybrids to the species of their female parent. Hybridization leading to the transfer of mtDNA from one species to another can result in a mtDNA tree that is incongruent with the species tree, but it will not necessarily prevent species from being distinguished, unless the mitochondrial transfer is so recent that their sequences have not diverged (Moore 1995). However, recent hybridization will lead species to share COI barcodes, and we expect that more intensive study will reveal such shared sequences in species that are known to hybridize, such as the white-headed gulls (Crochet et al. 2003) and Mallard/Black Ducks (Ankney et al. 1986; Avise et al. 1990).

In other cases, a lack of COI divergence may indicate that populations are part of a single species, helping to sort out misleading morphological classifications. For example, the blue and white morphs of Chen caerulescens, Snow Goose, were thought to be different species until recently (Cooke et al. 1995). The close COI similarity of American and Black Oystercatchers revealed in this study is consistent with suggestions that these are allopatrically distributed color morphs of a single species (Jehl 1985). Low COI divergences between American and Northwestern Crows similarly support earlier suggestions that these taxa are conspecific (Sibley and Monroe 1990; Madge and Burn 1994).

Just as COI similarities among species already questioned by taxonomists may reinforce these queries, deep COI divergences within species may reinforce suspicions of hidden diversity. For example, three of the four polytypic species in this study (Eastern Meadowlark, Marsh Wren, and Warbling Vireo) are split into two by some taxonomists (Wells 1998), and the fourth, Solitary Sandpiper, contains two allopatric subspecies with morphological differences (Godfrey 1976). In these cases, suspicions in the minds of taxonomists are reinforced by large COI divergences. If these species had not been the subject of prior scrutiny, COI barcoding would have flagged them as deserving of such attention.

The importance of sampling multiple individuals within each species is highlighted by a recent review which found evidence of species-level paraphyly or polyphyly in 23% of 2,319 animal species, including 16.7% of 331 bird species (Funk and Omland 2003). This review provides a clear discussion of possible causes (imperfect taxonomy, hybridization, incomplete lineage sorting) and indicates the need for the careful reexamination of current taxonomy and for the collection of genetic data across both geographic ranges and morphological variants. Barcoding, together with related developments in sequencing technology, is likely to provide an efficient approach to the assembly of such genetic data.

We expect that the assembly of a comprehensive barcode library will help to initiate taxonomic investigations that will ultimately lead to the recognition of many new avian species. This process will begin with the discovery of novel COI barcodes. Some of these cases will simply represent the first barcode records for described but previously unanalyzed species, but taxonomic study will confirm that others derive from new species. We propose that specimens with barcodes diverging deeply from known taxa should be known by a “provisional species” designation that links them to the nearest established taxon. For example, the divergent clusters of Solitary Sandpiper specimens might be called T. solitaria PS-1 and T. solitaria PS-2, highlighting a need for further taxonomic study.

What threshold might be appropriate for flagging genetically divergent specimens as provisional species? This threshold should certainly be high enough to separate only specimens that very likely belong to different species. Because patterns of intraspecific and interspecific variation in COI appear similar in various animal groups (Grant and Bowen 1998 [sardines]; Hebert et al. 2003a [moths]; Hogg and Hebert 2004 [springtails]), we propose a standard sequence threshold: 10× the mean intraspecific variation for the group under study. If applied to the birds examined in this study (0.27% average intraspecific variation; 2.7% threshold), a 10× threshold would recognize over 90% of the 260 known species, as well as the four probable new species. As this result demonstrates, a threshold approach will overlook species with short evolutionary histories and those exposed to recent hybridization, but it will be a useful screening tool, especially for groups that have not received intensive taxonomic analysis.

For 260 of the 667 bird species breeding in North America, our evidence shows that COI barcodes separate individuals into the categories that taxonomists call species. This adds to the evidence already in hand for insects and other arthropods that barcodes can be an efficient tool for species identification. Should future studies broaden this evidence, a comprehensive library of barcodes will make it easier to probe varied areas of avian biology. A DNA barcode will help, for example, when morphological diagnoses are difficult, as when identifying remnants (including eggs, nestlings, and adults) in the stomachs of predators. A DNA barcode could similarly identify fragments of birds that strike aircraft (Dove 2000) and recognize carcasses of protected or regulated species (Guglich et al. 1994). DNA barcodes could also reveal the species of avian blood in mosquitoes carrying West Nile virus (Michael et al. 2001; Lee et al. 2002), help experts distinguish morphologically similar juveniles or nonbreeding adults in banding work, and allow expanded nonlethal study of endangered or threatened populations.

The two essential components for an effective DNA barcode system (and thus a new master key to the encyclopedia of life [Wilson 2003]) are standardization on a uniform barcode sequence, such as COI, and a library of sequences linked to named voucher specimens. The present study provides an initial set of COI barcodes for about 40% of North American birds. More detailed sampling of COI sequences is needed for these species, and barcodes need to be gathered for the remaining North American birds and for those in other geographic regions. This work could represent a first step toward a DNA barcode system for all animal and plant life, an initiative with potentially widespread scientific and practical benefits (Stoeckle 2003; Wilson 2003; Blaxter 2004; Janzen 2004).

Materials and Methods

Existing data can only yield limited new insights into the effectiveness of a DNA-based identification system for birds. Two mitochondrial genes, cyt b and COI, are rivals for the largest number of animal sequence records greater than 600 bp in GenBank (4,791 and 3,009 species, respectively). However, COI coverage for birds is modest; 173 species share COI sequences with 600-bp overlap. As these records derive from a global avifauna of 10,000 species, they provide a limited basis to evaluate the utility of a COI-based identification system for any continental fauna, impelling us to gather new sequences.

We employed a stratified sampling design to gain an overview of the patterns of COI sequence divergence among North American birds. The initial level of sampling examined a single individual from each of 260 species to ascertain COI divergences among species. These species were selected on the basis of accessibility without regard to known taxonomic issues. The second level of sampling examined one to three additional individuals from 130 of these species to provide a general sense of intraspecific sequence divergences, as well as a preliminary indication of variation in each species. When possible, these individuals were obtained from widely separated localities in North America. The third level of our analysis involved sequencing four to eight more individuals for the few species where the second level detected more than 2% sequence divergence among individuals. Our studies examined specimens collected over the last 20 years; 98% were obtained from the tissue bank at the Royal Ontario Museum, Toronto, Canada. Collection localities and other specimen information are available in the “Birds of North America” file in the Completed Projects section of the Barcode of Life website (http://www.barcodinglife.com). Taxonomic assignments follow the latest North American checklist (AOU 1998) and its recent supplements (Banks et al. 2000, 2002, 2003).

Mitochondrial pseudogenes can complicate PCR-based studies of mitochondrial gene diversity (Bensasson et al. 2001; Thalmann et al. 2004). We used protocols to reduce pseudogene impacts that included extracting DNA from tissues rich in mitochondria (Sorenson and Quinn 1998), employing primers with high universality (Sorenson and Quinn 1998), and amplifying a relatively long PCR product because most pseudogenes are short (Pereira and Baker 2004). DNA extracts were prepared from small samples of muscle using the GeneElute DNA miniprep Kit (Sigma, St. Louis, Missouri, United States), following the manufacturer's protocols. DNA extracts were resuspended in 10 μl of H2O, and a 749-bp region near the 5′ terminus of the COI gene was amplified using primers (BirdF1-TTCTCCAACCACAAAGACATTGGCAC and BirdR1-ACGTGGGAGATAATTCCAAATCCTG). In cases where this primer pair failed, an alternate reverse primer (BirdR2-ACTACATGTGAGATGATTCCGAATCCAG) was generally combined with BirdF1 to generate a 751-bp product, but a third reverse primer (BirdR3-AGGAGTTTGCTAGTACGATGCC) was used for two species of Falco. The 50-μl PCR reaction mixes included 40 μl of ultrapure water, 1.0 U of Taq polymerase, 2.5 μl of MgCl2, 4.5 μl of 10× PCR buffer, 0.5 μl of each primer (0.1 mM), 0.25 μl of each dNTP (0.05 mM), and 0.5–3.0 μl of DNA. The amplification regime consisted of 1 min at 94 °C followed by 5 cycles of 1 min at 94 °C, 1.5 min at 45 °C, and 1.5 min at 72 °C, followed in turn by 30 cycles of 1 min at 4 °C, 1.5 min at 51 °C, and 1.5 min at 72 °C, and a final 5 min at 72 °C. PCR products were visualized in a 1.2% agarose gel. All PCR reactions that generated a single, circa 750-bp, product were then cycle sequenced, while gel purification was used to recover the target gene product in cases where more than one band was present. Sequencing reactions, carried out using Big Dye v3.1 and the BirdF1 primer, were analyzed on an ABI 377 sequencer. The electropherogram and sequence for each specimen are in the “Birds of North America” file, but all sequences have also been deposited in GenBank (see Supporting Information). COI sequences were recovered from all 260 bird species and did not contain insertions, deletions, nonsense, or stop codons, supporting the absence of nuclear pseudogene amplification (Pereira and Baker 2004). In addition to 429 newly collected sequences, nine GenBank sequences from five species were included (these were the only full-length COI sequences corresponding to species in this study).

Sequence divergences were calculated using the K2P distance model (Kimura 1980). A NJ tree of K2P distances was created to provide a graphic representation of the patterning of divergences among species (Saitou and Nei 1987).

Supporting Information

Figure S1. Birds Appendix.

Complete NJ tree based on K2P distances at COI for 437 sequences from 260 species of North American birds. Entries marked with an asterisk represent COI sequences from GenBank.

(100 KB PDF).

Accession Numbers

Sequences described in Materials and Methods have been deposited in GenBank under accession numbers AY666171 to AY666596.

Acknowledgments

Funding for this study was provided by grants from NSERC, the Canada Research Chairs program, and the Canadian Wildlife Service to PDNH. We express our particular appreciation to Allan Baker, Jon Barlow, and Mark Peck for providing access to specimens held in the Tissue Collection at the Royal Ontario Museum. We thank Heather Cole and Angela Holliss for assistance with DNA sequencing, Sujeevan Ratnasingham and Rob Dooh for assistance with data analysis, and Ian Smith for aid with graphics. Finally, we thank Jesse Ausubel, Teri Crease, Carla Dove, Jeremy deWaard, Dan Janzen, David Thaler, Paul Waggoner, Jonathan Witt, and four anonymous reviewers for their comments on earlier drafts of the manuscript.

Abbreviations

COI

cytochrome c oxidase I

K2P

Kimura-2 parameter

mtDNA

mitochondrial DNA

NJ

neighbor joining

Conflicts of interest. The authors have declared that no conflicts of interest exist.

Author contributions. PDNH conceived and designed the experiments. TSZ performed the experiments. PDNH, MYS, and TSZ analyzed the data. CMF contributed reagents/materials/analysis tools. PDNH, MYS, and CMF wrote the paper.

Academic Editor: Charles Godfray, Imperial College, Silwood Park

Citation: Hebert PDN, Stoeckle MY, Zemlak TS, Francis CM (2004) Identification of birds through DNA barcodes. PLoS Biol 2(10): e312.

References

  1. Ankney CD, Dennis DG, Wishard LN, Seeb JE. Low genic variation between Black Ducks and Mallards. Auk. 1986;103:701–709. [Google Scholar]
  2. [AOU] The American Ornithologists' Union. Lawrence, Kansas: AOU Press; 1998. Check-list of North American birds, 7th edition; 829 pp. [Google Scholar]
  3. Avise JC, Walker D. Pleistocene phylogeographic effects on avian populations and the speciation process. Proc R Soc Lond B Bio Sci. 1998;265:457–463. doi: 10.1098/rspb.1998.0317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Avise JC, Walker D. Species realities and numbers in sexual vertebrates: Perspectives from an asexually transmitted genome. Proc Natl Acad Sci U S A. 1999;96:992–995. doi: 10.1073/pnas.96.3.992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Avise JC, Zink RM. Molecular genetic divergence between avian sibling species: King and Clapper rails, Long-billed and Short-billed dowitchers, Boated-tailed and Great-tailed grackles, and Tufted and Black-crested titmice. Auk. 1988;105:516–528. [Google Scholar]
  6. Avise JC, Arnold J, Ball RM, Bermingham E, Lamb T, et al. Intraspecific phylogeography: The mitochondrial bridge between population genetics and systematics. Annu Rev Ecol Syst. 1987;18:489–522. [Google Scholar]
  7. Avise JC, Ankney CD, Nelson WS. Mitochondrial gene trees and the evolutionary relationship of Mallard and Black Ducks. Evolution. 1990;44:1109–1119. doi: 10.1111/j.1558-5646.1990.tb03829.x. [DOI] [PubMed] [Google Scholar]
  8. Banks RC, Cicero C, Dunn JL, Kratter AW, Ouellet H, et al. Forty-second supplement to the Ornithologists' Union checklist of North American birds. Auk. 2000;117:847–858. [Google Scholar]
  9. Banks RC, Cicero C, Dunn JL, Kratter AW, Rasmussen PC, et al. Forty-third supplement to the Ornithologists' Union checklist of North American birds. Auk. 2002;119:897–906. [Google Scholar]
  10. Banks RC, Cicero C, Dunn JL, Kratter AW, Rasmussen PC, et al. Forty-fourth supplement to the Ornithologists' Union checklist of North American birds. Auk. 2003;120:923–931. [Google Scholar]
  11. Bates JM, Hackett SJ, Goerck JM. High levels of mitochondrial DNA differentiation in two lineages of Antbirds (Drymophila and Hypocnemis) . Auk. 1999;116:1093–1106. [Google Scholar]
  12. Bensasson D, Zhang D, Hartl DL, Hewitt GM. Mitochondrial pseudogenes: Evolution's misplaced witnesses. Trends Ecol Evol. 2001;16:314–321. doi: 10.1016/s0169-5347(01)02151-6. [DOI] [PubMed] [Google Scholar]
  13. Blaxter M. The promise of a DNA taxonomy. Philos Trans R Soc Lond B Biol Sci. 2004;359:769–779. doi: 10.1098/rstb.2003.1447. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Brown WM, George M, Wilson AC. Rapid evolution of animal mitochondrial DNA. Proc Natl Acad Sci U S A. 1979;76:1967–1971. doi: 10.1073/pnas.76.4.1967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cooke F, Rockwell RF, Lank DB. Oxford: Oxford University Press; 1995. The snow geese of La Perouse Bay: Natural selection in the wild; 320 pp. [Google Scholar]
  16. Crochet PA, Lebreton JD, Bonhomme F. Systematics of large white-headed gulls: Patterns of variation in Western European taxa. Auk. 2002;119:603–620. [Google Scholar]
  17. Crochet PA, Chen JZ, Pons JM, Lebreton JD, Hebert PDN, et al. Genetic differentiation at nuclear and mitochondrial loci among large white-headed gulls: Sex biased interspecific gene flow? Evolution. 2003;57:2865–2878. doi: 10.1111/j.0014-3820.2003.tb01527.x. [DOI] [PubMed] [Google Scholar]
  18. Dove C. A descriptive and phylogenetic analysis of plumulaceous feather characters in Charadriiformes . Ornith Monogr. 2000;51:1–163. [Google Scholar]
  19. Funk DJ, Omland KE. Species-level paraphyly and polyphyly: Frequency, causes and consequences, with insights from animal mitochondrial DNA. Annu Rev Ecol Evol Syst. 2003;34:397–423. [Google Scholar]
  20. Gill FB, Slikas B. Patterns of mitochondrial DNA divergence in North American crested titmice. Condor. 1992;94:20–28. [Google Scholar]
  21. Godfrey WE. Ottawa: National Museums of Canada; 1976. Birds of Canada; 428 pp. [Google Scholar]
  22. Grant WS, Bowen BW. Shallow population histories in deep evolutionary lineages of marine fishes: Insights from sardines and anchovies and lessons for conservation. J Hered. 1998;89:415–427. [Google Scholar]
  23. Guglich E, Wilson PJ, White BN. Forensic application of repetitive DNA markers to the species identification of animal tissues. J Forensic Sci. 1994;39:353–361. [PubMed] [Google Scholar]
  24. Hackett SJ, Rosenberg KV. Comparison of phenotypic and genetic differentiation in South American antwrens (Formicariidae) . Auk. 1990;107:473–489. [Google Scholar]
  25. Hebert PDN, Cywinska A, Ball SL, DeWaard JR. Biological identifications through DNA barcodes. Proc R Soc Lond B Biol Sci. 2003a;270:313–321. doi: 10.1098/rspb.2002.2218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hebert PDN, Ratnasingham S, DeWaard JR. Barcoding animal life: Cytochrome c oxidase subunit 1 divergences among closely related species. Proc R Soc Lond B Biol Sci. 2003b;270:S596–S599. doi: 10.1098/rsbl.2003.0025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hogg ID, Hebert PDN. Biological identification of springtails (Hexapoda: Collembola) from the Canadian arctic, using DNA barcodes. Can J Zool. 2004;82 In press. [Google Scholar]
  28. Holder M, Lewis PO. Phylogeny estimation: Traditional and Bayesian approaches. Nat Rev Genet. 2003;4:275–284. doi: 10.1038/nrg1044. [DOI] [PubMed] [Google Scholar]
  29. Janzen DH. Now is the time. Philos Trans R Soc Lond B Biol Sci. 2004;359:731–732. doi: 10.1098/rstb.2003.1444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Jehl JR Jr. Hybridization and evolution of oystercatchers on the Pacific Coast of Baja California. Neotrop Ornith AOU Monograph. 1985;36:484–504. [Google Scholar]
  31. Johns GC, Avise JC. A comparative summary of genetic distances in the vertebrates from the mitochondrial cytochrome b gene. Mol Biol Evol. 1998;15:1481–1490. doi: 10.1093/oxfordjournals.molbev.a025875. [DOI] [PubMed] [Google Scholar]
  32. Kimura M. A simple method for estimating evolutionary rate of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980;16:111–120. doi: 10.1007/BF01731581. [DOI] [PubMed] [Google Scholar]
  33. Klicka J, Zink RM. The importance of recent ice ages in speciation: A failed paradigm. Science. 1997;277:1666–1669. [Google Scholar]
  34. Lee JH, Hassan H, Hill G, Cupp EW, Higazi TB, et al. Identification of mosquito avian-derived blood meals by polymerase chain reaction-heteroduplex analysis. Am J Trop Med Hyg. 2002;66:599–604. doi: 10.4269/ajtmh.2002.66.599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Madge S, Burn H. Princeton: Princeton University Press; 1994. Crows and Jays; 216 pp. [Google Scholar]
  36. Mayr E, Short LL. Cambridge, Massachusetts: Nuttall Ornithological Club; 1970. Species taxa of North American birds: A contribution to comparative systematics; 127 pp. [Google Scholar]
  37. Michael E, Ramaiah KD, Hoti SL, Barker G, Paul MR, et al. Quantifying mosquito biting patterns on humans by DNA fingerprinting of bloodmeals. Am J Trop Med Hyg. 2001;65:722–728. doi: 10.4269/ajtmh.2001.65.722. [DOI] [PubMed] [Google Scholar]
  38. Mila B, Girman DJ, Kimura M, Smith TB. Genetic evidence for the effect of a postglacial population expansion on the phylogeography of a North American songbird. Proc R Soc Lond B Biol Sci. 2000;267:1033–1040. doi: 10.1098/rspb.2000.1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Miller EH. Acoustic differentiation and speciation in shorebirds. In: Kroodsma DE, Miller EH, editors. Ecology and evolution of acoustic communication in birds. Ithaca: Comstock/Cornell University Press; 1996. pp. 241–257. [Google Scholar]
  40. Mindell DP, Sorenson MD, Huddleston CJ, Miranda HC, Knight A, et al. Phylogenetic relationships among and within select avian orders based on mitochondrial DNA. In: Mindell DP, editor. Avian molecular evolution and systematics. New York: Academic Press; 1997. pp. 214–247. [Google Scholar]
  41. Moore WS. Inferring phylogenies from mtDNA variation: Mitochondrial-gene trees versus nuclear-gene trees. Evolution. 1995;49:718–726. doi: 10.1111/j.1558-5646.1995.tb02308.x. [DOI] [PubMed] [Google Scholar]
  42. Murray BW, McGillivray WB, Barlow JC, Beech RN, Strobeck C. The use of cytochrome B sequence variation in estimation of phylogeny in the Vireonidae . Condor. 1994;96:1037–1054. [Google Scholar]
  43. Omland KE, Marzluff J, Boarman W, Tarr CL, Fleischer RC. Cryptic genetic variation and paraphyly in ravens. Proc R Soc Lond B Biol Sci. 2000;267:2475–2482. doi: 10.1098/rspb.2000.1308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Palumbi SR, Cipriano F. Species identification using genetic tools: The value of nuclear and mitochondrial gene sequences in whale conservation. J Hered. 1998;89:459–464. doi: 10.1093/jhered/89.5.459. [DOI] [PubMed] [Google Scholar]
  45. Pereira SL, Baker AJ. Low number of mitochondrial pseudogenes in the chicken (Gallus gallus) nuclear genome: Implications for molecular inference of population history and phylogenetics. BMC Evol Biol. 2004;4:17. doi: 10.1186/1471-2148-4-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Phuc HK, Ball AJ, Son L, Hahn NV, Tu ND, et al. Multiplex PCR assay for malaria vector Anopheles minimus and four related species in the Myzomyia series from Southeast Asia. Med Vet Entomol. 2003;17:423–428. doi: 10.1111/j.1365-2915.2003.00462.x. [DOI] [PubMed] [Google Scholar]
  47. Ryan PG, Bloomer P. The Long-Billed Lark complex: A species mosaic in southwestern Africa. Auk. 1999;116:194–208. [Google Scholar]
  48. Saitou N, Nei M. The neighbour-joining method: A new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4:406–425. doi: 10.1093/oxfordjournals.molbev.a040454. [DOI] [PubMed] [Google Scholar]
  49. Sibley CG, Monroe BL. New Haven: Yale University Press; 1990. Distribution and taxonomy of birds of the world; 1111 pp. [Google Scholar]
  50. Sorenson MD, Quinn TW. Numts: A challenge for avian systematics and population biology. Auk. 1998;115:214–221. [Google Scholar]
  51. Stoeckle M. Taxonomy, DNA and the bar code of life. BioScience. 2003;53:2–3. [Google Scholar]
  52. Symondson WOC. Molecular identification of prey in predator diets. Mol Ecol. 2002;83:137–147. doi: 10.1046/j.1365-294x.2002.01471.x. [DOI] [PubMed] [Google Scholar]
  53. Thalmann O, Hebler J, Poinar HN, Paabo S, Vigilant L. Unreliable mtDNA data due to nuclear insertions: A cautionary tale from analysis of humans and other great apes. Mol Ecol. 2004;13:321–335. doi: 10.1046/j.1365-294x.2003.02070.x. [DOI] [PubMed] [Google Scholar]
  54. Weibel AC, Moore WS. Molecular phylogeny of a cosmopolitan group of woodpeckers (genus Picoides) based on COI and cyt b mitochondrial gene sequences. Mol Phylogenet Evol. 2002;22:65–75. doi: 10.1006/mpev.2001.1010. [DOI] [PubMed] [Google Scholar]
  55. Wells MG. Bushey: Worldlist; 1998. World bird species checklist: With alternative English and scientific names; 671 pp. [Google Scholar]
  56. Wilson EO. The encyclopedia of life. Trends Ecol Evol. 2003;18:77–80. [Google Scholar]
  57. Woese CR. Interpreting the universal phylogenetic tree. Proc Natl Acad Sci U S A. 2000;97:8392–8396. doi: 10.1073/pnas.97.15.8392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Woese CR, Fox GE. Phylogenetic structure of the prokaryotic domain: The primary kingdoms. Proc Natl Acad Sci U S A. 1977;74:5088–5090. doi: 10.1073/pnas.74.11.5088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Zink RM, Blackwell-Rago RC. Species limits and recent population history in the curve-billed thrasher. Condor. 2000;102:881–887. [Google Scholar]
  60. Zink RM, Weckstein JD. Recent evolutionary history of the fox sparrows (genus: Passerella) Auk. 2003;120:522–527. [Google Scholar]
  61. Zink RM, Rohwer S, Andreev AV, Dittmann DL. Trans-Beringia comparisons of mitochondrial DNA differentiation in birds. Condor. 1995;97:639–649. [Google Scholar]
  62. Zink RM, Rohwer S, Drovetski S, Blackwell-Rago RC, Farrell SL. Holarctic phylogeography and species limits of three-toed woodpeckers. Condor. 2002;104:167–170. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1. Birds Appendix.

Complete NJ tree based on K2P distances at COI for 437 sequences from 260 species of North American birds. Entries marked with an asterisk represent COI sequences from GenBank.

(100 KB PDF).


Articles from PLoS Biology are provided here courtesy of PLOS

RESOURCES