Professional Documents
Culture Documents
Correspondence
I
n this issue of PLoS Biology, Hebert concept have ranged from gene a candidate for high-throughput
et al. (2004) have set out to test unbridled enthusiasm, especially DNA barcoding—highly constrained
the resolution and performance from ecologists (Janzen 2004), to amino acid sequence and thus broad
of “DNA barcoding,” using a single outright condemnation, largely from applicability of primers (Hebert et
mtDNA gene, cytochrome c oxidase I taxonomists (e.g., see the February al. 2003)—also limits its information
(COI), for a sample of North American 2003 issue of Trends in Ecology and content at deeper phylogenetic levels
birds. Before turning to details of Evolution). The former view reflects (e.g., Russo et al. 1996; Zardoya and
this study, it is useful as context to a real need to connect different Meyer 1996; Naylor and Brown 1997).
consider the following questions: What life history stages and to increase Finally, while superficially appealing,
is DNA barcoding, and what does it the precision and efficiency of field the very term DNA barcoding is
promise? What is new about it? Why is studies involving diverse and difficult- unfortunate, as it implies that each
it controversial? What are the potential to-identify taxa. The criticisms are species has a fixed and invariant
pitfalls? mainly in response to the view that characteristic—like a barcode on a
Put simply, the intent of DNA single-gene sequences should be supermarket product. As evolutionary
barcoding is to use large-scale the primary identifier for species biologists, we should question this
screening of one or a few reference (“DNA taxonomy”; Tautz et al. 2002; analogy.
genes in order to (i) assign unknown see also Blaxter 2004). At least for In evaluating the promise and
individuals to species, and (ii) the macrobiota, the DNA barcoding pitfalls of DNA barcoding, we need to
enhance discovery of new species community has moved away from separate the two areas of application:
(Hebert et al. 2003; Stoeckle 2003). this to emphasize the importance of molecular diagnostics of individuals
Proponents envisage development of a embedding any large-scale sequence relative to described taxa, and DNA-
comprehensive database of sequences, database within the existing framework led discovery of new species. Both are
preferably associated with voucher and practice of systematics, including inherently phylogenetic and rely on a
specimens representing described the importance of voucher specimens solid taxonomic foundation, including
species, against which sequences and of integrating molecular with adequate sampling of variation within
from sampled individuals can be morphological characters. Another species and inclusion of all previously
compared. Given the long history point of contention—that DNA
of use of molecular markers (e.g., barcodes have limited phylogenetic Citation: Moritz C, Cicero C (2004) DNA barcoding:
Promise and pitfalls. PLoS Biol 2(10): e354.
allozymes, rDNA, and mtDNA) for resolution—arises from confusion
these purposes (Avise 2004), there is about the scope of inference. At best, Copyright: © 2004 Craig Moritz and Carla Cicero. This
is an open-access article distributed under the terms
nothing fundamentally new in the DNA single-gene assays can hope to identify of the Creative Commons Attribution License, which
barcoding concept, except increased an individual to species or reveal permits unrestricted use, distribution, and reproduc-
scale and proposed standardization. inconsistencies between molecular tion in any medium, provided the original work is
properly cited.
The former is inevitable. variation and current perceptions of
Standardization, i.e., the selection species boundaries. DNA barcoding Abbreviations: COI, cytochrome c oxidase I
of one or more reference genes, should not be confused with efforts
Academic Editor: Charles Godfray, Imperial College
is of proven value in the microbial to resolve the “tree of life.” It should
community and in stimulating large- connect with and benefit from such Craig Moritz and Carla Cicero are at the Museum of
Vertebrate Zoology, University of California, Berkeley,
scale phylogenetic analyses, but projects, but resolving phylogeny at California, United States of America.
whether “one gene fits all” is open to scales from species to major eukaryotic
debate. clades requires a very different strategy *To whom correspondence should be addressed.
E-mail: cmoritz@socrates.berkeley.edu
Why, then, all the fuss? Initial for selecting genes. Indeed, the very
reactions to the DNA barcoding characteristic that makes the COI DOI: 10.1371/journal.pbio.0020354
PLoS Biology | www.plosbiology.org 1529 October 2004 | Volume 2 | Issue 10 | e279 | e354
described extant species within a given focusing on mtDNA (or any other analysis seeks to address both potential
genus. Accurate diagnosis depends on molecular) divergence as a primary applications of DNA barcoding.
low intraspecific variation compared criterion for recognizing species is Although Herbert et al. sampled a
with that between species, such that that it will lead us to overlook new or large number of species, a true test
a short DNA sequence will allow rapidly diverged species, such as might of the precision of mtDNA barcodes
precise allocation of an individual arise through divergent selection or to assign individuals to species would
to a described taxon. The extensive polyploidy, and thus to conclude that include comparisons with sister
literature on mtDNA phylogeography speciation requires long-term isolation. species—the most closely related extant
(Avise 2000) indicates that this For example, a recent mtDNA analysis relatives. This would require that all
condition often holds, although of North American birds (Johnson and members of a genus be examined,
there are exceptions. Furthermore, Cicero 2004) showed that numerous rather than a random sample of
within many species there is sufficient avian species have low divergences and imprecisely defined close relatives, and
structure that it will be possible to that speciation can occur relatively that taxa be included from more than
allocate an individual to a particular rapidly under certain circumstances. one geographic region. Johnson and
geographic population. Such We contend, therefore, that whereas Cicero (2004) showed the importance
identifications should be accompanied divergent or discordant mtDNA of comparing sister species when
by a statement of confidence—e.g., sequences might stimulate taxonomic examining genetic divergence values in
node support in a phylogenetic analysis reassessment based on nuclear genes North American birds, with results that
and caveats in relation to the breath as well as morphology, ecology, or contrast strongly with those of Hebert
of sampling in the reference database behavior, mtDNA divergence is et al. as well as previous studies (e.g.,
(e.g., whale forensics; Palumbi and neither necessary nor sufficient as a Klicka and Zink 1997). For 39 pairs of
Cipriano 1998). criterion for delineating species. This avian sister species, mtDNA sequence
DNA-led species discovery is more view accords with existing practice: divergences ranged from 0.0% to 8.2%,
contentious, but again is not new. taxonomic splits in North American with an average of 1.9% (cf. 7% to 8%
In animals, inclusion of mtDNA birds typically are based on multiple among closely related species in Hebert
evidence in biogeographic and lines of biological evidence, e.g., et al.). Of these, 29 pairs (74%) are at
systematic analyses often reveals morphological and vocal differences or below the 2.7% threshold proposed
unexpected diversity or discordance as well as genetic data (American by Herbert et al. and thus would
with morphology, which then prompts Ornithologists’ Union 1998). not be recognized as species despite
re-evaluation of morphological and We turn now to the core of Hebert biological differences. Moreover,
ecological characteristics and, if et al.’s paper—COI sequencing of a although only a few of these 39 pairs
warranted, taxonomic revision. But, substantial sample of North American (see Table 1 in Johnson and Cicero
despite recent proposals (Wiens birds (260 of 667 species) and its [2004]) had sufficient sampling to
and Penkrot 2002; Hebert et al. validity as a test of the barcoding assess intraspecific variation in mtDNA
2004), it does not follow that mtDNA concept. Their aim is to test “the sequences, these typically showed
divergence should be a primary correspondence between species paraphyly in mtDNA haplotypes.
criterion for recognizing species boundaries signaled by COI barcodes Therefore, there are still too few cases
boundaries (see also Sites and and those established by prior with adequate sampling of intraspecific
Marshall 2003). Potential limitations taxonomic research.” North American diversity for sister species pairs to know
of using mtDNA to infer species birds are an interesting choice because how common paraphyly is, although
boundaries include retention of their species-level taxonomy is relatively a recent meta-analysis found that
ancestral polymorphism, male-biased well resolved and there has been 17% of bird species deviated from
gene flow, selection on any mtDNA extensive previous analysis of levels of mtDNA monophyly (Funk and Omland
nucleotide (as the whole genome is mtDNA sequence divergence within 2003). Collectively, these observations
one linkage group), introgression and among described species (Klicka cast doubt on the precision of DNA
following hybridization, and paralogy and Zink 1997; Avise and Walker 1998; barcoding for allocating individuals to
resulting from transfer of mtDNA Johnson and Cicero 2004). Herbert previously described avian species.
gene copies to the nucleus. These are et al. (2004) found differences in COI Empidonax flycatchers, which are
acknowledged by Hebert et al. (2004) sequences “between closely related renowned for their morphological
and well documented in the literature species” that were 19–24 times greater similarity and could thereby benefit
(Bensasson et al. 2001; Ballard and in magnitude than the differences from DNA-based identification tools,
Whitlock 2004), including that on within species (7.05%–7.93% versus provide an example of the importance
birds (Degnan 1993; Quinn and 0.27%–0.43%, respectively). From of a more detailed analysis. A complete
White 1987; Lovette and Bermingham these data, they conclude that most molecular phylogeny for this group
2001; Weckstein et al. 2001). More North American bird species can be (Johnson and Cicero 2002) yielded
specifically, using some level of mtDNA discriminated via molecular diagnosis distances between four pairs of sister
divergence as a yardstick for species of individuals and propose a “standard species that ranged from 0.7% (E.
boundaries ignores the low precision sequence threshold” of ten times difficilis versus E. occidentalis) to 4.6%
with which coalescence of mtDNA the mean intraspecific variation (E. traillii versus E. alnorum); notably,
predicts phylogenetic divergence at (yielding a 2.7% threshold in birds) the genetic distance between mainland
nuclear genes (Hudson and Turelli to flag genetically divergent taxa as and island populations of E. difficilis
2003). An additional problem with “provisional species.” Thus, their (E. d. difficilis and E. d. insulicola, 0.9%)