The Barcode Blog

A mostly scientific blog about short DNA sequences for species identification and discovery. I encourage your commentary. -- Mark Stoeckle

Subscribe to this blog

Sign up for email notifications

Archive for October, 2008

Genetics is essential framework for microbiology, eukaryotes next?

Friday, October 24th, 2008

Robert Koch (1842-1910), father of medical microbiology, isolated agents of mankind’s major plagues: Vibrio cholera, Bacillus anthracis (anthrax and bubonic plague), and Mycobacterium tuberculosis. He laid down four conditions, “Koch’s postulates“, for establishing that an organism is the agent of disease, and subsequent generations of researchers applied these principles to determine the etiology of a multitude of infectious diseases. One legacy of Koch’s postulates was that isolation of organisms in pure culture became the backbone of diagnostic and research microbiology.

A century later, genetics has replaced culture as the essential framework for exploring microbial life. Metagenomic analysis of environmental samples, including from anatomic sites, has identified an unsuspected plethora of organisms, most of which are unculturable, at least under standard laboratory conditions. Even for organisms that can be grown in the laboratory, genetic detection is often the preferred diagnostic method, including for example detection of HIV, Neisseria gonorrhea, and Chlamydia sp.  Following Carl Woese’s early lead (PNAS 1977, 74:5088), microbiologists have generally included a standard locus, 16s rRNA, in genetic work, enabling phylogenetic trees spanning the diversity of life, and allowing each new isolate to be analyzed in conjunction with the work of others (as of 24 oct 2008, 75,257 16S rRNA sequences in GenBank).

Are genetic methods equally necessary for eukaryotes? In October 2008 Mol Ecol researchers from Cardiff University analyze mitochondrial COI differences among nine species of British lumbricid earthworms which were first described between 1758 and 1843, over 150 years ago. Partial COI sequences (a 582 bp segment which overlaps 648 bp DNA barcode region) from 71 individuals showed 2-5 deeply divergent clusters (average 13-15% sequence difference) in 4 of the 8 multiply-sampled species, and small divergences within each cluster, “indicative of the presence of multiple previously undescribed species”.  COI sequences from 270 individuals of one species, Allolobophora chlorotica, collected at 24 British and 5 mainland European sites showed 5 divergent clusters and surprisingly no clear geographic distribution pattern; over half the sites had 2 or more lineages, and one site had 4 lineages. As expected the same clusters were found by comparing another mitochondrial gene, 16s rRNA. Two of the lineages were found only in green color morphs; prior work indicated this form has distinct ecological preferences compared to pink morph Allo. chlorotica and that F1 hybrids are sterile, suggesting species status. As an aside, if earthworm specialists find morphological and ecological differences and mating incompatibility, why not designate as distinct species? As another example, two forms of European corn borer Ostrinia nubilalis are sympatric, genetically distinct, develop on different host plants, have different mating pheromones, and exhibit >95% reproductive isolation, yet are described as “host races” rather than separate species (Science 2005, 308:258). It sometimes seems there is an arbitrary aspect of how species status is awarded, or perhaps the process is slow.  

To see if mtDNA clusters were also reflected in nuclear genome, King et al performed AFLP (amplified fragment length polymorphism) mapping on 4-12 individuals from each of the 5 lineages. The nuclear results corresponded exactly to COI clusters except that the 2 green morph forms could not be distinguished, suggesting these are either a single interbreeding species (despite 14% mtCOI sequence difference!) or are young species which have not yet accumulated differences in nuclear DNA. It is hard to see how a 14% sequence difference could accumulate in mtDNA without accompanying nuclear changes, so I wonder if one of the genetic forms might reflect a relatively recent introgression from another earthworm species which has not yet been sequenced. It will be interesting to see whether the two green morph lineages, which were often found together at the same site, show assortative mating or restricted fertility. The authors conclude “extraordinary species-level genetic diversity was revealed among the British earthworms”….”four of nine ecologically generalist earthworms are probably complexes of multiple cryptic species”. And finally “further earthworm research in areas such as ecology and ecotoxicology, should be conducted in the knowledge that there are multiple cryptic species within many earthworm species”.

I conclude that genetics is equally essential for eukaryotic taxonomy as for microbiology. I believe there is no getting around the need to genetically reexamine most or all of the species named in the past 200 years to see if what we recognize as single and distinct species are really so. If there can be cryptic species in large visible animals such as birds, and males and females can be given different species names in fish, then there must be many more such oversights among the less easily observed. A standardized approach (ie DNA barcoding) is the most expeditious way forward and will leave a permanent marked trail that can easily be followed by non-experts who wish to identify their specimens. As in bacteria, standardizing on a single locus (ie barcode region COI for animals) enables new work to be seamlessly combined with old, leveraging its value (497,851 barcode records from 48,459 species in BOLD so far). Regarding higher-level evolutionary relationships, I find routine dismissal based on mathematical modeling of mtDNA single-locus trees, but not much effort to see what the potential is. Perhaps translated amino acid sequences and/or GC content can be informative for deeper branches, and nucleotide sequences for family- and generic-level relationships. At the very least, mtDNA trees serve to generate hypotheses, which can be  corroborated or disproved by more extensive genetic, morphologic, ecologic, behavioral, or fossil record data.

Building DNA libraries: one-quarter world birds so far

Sunday, October 12th, 2008

As of October 11, 2008 researchers have deposited 14,594 DNA barcodes in BOLD representing 2,586 avian species, 26% of world’s 9,933 birds. You can browse taxonomic coverage to date at All Birds Barcoding Initiative (ABBI) and BOLD taxonomy browser sites. Coverage includes representatives of all 27 orders and 159 families of world birds and nearly half of avian genera [1,014/2,101 (48%)]. To uncover possible hidden diversity, most researchers are sampling species across their geographic ranges rather than focusing on named subspecies, many or most of which appear to represent clinal variation (see for example Zink 2004Phillimore and Owens 2006).

How far along are researchers toward mapping COI barcode resolution of avian species? Birds are of particular interest because species limits are generally well-defined, supported by a wealth of morphologic, ecological, behavioral, and other genetic data. Looked at regionally, there is good coverage in northern North America, parts of Central and South America, western Europe, Korea, and New Zealand, so it should be possible to see how well COI barcodes distinguish among local species in these areas. Published studies so far show >95% resolution of named species and have identified genetically divergent clusters which may represent unrecognized cryptic or “hidden” species (Vilaca et al 2006, Yoo et al 2006, Nyari 2007Kerr et al 2007). As an aside, “cryptic” is an awkward term for genetically divergent populations of birds since most of these have diagnostic differences in morphology or behavior; “hidden” is more accurate to my ear. On a separate note, COI surveys have regularly revealed misidentified voucher specimens of birds, suggesting routine application of DNA barcode analysis could enhance quality of avian collections. 

Looked at globally, there is 100% coverage of 104 polytypic genera (having 2 or more species) representing 324 birds, so this should include the sister species and/or “nearest neighbors” for these, plus there are 853 monotypic genera (having only 1 species) in world birds, which are likely or known to be genetically divergent from birds in other genera. In addition, there are likely many other sister species or “nearest neighbors” within the remaining 1,982 birds with DNA barcodes so far (for example, BOLD includes 28 of 29 Dendroica sp wood warblers). It would be interesting to look at the nearest neighbor differences within the global data set. To my knowledge, comparisons among regions with COI barcode data have been not been published. My impression based on other avian genetic work is that named taxa in different biogeographic regions are genetically distinct, plus there are many unrecognized genetic divisions within species that range across biogeographic regions. I look forward to trans-regional and global comparisons!

New places to find DNA

Monday, October 6th, 2008

In 29 july 2008 Fish Biology scientists from Macquarie University, Sydney describe successful recovery of mitochondrial DNA from contemporary and historical shark teeth and jaws. After developing the method on 11 recently collected teeth from Gray nurse shark Carcharias taurus and Ornate wobbegong (excellent name!) Orectolobus halei, Ahonen and Stowe applied it to 20-40 year old museum specimens, including 5 jaws from 3 species and 19 individual teeth from 2 species. They collected approximately 0.02-0.06 g of “tooth powder” by drilling several small holes into a tooth or jaw; DNA was extracted using a standard silica-based method or Qiagen DNAeasy tissue kit.

The authors are interested in historical population sizes for sharks; following the theory that genetic variation within species is an indicator of population size, they picked the hypervariable control region as their target. As an aside, results so far with mitochondrial surveys including DNA barcoding generally show very low variation within most animal species and no relationship between intraspecific variation and census population size. In any case, a 700 bp fragment of mtDNA control region was amplified with a single pair of primers. The two extraction methods gave similar results.  DNA was amplified and sequenced from 100% of the contemporary samples and 15/34 (44%) historical samples. 700 bp is a relatively long sequence to amplify from historical samples, suggesting it may be possible to obtain standard COI barcodes (648 bp) from museum skeletons of sharks and bony fish, which would be particularly useful for those species which are rare or otherwise difficult to collect. A standard set of fish primers (see for example Hubert et al June 2008 PLoS ONE) amplifies COI barcode region from most fish (more than 5,000 species so far, including including representatives of all major divisions of Chondrichthyes (cartilaginous fish) and Osteichthyes (bony fish), both marine and freshwater).

To date most fish specimens are preserved in formaldehyde, which makes routine DNA recovery difficult or impossible. If DNA can be recovered from skeletons, there are many museum specimens that might be used. For example, the American Museum of Natural History Icthyology Department collection includes over 35,000 fish skeletons as compared to about 2,500 tissue samples so far.


Contact: mark.stoeckle@rockefeller.edu

About this site

This web site is an outgrowth of the Taxonomy, DNA, and Barcode of Life meeting held at Banbury Center, Cold Spring Harbor Laboratory, September 9-12, 2003. It is designed and managed by Mark Stoeckle, Perrin Meyer, and Jason Yung at the Program for the Human Environment (PHE) at The Rockefeller University.

About the Program for the Human Environment

The involvement of the Program for the Human Environment in DNA barcoding dates to Jesse Ausubel's attendance in February 2002 at a conference in Nova Scotia organized by the Canadian Center for Marine Biodiversity. At the conference, Paul Hebert presented for the first time his concept of large-scale DNA barcoding for species identification. Impressed by the potential for this technology to address difficult challenges in the Census of Marine Life, Jesse agreed with Paul on encouraging a conference to explore the contribution taxonomy and DNA could make to the Census as well as other large-scale terrestrial efforts. In his capacity as a Program Director of the Sloan Foundation, Jesse turned to the Banbury Conference Center of Cold Spring Harbor Laboratory, whose leader Jan Witkowski prepared a strong proposal to explore both the scientific reliability of barcoding and the processes that might bring it to broad application. Concurrently, PHE researcher Mark Stoeckle began to work with the Hebert lab on analytic studies of barcoding in birds. Our involvement in barcoding now takes 3 forms: assisting the organizational development of the Consortium for the Barcode of Life and the Barcode of Life Initiative; contributing to the scientific development of the field, especially by studies in birds, and contributing to public understanding of the science and technology of barcoding and its applications through improved visualization techniques and preparation of brochures and other broadly accessible means, including this website. While the Sloan Foundation continues to support CBOL through a grant to the Smithsonian Institution, it does not provide financial support for barcoding research itself or support to the PHE for its research in this field.