The Barcode Blog

A mostly scientific blog about short DNA sequences for species identification and discovery. I encourage your commentary. -- Mark Stoeckle

Subscribe to this blog

Sign up for email notifications

Archive for March, 2009

DNA sorts out bewildering morphology

Tuesday, March 31st, 2009

DNA helps flag genetically divergent forms that may represent cryptic species and is equally valuable the other way around: in linking morphologically diverse forms that occur within species. In 20 jan 2009 Biol Lett, researchers from National Museum of Natural History, Washington, DC; Australian Museum, Sydney; Virginia Institute of Marine Science; University of Tokyo; and Natural History Museum, Tokyo, solve the mystery of “the most extreme example of ontogenetic morphoses and sexual dimorphism in vertebrates.”

Johnson and colleagues examined specimens of small (body size 4-408 mm) deep water (1000-4000m) fishes thought to represent 3 families in the order Stephanoberyciformes (whalefish and relatives). The authors analyzed morphology and whole mitochondrial genomes from 34 individuals of 16 species including representatives of all 5 whalefish families. They found three whalefish “families” are one: “Mirapinnidae (tapetails), Megalomycteridae (bignose fishes), and Cetomimidae (whalefishes), are larvae, males and females, respectively of a single family Cetomimidae.” These are strange-looking fish–the males, which do not feed as adults, are sustained by enormous livers, and the minute larvae have streamers up to 75 cm. For fun, see deep ocean video of live female whalefish swimming (and narration of the amazed icthyologists) in supplementary material. Next up is to link the three life stages of each species; here DNA will help along with meristic data (quantitative features such as number of fins or scales).

Why do mitochondria differ among species?

Thursday, March 19th, 2009

Mitochondria are the power plants of the cell, consuming oxygen and breakdown products of sugars, amino acids, and fatty acids to produce energy as ATP and heat.  As originally proposed by Lynn Margulis in 1967, mitochondria, which have their own circular genome and replicate independently of the cell, are derived from an ancient symbiosis of an an alpha-proteobacterium related to gram-negative bacteria. 

In multicellular animals, most of the 100+ proteins in mitochondria are encoded by nuclear genes. The mitochondrial genome is only about 16 kb (vs about 2000 kb for nearest bacterial relatives Rickettsia sp) and encodes just 13 proteins, all of which function in the electron transport chain, plus 2 ribosomal RNAs and approximately 20 tRNAs.  Multiple protein-protein interactions (for example, complex I comprises 34 nuclear-encoded and 7 mitochondrial-encoded proteins) suggest there must be close co-evolution between nuclear and mitochondrial genomes; this might be one of the constraints on mitochondrial variation. Although an enormous amount of information on mitochondrial sequence differences among and within species has been been compiled (through DNA barcoding initiative and other efforts), there is surprisingly little study so far on whether mitochondrial differences among species reflect functional adaptation (although see Ruiz-Pesini et al 2004 Science 303:223, Bayona-Balfaluy et al 2004 Mol Biol Evol 22:716).  

In 25 February 2009 Proc R Soc B researchers from University of Groningen, The Netherlands; Max Planck Institute for Ornithology, Germany; and Ohio State University investigate whether mitochondrial differences modulate energy metabolism in birds. As mitochondria consume 90% of respired oxygen, mitochondrial activity presumably determines basal metabolism. Tieleman and colleagues performed crosses among 3 captive bred populations of stonechats (Saxicola torquata spp.) that differ in basal metabolic rate, which presumably reflects adaptation to different climates: Africa (Kenya, Saxicola torquata axillaris), Asia (Kazakhstan, Saxicola torquata maura), and Europe (Austria, Saxicola torquata rubicola). As an aside, I note that these three taxa are elevated to species status in current world checklists (Clements 2007, IOC Checklist; even 1992 edition of Birds of Europe notes “Siberian race may be a full species.”) This does not change the interpretation of the findings, but it does reflect the confused nature of taxonomic science that even for a group as well studied as birds, publication standards accept this laxity in taxonomic classification. Naming of bacteria in medical studies is more uniformly up to date than for multicellular animals; it seems that animal taxonomists have not found a way to establish a regularly updated consensus. In this regard the IOC Checklist suggests a way forward: “in this global world of wiki-style sharing of knowledge, we invite world birders and ornithologsts alike to help us keep the IOC list accurate, vital, and accessible.” 

Back to the paper. Tieleman and colleagues “tested for a genetic effect on BMR based on mitochondrial-nuclear coadaptation using hybrids between ancestral populations with high and low BMR (Europe-Africa and Asia-Europe), with different parental configurations (female-high x male-low or female-low x male-high). Hybrids with different parental configurations have on average identical mixtures of nuclear DNA, but differ in mitochondrial DNA because it is inherited only from the mother.”  The researchers found that metabolic rate differed between hybrids with contrasting parental configurations, “providing evidence for the importance of a match between mitochondrial and nuclear genomes to regulate metabolic rate.” So far so good. However, contrary to expectations, in both sets of crosses, metabolic rates in hybrids were more similar to that of the father than the mother! (see adapted figure). This result is a puzzler; it suggests there might be another factor such as genomic imprinting at work. 

Looking at the bigger picture, for those interested in mitochondrial evolution, there is a lot of opportunity: a large and growing database of COI sequences (>500,000 individuals, >50,000 species so far) that is waiting to be analyzed for evidence of purifying or positive selection, for example, or for limits to plasticity in COI amino acid sequence. I wonder if there might be convergent evolution of COI, such that diverse organisms in very cold or very hot environments environments, for example, might exhibit similar amino acid substitutions.

DNA analysis helps unravel food webs

Sunday, March 8th, 2009

What do leaf beetles (Chrysomelidae) eat? In 11 Nov 2008 Proc R Soc B researchers from Spain, London, and Australia, apply DNA analysis to 76 species (1 individual/species) of Australian leaf beetles. Jurado-Rivera and colleagues extracted DNA from whole beetles using DNAeasy kit. To identify plant DNA in beetle extracts, they amplified chloroplast trnL intron (313 to 581 bp in analyzed samples). 70 (92%) of samples gave high quality reads after direct sequencing of the PCR products, consistent with ingestion of a single plant species; the remaining samples were sequenced from cloned PCR products; these gave 2 divergent clones in 3 of seven cases, for a total of 81 different trnL intron sequences. Why use trnL intron? The authors cite the large number of sequences in GenBank and favorable experience (ie successful amplification and good taxonomic resolution) in their prior work and that of others (eg Taberlet et al 2007 Nucleic Acids Res 35:e14). This certainly makes sense, but I hope a general agreement for plant barcode standards will be published shortly, otherwise the field will continue to be hobbled by having multiple incomplete and non-overlapping databases for the various markers. For example, according to the authors “only 14 and 15 of approximately 1000 and 800 described Australian species of Acacia and Eucalyptus, respectively, are represented in GenBank by trnL intron sequences. As to what resolution is possible with current trnL database, the authors found “reliable identification to plant family in every case and very frequently the inference is possible at lower taxonomic levels.” 

There also needs to be an agreement to have a curated plant barcode database. As the authors report, “in the course of this study, we found several examples of erroneous taxonomic assignments (e.g. Sapindaceae identified as Cypripedium, Cypripedioideae; Apocynaceae labelled as Sesamum, Pedaliaceae; one case of names switched between Pittosporum and Cheiranthera, both Pittosporaceae; suspicious generic assignment for Aesculus x carnea), and of sequencing artefacts (e.g. Tragopogon spp., Acacia usumatensis) and chimeras (e.g.Pentaphylax euryoides). Problems introduced by these sequences were only apparent after careful inspection of trees revealing suspicious relationships, and required phylogenetic re-evaluation after removing problematic sequence data.” 

This is helpful for the present study, but the problematic sequences remain in the reference databases, ready to trip up the next set of researchers who might not be so careful. To fix this problem, Jurado-Rivera and colleagues make what I think is the wrong suggestion, namely “all of the above would argue for the use of additional markers”. Adding markers may improve the ability to make species-level identifications in plants, but if the goal is to construct an error-free database, adding markers is an expensive and likely ineffective way to ferret out mislabeled or otherwise inaccurate sequences. What is needed is a stand-alone database, closely-linked to GenBank, in which problematic sequences can be weeded out or re-labeled (ie Barcode of Life Database (BOLD) www.barcodinglife.org).

To construct a beetle phylogeny, the authors amplified COI and EF1a from their specimens. They found strong concordance between the evolutionary histories of Australian Chrysomelinae beetles and their host plants, indicating long-term co-evolution. They conclude “our analysis not only shows the details of ecological associations for a dominant herbivore group but also offers the basis for their evolutionary interpretation.”

I am puzzled that the authors amplified a segment in the 3′ half of COI that does not overlap with the standard animal barcode region, making it impossible to combine their data with the 500,000+ COI sequences analyzed to date (www.barcodinglife.org). This important caveat aside, I look forward to many more studies that utilize DNA barcoding to join ecology and phylogenetics.


Contact: mark.stoeckle@rockefeller.edu

About this site

This web site is an outgrowth of the Taxonomy, DNA, and Barcode of Life meeting held at Banbury Center, Cold Spring Harbor Laboratory, September 9-12, 2003. It is designed and managed by Mark Stoeckle, Perrin Meyer, and Jason Yung at the Program for the Human Environment (PHE) at The Rockefeller University.

About the Program for the Human Environment

The involvement of the Program for the Human Environment in DNA barcoding dates to Jesse Ausubel's attendance in February 2002 at a conference in Nova Scotia organized by the Canadian Center for Marine Biodiversity. At the conference, Paul Hebert presented for the first time his concept of large-scale DNA barcoding for species identification. Impressed by the potential for this technology to address difficult challenges in the Census of Marine Life, Jesse agreed with Paul on encouraging a conference to explore the contribution taxonomy and DNA could make to the Census as well as other large-scale terrestrial efforts. In his capacity as a Program Director of the Sloan Foundation, Jesse turned to the Banbury Conference Center of Cold Spring Harbor Laboratory, whose leader Jan Witkowski prepared a strong proposal to explore both the scientific reliability of barcoding and the processes that might bring it to broad application. Concurrently, PHE researcher Mark Stoeckle began to work with the Hebert lab on analytic studies of barcoding in birds. Our involvement in barcoding now takes 3 forms: assisting the organizational development of the Consortium for the Barcode of Life and the Barcode of Life Initiative; contributing to the scientific development of the field, especially by studies in birds, and contributing to public understanding of the science and technology of barcoding and its applications through improved visualization techniques and preparation of brochures and other broadly accessible means, including this website. While the Sloan Foundation continues to support CBOL through a grant to the Smithsonian Institution, it does not provide financial support for barcoding research itself or support to the PHE for its research in this field.