The Barcode Blog

A mostly scientific blog about short DNA sequences for species identification and discovery. I encourage your commentary. -- Mark Stoeckle

Subscribe to this blog

Sign up for email notifications

Minimalist DNA barcodes to help with museum specimens

Analyzing shorter barcode sequences is an inexpensive way to link museum specimens with degraded DNA to the barcode database. In Molecular Ecology Notes July 2006, Hajibabaei et al first demonstrate in silico that COI sequences as short as 109 base pairs contain enough information to assign most specimens to known species, using simulated “minibarcodes” taken from two full-length barcode datasets. The researchers then analyzed the recovery and performance of various lengths of “minibarcodes” amplified from 33 dried and 91 ethanol-preserved insect specimens ranging in age from 1 to 21 years. As shown in the below, although full-length barcodes were recovered from only 24-39% of specimens, there was encouragingly high success amplifying shorter segments. 

 

As expected from the in silico analysis, in most cases species could be distinguished as well as with full-length barcodes, ie sequences formed distinct non-overlapping clusters in a NJ tree. Hajibabaei et al’s results indicate that analyzing shorter minibarcode sequences can link museum specimens with degraded DNA to the gold standard full-length barcode database. Rather than spend time and money optimizing primers and amplification conditions on individual specimens, instead apply a general method that recovers a 100-400 bp fragment. They point out this approach will be useful “when barcoding reveals several cryptic species within what had been viewed as one species, and it is not morphologically evident which of them matches the holotype” and as “a cost-effective way of building barcode libraries with broad geographical coverage”. They caution that “very short barcode sequences are..valuable for the identification of old specimens from SELECTED NARROW taxonomic arrays” (emphasis added)

I agree a mini-barcode approach can be useful in certain situations, and emphasize their caution that it is not a substitute for a standardized full-length barcode database. First, if widely used, a minimalist approach could easily devolve into a Tower of Babel, with a hodgepodge of non-overlapping minibarcodes that cannot be compared to each other. Second, even if the minibarcodes were standardized so they all overlapped, a simple calculation implies that they would lump together most species with less than 1% sequence difference (in birds, this is about 15% of species). Less than 1% sequence difference means less than 6.5 diagnostic differences with a full-length barcode, and assuming randomly distributed substitutions, a shorter barcode could easily fail to capture any diagnostic differences. 

This entry was posted on Monday, August 7th, 2006 at 9:30 am and is filed under General. You can follow any responses to this entry through the RSS 2.0 feed. Both comments and pings are currently closed.

Comments are closed.

Contact: mark.stoeckle@rockefeller.edu

About this site

This web site is an outgrowth of the Taxonomy, DNA, and Barcode of Life meeting held at Banbury Center, Cold Spring Harbor Laboratory, September 9-12, 2003. It is designed and managed by Mark Stoeckle, Perrin Meyer, and Jason Yung at the Program for the Human Environment (PHE) at The Rockefeller University.

About the Program for the Human Environment

The involvement of the Program for the Human Environment in DNA barcoding dates to Jesse Ausubel's attendance in February 2002 at a conference in Nova Scotia organized by the Canadian Center for Marine Biodiversity. At the conference, Paul Hebert presented for the first time his concept of large-scale DNA barcoding for species identification. Impressed by the potential for this technology to address difficult challenges in the Census of Marine Life, Jesse agreed with Paul on encouraging a conference to explore the contribution taxonomy and DNA could make to the Census as well as other large-scale terrestrial efforts. In his capacity as a Program Director of the Sloan Foundation, Jesse turned to the Banbury Conference Center of Cold Spring Harbor Laboratory, whose leader Jan Witkowski prepared a strong proposal to explore both the scientific reliability of barcoding and the processes that might bring it to broad application. Concurrently, PHE researcher Mark Stoeckle began to work with the Hebert lab on analytic studies of barcoding in birds. Our involvement in barcoding now takes 3 forms: assisting the organizational development of the Consortium for the Barcode of Life and the Barcode of Life Initiative; contributing to the scientific development of the field, especially by studies in birds, and contributing to public understanding of the science and technology of barcoding and its applications through improved visualization techniques and preparation of brochures and other broadly accessible means, including this website. While the Sloan Foundation continues to support CBOL through a grant to the Smithsonian Institution, it does not provide financial support for barcoding research itself or support to the PHE for its research in this field.