The deep sea projects of the Census of Marine Life make news with their discoveries, as for example covered by AP’s Cain Burdeau in a 22 November 2009 story “Thousands of strange creatures found deep in ocean.†Meanwhile, the weekly Economist magazine features the Census on the cover of its “The World in 2010†issue and in an excellent article by Alun Anderson, “Introducing the transparent ocean.â€
News
“Son et lumiere” – Exciting Updates
A slightly revised version of Jesse’s May 2009 Dalhousie University Commencement Address, Son et lumiere, is published on 23 November 2009 by the monthly science magazine, SEED, as Broadening the Scope of Global Change to Include Illumination and Noise.
The essay’s recommendation for consideration of an International Quiet Ocean Experiment earns coverage by Mike Seccombe in the Martha’s Vineyard Gazette in a 27 November 2009 article, Quiet please, across the seas: Pause ships to hear the fish.
EPRI and the Lamellibrancid Worm
What does electric power have to do with sea worms? Learn in Jesse’s talk “EPRI and the Lamellibrancid Worm” which spans zero emission power plants and deep carbon.
The (New) Web of Life by Alan Burdick
On Earth, the quarterly magazine of the NRDC, published a detailed article, The (New) Web of Life , by Alan Burdick about the history and progress of the Encyclopedia of Life.
Identifying ocean’s racehorses with DNA
Bluefin tuna are enormous (up to 15 ft/4.5 m, 680 kg/1500 lbs), high-speed (up to 54 km/h, as fast as racehorses) creatures that roam across oceans and return to ancestral waters to spawn. High demand has fueled intensive fishing by international fleets, resulting in 90% population declines heading towards extinction for all three species, Southern (Thunnus maccoyii), Northern (T. thynnus), and Pacific bluefin (T. orientalis). This week in PLoS ONE researchers from the American Museum of Natural History describe DNA-based identification of bluefin and other tuna species using character analysis of COI barcode sequences. Lowenstein and colleagues’ report provides a basis for routine identification of marketplace items to inform consumers and enable enforcement of regulations, including a proposed listing as endangered under Convention of International Trade in Endangered Species (CITES).
The eight species in genus Thunnus are not discriminated by regularly used nuclear loci and differ by about 1% or less in mitochondrial coding regions (e.g., Ward et al 2005 Phil Trans R Soc B), challenging DNA-based identification. To construct a diagnostic key, Lowenstein and colleagues analyzed 89 COI sequences in GenBank representing the eight tuna species and by visual inspection found 14 sites that provided 17 “compound characteristic attributes (CAs)” (terminology from Sarkar et al 2008 Mol Ecol Res). Turning marketplace detectives, the AMNH team collected 68 sushi samples from 31 establishments in New York and Denver over 6 month period in 2008. Nearly one-third (22; 32%) of samples were sold as species contradicted by the molecular data, including items from over half (19; 61%) of the restaurants.
Lowenstein found their character-based identifications were more accurate and precise than those provided by BOLD ID engine (www.barcodinglife.org), largely reflecting that the ID engine uses a 2% cutoff for assigning specimens to species, which encompasses all eight Thunnus sp. In addition, BOLD is a workbench for researchers and so contains many as yet unpublished sequences from ongoing studies; these need to be viewed as provisional data. Indeed, in constructing their key Lowenstein and colleagues set aside 2 of the 89 GenBank tuna sequences as these grouped with other species. These anomalous sequences might reflect hybridization or introgression which is reported to occur in 2-3% of Atlantic bluefin, for example (Viñas and Tudela 2009 PLoS ONE). In this study, researchers from Universitat de Girona, Spain and World Wildlife Fund describe a DNA-based method for distinguishing tuna species using mitochondrial control region and nuclear ITS. Here again the method is validated using published data, in this case 42 GenBank records representing the 8 species. As an aside, I find it remarkable there are so few records that might enable identification of such commercially-important and now endangered species. These two studies establish a scientific and possible legal standard for tuna identification. Now we begin.
Tracking disease vectors with DNA
What hosts sustain arthropod disease vectors when they are not biting humans? In September 2009 PLoS ONE, researchers from Doñana Research Station, Seville, Spain, report on a “universal DNA barcoding method to identify vertebrate hosts from arthropod bloodmeals.” The investigators collected “wildlife engorged mosquitoes, culicoids [biting midges] and sand flies (Phlebotomiae)…using CDC traps supplied with dry ice to attract ectoparasites through light and CO2.”
To design vertebrate-specific primers that would not amplify the more abundant arthropod DNA, Alcaide and colleagues “downloaded all vertebrate COI sequences (N = 18,2980 from the Classes Mammalia, Aves, Amphibia, and Reptilia that were available in the public domain managed by BOLD Systems database in January 2009” and compared these to “6,784 arthropod COI sequences from taxonomic groups that included blood-feeding species.” From this comparison they designed degenerate (multiple nucleotides at some positions) primers that were >99% matched to vertebrate target sequences and >99% mismatched to invertebrate targets. It would be helpful in this and other studies if the description of new primer(s) gave the position of the 3′ end of each primer as compared to mouse mitochondrial COI for instance. This would make it clear which portion of the COI barcode region is being amplified.
The first pass test with these primers gave PCR products in 43 of 100 mosquito bloodmeals, and reamplification with a slightly different primer set yielded sequenceable products in 97 of 100 cases; this re-amplification protocol was applied to the other vector species with “satisfactory” results. All except 5 matched at >99% level to vertebrate sequences from museum voucher specimens. For 3 of the uncertain identity sequences, they used the closest BOLD matches and knowledge of local fauna to “deduce that these species could be the Iberian hare Lepus granatensis, the red-legged partridge Alectoris rufa and the Egyptian mongoose Herpestes ichneumon.” The other two without close matches were from ticks collected while still feeding so the hosts were known. By my count they detected 18 mammalian and 26 avian host species in arthropod bloodmeals; to me this is remarkable variety given the relatively small number of bloodmeals tested. I look forward to learning more through DNA tracking of biting arthropods.
Tropical tree identification with DNA
Two groups of researchers explore tropical forest plots with DNA barcodes in October 2009 PLoS ONE and Proc Natl Acad Sci USA (both open access, the latter Twittered!). It is just three months ago a community standard for DNA barcoding land plants was announced, namely the plastid genes rbcL and matK, with species-level identification in 72% of cases tested and identification to “species groups” in the remainder. The two papers mentioned above represent the early roll-out so we can expect much more will be learned about DNA barcoding in plants in particular and about plant biology in general.
In PLoS ONE, researchers from France, French Guiana, and New York apply DNA barcoding to two 1-hectare plots in the “pristine lowland tropical rainforest” of central French Guiana, which represents one of the largest tracts of intact Amazonian rainforest. Working out of the Nouragues Research Station (“gateway to European rainforest”) Gonzales and colleagues collected leaf and cambium (living outer layer of wood) samples from all trees 10 cm or greater in diameter, with the assistance of professional tree climbers for large trees and use of climbing spikes for smaller specimens. The extreme efforts required to collect morphologically-identifiable specimens highlights the desirability of a DNA-based approach that could be applied nearer to ground level! A total of 1073 trees were sampled, which were sorted into 301 morphospecies; of these, 254 (85%) were “matched to a reference voucher with an acceptable species name…[encompassing] 143 genera and 54 angiosperm families, so that is a lot of tree diversity! For comparison there about 1000 native tree species in all of North America. PCR was carried out for multiple loci: in addition to above-mentioned standards rbcL and matK, these included plastid genes rpoC1, rpoB, and ycf5, non-coding trnL and psb-trnH, and nuclear ITS. The researchers also applied DNA barcoding to “juveniles” i.e. saplings in the same plots, of which just 27% could be identified to species, plus another 45% to morphotype, and 11% to genus (this leaves 17% not identified to genus). Not surprisingly given the diversity of species, sample types, markers, and uncertainties in the underlying taxonomy, the researchers’ results are complex. Regarding tissue types, they obtained amplifiable DNA from most or all leaf and cambium samples, with high success for some markers (e.g., rbcL sequencing rate 93%), supporting ground-level sampling strategies. Regarding markers, they had difficulty amplifying matK (68% success) and ITS (41%). Similar to prior observations, the overall rate for species-level identification using plastid markers plateaued at about 70%, thus two loci capture most of what is available from this genetic compartment.
In Proc Natl Acad Sci USA, researchers from Smithsonian Institution, Smithsonian Tropical Research Institute (STRI), Imperial College, and Harvard University apply DNA barcoding with rbcL, matK, and trnH-psbA to 1035 tree samples representing 296 species in STRI’s 1,000 x 500 m Forest Dynamics Plot on Barro Colorado Island, Panama. They had similar sequencing success to Guiana study (rbcL, 93%; trnH-psbA, 94%; matK, 69%). Overall success at species-level identification was 92% for rbcL + matK; 95% for rbcL + trnH-psbA, and 98% for all three markers, with the denominator in these comparisons apparently being #samples with available sequences. I am uncertain as to why species-level identification was higher in Panama as compared to Guiana study; the total number of samples and species is similar so presumably this reflects particular aspects of the species composition such as recent radiations in these locations. Kress and colleagues constructed a supermatrix with this data, generating a “robust community phylogeny for 281 of the 296 species in the plot.” They conclude “DNA barcodes stand poised to serve as an efficient and effective approach to building community phylogenies…[aiding] understanding niche conservation and the dynamics of species composition at landscape and global scales.” Sounds promising!
World species census updated
How many species are there? One widely cited estimate, now 24 years old, is 1.7 million named species (EO Wilson 1985 Science 230:1227). This estimate is updated in detailed form in September 2009 publication from Australian Government “Numbers of Living Species in Australia and the World, 2nd edition” by Arthur Chapman (illustrated report open access for perusing online or as pdf for download). According to Chapman’s analysis, there are 1.9 million published species in the world. Approximately 18,000 new species are described each year, 75% of which are invertebrates, 11% vascular plants, and 7% vertebrates. Chapman estimates the true number of world species is about 11 million. The largest uncertainties, for which it is estimated fewer than 10% of species have been named, are for fungi, single-celled eukaryotes (protocista, cyanophyta, chromista), and “prokaryotes”, i.e. eubacteria and archaea.
This overview brings to mind pictures of the distribution of matter and dark matter in the universe. On a large scale, is the “density” of species uniform? For example, given there about about 10,000 bird and about 40,000 fish species, do fish take up 4x as much diversity space? We know on a small scale there are some “high-density” closely-related groups of species, like cichlid fishes in Africa, but can we map the distribution of diversity on a larger scale? Large databases of homologous sequences representing diverse species (aka DNA barcodes; as of today, BOLD has over 700,000 records representing over 64,000 species) and new mathematical approaches to calculating diversity from nucleotide sequences (eg Sirovich 2009 PLoS ONE; I am co-author) may help provide a biological macroscope (Ausubel PNAS 2009) for understanding the genetic structure of biodiversity, complementary to the historical view expressed in the Tree of Life.
A Scalable Method for Analysis and Display of DNA Sequences
Together with colleagues at Mt. Sinai School of Medicine, we report a new mathematical approach to the genetic structure of biodiversity, using indicator vectors calculated from short DNA sequences. Sirovich L, Stoeckle MY, Zhang Y (2009) A Scalable Method for Analysis and Display of DNA Sequences. PLoS ONE 4(10): e7051. This method is scalable to the largest datasets envisioned in this field and provides a macroscopic view of “biodiversity space”. It offers a complement to tree-building techniques and could enable automated classification at various taxonomic levels.
From the Abstract:
The indicator vectors preserved DNA character information and provided quantitative measures of correlations among taxonomic groups. This method is scalable to the largest datasets envisioned in this field, provides a visually-intuitive display that captures relational affinities derived from sequence data across a diversity of life forms, and is potentially a useful complement to current tree-building techniques for studying evolutionary processes based on DNA data.
To download zip files containing MatLab code and datasets utilized in this paper, select the following links:
- PLoS_ThreeGroups_v1_2.zip (updated March 2010)
- PLoS_Bird_Analysis_v1_2.zip (updated March 2010)
Technology will save us
The 23 September 2009 issue of New Scientist magazine publishes an interview about population with Jesse.