Phylogenetic classification of short environmental DNA fragments

Krause L, Diaz NN, Goesmann A, Kelley S, Nattkemper TW, Rohwer F, Edwards RA, Stoye J (2008)
Nucleic Acids Research 36(7): 2230-2239.

Journal Article | Original Article | Published | English
Abstract
Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at low cost and without cloning bias. However, the phylogenetic analysis of the short reads produced represents a significant computational challenge. The phylogenetic algorithm CARMA for predicting the source organisms of environmental 454 reads is described. The algorithm searches for conserved Pfam domain and protein families in the unassembled reads of a sample. These gene fragments (environmental gene tags, EGTs) are classified into a higher-order taxonomy based on the reconstruction of a phylogenetic tree of each matching Pfam family. The method exhibits high accuracy for a wide range of taxonomic groups, and EGTs as short as 27 amino acids can be phylogenetically classified up to the rank of genus. The algorithm was applied in a comparative study of three aquatic microbial samples obtained by 454 pyrosequencing. Profound differences in the taxonomic composition of these samples were clearly revealed.
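Illustrative sketch (not part of the published record): the abstract states that EGTs are assigned to a higher-order taxonomy from a phylogenetic tree of the matching Pfam family, but it does not spell out the assignment rule. The Python fragment below shows one simple way such an assignment could work, assuming the reference sequences closest to an EGT in the tree are already known and each carries a lineage ordered from superkingdom to genus. The names RANKS, shared_lineage and classify_egt are hypothetical, and the rule (keep only the ranks on which all neighbours agree) is a stand-in for illustration, not CARMA's documented procedure.

```python
from typing import Dict, List, Sequence

# Assumed rank order for this illustration; lineages are listed in this order.
RANKS = ["superkingdom", "phylum", "class", "order", "family", "genus"]


def shared_lineage(lineages: Sequence[Sequence[str]]) -> List[str]:
    """Return the longest leading run of taxa on which all lineages agree."""
    shared: List[str] = []
    for names in zip(*lineages):
        if len(set(names)) != 1:
            break  # neighbours disagree at this rank, so stop here
        shared.append(names[0])
    return shared


def classify_egt(neighbor_lineages: Sequence[Sequence[str]]) -> Dict[str, str]:
    """Map each rank to a taxon, down to the deepest rank all neighbours share."""
    return dict(zip(RANKS, shared_lineage(neighbor_lineages)))
```

With two neighbouring reference sequences that share a family but differ in genus, classify_egt returns an assignment that stops at the family rank; if all neighbours belong to one genus, the assignment reaches genus.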

Cite this

Krause, L., Diaz, N. N., Goesmann, A., Kelley, S., Nattkemper, T. W., Rohwer, F., Edwards, R. A., et al. (2008). Phylogenetic classification of short environmental DNA fragments. Nucleic Acids Research, 36(7), 2230-2239. doi:10.1093/nar/gkn038




Sources

PMID: 18285365