Phylogenetic classification of short environmental DNA fragments

Krause L, Diaz NN, Goesmann A, Kelley S, Nattkemper TW, Rohwer F, Edwards RA, Stoye J (2008)
Nucleic Acids Research 36(7): 2230-2239.

Journal Article | Original Article | Published | English
Abstract
Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at low cost and without cloning bias. However, the phylogenetic analysis of the short reads produced represents a significant computational challenge. The phylogenetic algorithm CARMA for predicting the source organisms of environmental 454 reads is described. The algorithm searches for conserved Pfam domains and protein families in the unassembled reads of a sample. These gene fragments (environmental gene tags, EGTs) are classified into a higher-order taxonomy based on the reconstruction of a phylogenetic tree of each matching Pfam family. The method exhibits high accuracy for a wide range of taxonomic groups, and EGTs as short as 27 amino acids can be phylogenetically classified up to the rank of genus. The algorithm was applied in a comparative study of three aquatic microbial samples obtained by 454 pyrosequencing. Profound differences in the taxonomic composition of these samples could be clearly revealed.

Cite this

Krause, L., Diaz, N. N., Goesmann, A., Kelley, S., Nattkemper, T. W., Rohwer, F., Edwards, R. A., et al. (2008). Phylogenetic classification of short environmental DNA fragments. Nucleic Acids Research, 36(7), 2230-2239. doi:10.1093/nar/gkn038
Krause, Lutz, Diaz, Naryttza N., Goesmann, Alexander, Kelley, Scott, Nattkemper, Tim Wilhelm, Rohwer, Forest, Edwards, Robert A., and Stoye, Jens. “Phylogenetic classification of short environmental DNA fragments”. Nucleic Acids Research 36.7 (2008): 2230-2239.

128 Citations in Europe PMC

Data provided by Europe PubMed Central.

Analysis of composition-based metagenomic classification.
Higashi S, Barreto Ada M, Cantao ME, de Vasconcelos AT., BMC Genomics 13 Suppl 5, 2012
PMID: 23095761
Peptide markers of aminoacyl tRNA synthetases facilitate taxa counting in metagenomic data.
Persi E, Weingart U, Freilich S, Horn D., BMC Genomics 13, 2012
PMID: 22325056
The impact of normalization and phylogenetic information on estimating the distance for metagenomes.
Su CH, Wang TY, Hsu MT, Weng FC, Kao CY, Wang D, Tsai HK., IEEE/ACM Trans Comput Biol Bioinform 9(2), 2012
PMID: 21844636


Sources

PMID: 18285365