The binning of metagenomic contigs for microbial physiology of mixed cultures.

Strous M, Kraft B, Bisdorf R, Tegetmeyer H (2012)
Frontiers in Microbiology 3.

Journal Article | Original Article | Published | English

Abstract
So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology, and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to characterize such mixed cultures. In the analysis of metagenomic data, well-established algorithms exist for the assembly of short reads into contigs and for the annotation of predicted genes. However, the binning of the assembled contigs or unassembled reads is still a major bottleneck, and it is required to understand how the overall metabolism is partitioned over different community members. Binning consists of the clustering of contigs or reads that apparently originate from the same source population. In the present study, eight metagenomic samples from the same habitat, a laboratory enrichment culture, were sequenced. Each sample contained 13-23 Mb of assembled contigs and up to eight abundant populations. Binning was attempted with existing methods, but they produced poor results, were slow, depended on non-standard platforms, or produced errors. A new binning procedure was developed based on multivariate statistics of tetranucleotide frequencies combined with the use of interpolated Markov models. Its performance was evaluated by comparing results between samples with BLAST and by comparison with existing algorithms on four publicly available metagenomes and one previously published artificial metagenome. The accuracy of the new approach was comparable to or higher than that of existing methods. Further, it was up to 100 times faster. It was implemented in Java Swing as a complete open source graphical binning application available for download and further development (http://sourceforge.net/projects/metawatt).
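As a rough illustration of the tetranucleotide-frequency signal the abstract refers to, the following Python sketch computes a 256-dimensional frequency vector per contig and compares contigs by Euclidean distance. It is not the published implementation: the function names `tetra_freqs` and `euclidean` and the toy sequences are assumptions made for this example, and the multivariate statistics and interpolated Markov models used in the study are not shown.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
# All 256 possible tetranucleotides, in a fixed order.
TETRAMERS = ["".join(p) for p in product(BASES, repeat=4)]


def tetra_freqs(seq):
    """Return the relative frequency of each tetranucleotide in seq.

    Windows containing characters other than A, C, G, T (e.g. N) are ignored.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[t] for t in TETRAMERS)
    if total == 0:
        return [0.0] * len(TETRAMERS)
    return [counts[t] / total for t in TETRAMERS]


def euclidean(a, b):
    """Euclidean distance between two equal-length frequency vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


# Hypothetical toy "contigs": two AT-rich fragments and one GC-rich fragment.
contig_a = "AAAATAAAATAAAAT"
contig_b = "AAATAAAATAAAATA"
contig_c = "GGGCGGGCGGGCGGG"

fa, fb, fc = (tetra_freqs(s) for s in (contig_a, contig_b, contig_c))
print("a vs b:", euclidean(fa, fb))
print("a vs c:", euclidean(fa, fc))
```

In this toy case the two AT-rich fragments lie closer to each other than either does to the GC-rich fragment, which is the kind of compositional similarity a binning procedure can exploit. Real contigs are far longer, and a practical tool would typically also account for reverse complements and sequence length.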

Cite this

Strous M, Kraft B, Bisdorf R, Tegetmeyer H. The binning of metagenomic contigs for microbial physiology of mixed cultures. Frontiers in Microbiology. 2012;3.
Strous, M., Kraft, B., Bisdorf, R., & Tegetmeyer, H. (2012). The binning of metagenomic contigs for microbial physiology of mixed cultures. Frontiers in Microbiology, 3. doi:10.3389/fmicb.2012.00410
Strous, M., Kraft, B., Bisdorf, R., and Tegetmeyer, H. (2012). The binning of metagenomic contigs for microbial physiology of mixed cultures. Frontiers in Microbiology 3.
Strous, M., et al., 2012. The binning of metagenomic contigs for microbial physiology of mixed cultures. Frontiers in Microbiology, 3.
M. Strous, et al., “The binning of metagenomic contigs for microbial physiology of mixed cultures,” Frontiers in Microbiology, vol. 3, 2012.
Strous, M., Kraft, B., Bisdorf, R., Tegetmeyer, H.: The binning of metagenomic contigs for microbial physiology of mixed cultures. Frontiers in Microbiology. 3, (2012).
Strous, Marc, Kraft, Beate, Bisdorf, Regina, and Tegetmeyer, Halina. “The binning of metagenomic contigs for microbial physiology of mixed cultures.” Frontiers in Microbiology 3 (2012).

29 Citations in Europe PMC

Data provided by Europe PubMed Central.

Metagenome from a Spirulina digesting biogas reactor: analysis via binning of contigs and classification of short reads.
Nolla-Ardevol V, Peces M, Strous M, Tegetmeyer HE., BMC Microbiol. 15, 2015
PMID: 26680455
Abundant toxin-related genes in the genomes of beneficial symbionts from deep-sea hydrothermal vent mussels.
Sayavedra L, Kleiner M, Ponnudurai R, Wetzel S, Pelletier E, Barbe V, Satoh N, Shoguchi E, Fink D, Breusing C, Reusch TB, Rosenstiel P, Schilhabel MB, Becher D, Schweder T, Markert S, Dubilier N, Petersen JM., Elife 4, 2015
PMID: 26371554
Metabolic diversity and ecological niches of Achromatium populations revealed with single-cell genomic sequencing.
Mansor M, Hamilton TL, Fantle MS, Macalady JL., Front Microbiol 6, 2015
PMID: 26322031
Selection Maintains Low Genomic GC Content in Marine SAR11 Lineages.
Luo H, Thompson LR, Stingl U, Hughes AL., Mol. Biol. Evol. 32(10), 2015
PMID: 26116859

10 References

Data provided by Europe PubMed Central.

Unsupervised statistical clustering of environmental shotgun sequences.
Kislyuk A, Bhatnagar S, Dushoff J, Weitz JS., BMC Bioinformatics 10, 2009
PMID: 19799776
Phylogenetic classification of short environmental DNA fragments.
Krause L, Diaz NN, Goesmann A, Kelley S, Nattkemper TW, Rohwer F, Edwards RA, Stoye J., Nucleic Acids Res. 36(7), 2008
PMID: 18285365
TETRA: a web-service and a stand-alone program for the analysis and comparison of tetranucleotide usage patterns in DNA sequences.
Teeling H, Waldmann J, Lombardot T, Bauer M, Glockner FO., BMC Bioinformatics 5, 2004
PMID: 15507136
BLAST+: architecture and applications.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL., BMC Bioinformatics 10, 2009
PMID: 20003500
Identifying bacterial genes and endosymbiont DNA with Glimmer.
Delcher AL, Bratke KA, Powers EC, Salzberg SL., Bioinformatics 23(6), 2007
PMID: 17237039
TACOA: taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach.
Diaz NN, Krause L, Goesmann A, Niehaus K, Nattkemper TW., BMC Bioinformatics 10, 2009
PMID: 19210774
Integrative analysis of environmental sequences using MEGAN4.
Huson DH, Mitra S, Ruscheweyh HJ, Weber N, Schuster SC., Genome Res. 21(9), 2011
PMID: 21690186
Clustering metagenomic sequences with interpolated Markov models.
Kelley DR, Salzberg SL., BMC Bioinformatics 11, 2010
PMID: 21044341

Sources

PMID: 23227024