Finding novel genes in bacterial communities isolated from the environment

Krause L, Diaz NN, Bartels D, Edwards RA, Pühler A, Rohwer F, Meyer F, Stoye J (2006)
BIOINFORMATICS 22(14): e281-e289.

Journal Article | Published | English

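Author
Lutz Krause; Naryttza N. Diaz; Daniela Bartels; Robert A. Edwards; Alfred Pühler; Forest Rohwer; Folker Meyer; Jens Stoye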
Abstract
Motivation: Novel sequencing techniques can give access to organisms that are difficult to cultivate using conventional methods. When applied to environmental samples, the data generated have some drawbacks, e.g. short assembled contigs, in-frame stop codons and frame shifts. Unfortunately, current gene finders cannot circumvent these difficulties. At the same time, automated gene prediction is a prerequisite for keeping pace with the increasing amount of genomic sequence data and thus for progress in metagenomics.
Results: We introduce a novel gene finding algorithm that incorporates features to overcome the short length of contigs assembled from environmental data, as well as the in-frame stop codons and frame shifts contained in bacterial sequences. The results show that, by searching for sequence similarities in an environmental sample, our algorithm is capable of detecting a high fraction of its gene content, depending on the species composition and the overall size of the sample. The method is valuable for hunting novel, unknown genes that may be specific to the habitat where the sample was taken. Finally, we show that our algorithm can even exploit the limited information contained in the short reads generated by 454 technology for the prediction of protein-coding genes.


24 Citations in Europe PMC

Data provided by Europe PubMed Central.

Successful heterologous expression of a novel chitinase identified by sequence analyses of the metagenome from a chitin-enriched soil sample.
Stoveken J, Singh R, Kolkenbrock S, Zakrzewski M, Wibberg D, Eikmeyer FG, Puhler A, Schluter A, Moerschbacher BM., J. Biotechnol. 201, 2015
PMID: 25240439
Gene prediction in metagenomic fragments based on the SVM algorithm.
Liu Y, Guo J, Hu G, Zhu H., BMC Bioinformatics 14 Suppl 5, 2013
PMID: 23735199
FragGeneScan: predicting genes in short and error-prone reads.
Rho M, Tang H, Ye Y., Nucleic Acids Res. 38(20), 2010
PMID: 20805240
Ab initio gene identification in metagenomic sequences.
Zhu W, Lomsadze A, Borodovsky M., Nucleic Acids Res. 38(12), 2010
PMID: 20403810
Metagenomics: Facts and Artifacts, and Computational Challenges*
Wooley JC, Ye Y., J Comput Sci Technol 25(1), 2009
PMID: 20648230
The microbial ocean from genomes to biomes.
DeLong EF., Nature 459(7244), 2009
PMID: 19444206
Orphelia: predicting genes in metagenomic sequencing reads.
Hoff KJ, Lingner T, Meinicke P, Tech M., Nucleic Acids Res. 37(Web Server issue), 2009
PMID: 19429689
Laboratory procedures to generate viral metagenomes.
Thurber RV, Haynes M, Breitbart M, Wegley L, Rohwer F., Nat Protoc 4(4), 2009
PMID: 19300441
A bioinformatician's guide to metagenomics.
Kunin V, Copeland A, Lapidus A, Mavromatis K, Hugenholtz P., Microbiol. Mol. Biol. Rev. 72(4), 2008
PMID: 19052320
The development and impact of 454 sequencing.
Rothberg JM, Leamon JH., Nat. Biotechnol. 26(10), 2008
PMID: 18846085
The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes.
Meyer F, Paarmann D, D'Souza M, Olson R, Glass EM, Kubal M, Paczian T, Rodriguez A, Stevens R, Wilke A, Wilkening J, Edwards RA., BMC Bioinformatics 9, 2008
PMID: 18803844
The metagenome of a biogas-producing microbial community of a production-scale biogas plant fermenter analysed by the 454-pyrosequencing technology.
Schluter A, Bekel T, Diaz NN, Dondrup M, Eichenlaub R, Gartemann KH, Krahn I, Krause L, Kromeke H, Kruse O, Mussgnug JH, Neuweger H, Niehaus K, Puhler A, Runte KJ, Szczepanowski R, Tauch A, Tilker A, Viehover P, Goesmann A., J. Biotechnol. 136(1-2), 2008
PMID: 18597880
Insight into the plasmid metagenome of wastewater treatment plant bacteria showing reduced susceptibility to antimicrobial drugs analysed by the 454-pyrosequencing technology.
Szczepanowski R, Bekel T, Goesmann A, Krause L, Kromeke H, Kaiser O, Eichler W, Puhler A, Schluter A., J. Biotechnol. 136(1-2), 2008
PMID: 18586057
Metagenomics in animal gastrointestinal ecosystem: Potential biotechnological prospects.
Singh B, Gautam SK, Verma V, Kumar M, Singh B., Anaerobe 14(3), 2008
PMID: 18457965
Identification of new genes in Sinorhizobium meliloti using the Genome Sequencer FLX system.
Mao C, Evans C, Jensen RV, Sobral BW., BMC Microbiol. 8, 2008
PMID: 18454850
Gene prediction in metagenomic fragments: a large scale machine learning approach.
Hoff KJ, Tech M, Lingner T, Daniel R, Morgenstern B, Meinicke P., BMC Bioinformatics 9, 2008
PMID: 18442389
Genomic DNA sequence comparison between two inbred soybean cyst nematode biotypes facilitated by massively parallel 454 micro-bead sequencing.
Bekal S, Craig JP, Hudson ME, Niblack TL, Domier LL, Lambert KN., Mol. Genet. Genomics 279(5), 2008
PMID: 18324416
Bioinformatics challenges of new sequencing technology.
Pop M, Salzberg SL., Trends Genet. 24(3), 2008
PMID: 18262676
Get the most out of your metagenome: computational analysis of environmental sequence data.
Raes J, Foerstner KU, Bork P., Curr. Opin. Microbiol. 10(5), 2007
PMID: 17936679
Miniaturizing chemistry and biology in microdroplets.
Kelly BT, Baret JC, Taly V, Griffiths AD., Chem. Commun. (Camb.) (18), 2007
PMID: 17476389
Updating the metagenomics toolbox.
Gabor E, Liebeton K, Niehaus F, Eck J, Lorenz P., Biotechnol J 2(2), 2007
PMID: 17294408

Sources

PMID: 16873483