Finding novel genes in bacterial communities isolated from the environment

Krause L, Diaz NN, Bartels D, Edwards RA, Pühler A, Rohwer F, Meyer F, Stoye J (2006)
BIOINFORMATICS 22(14): e281-e289.

No full text has been uploaded. Publication record only!
Journal article | Published | English
Abstract / Note
Motivation: Novel sequencing techniques can give access to organisms that are difficult to cultivate using conventional methods. When applied to environmental samples, the data generated has some drawbacks, e.g. short length of assembled contigs, in-frame stop codons and frame shifts. Unfortunately, current gene finders cannot circumvent these difficulties. At the same time, the automated prediction of genes is a prerequisite for the increasing amount of genomic sequences to ensure progress in metagenomics.

Results: We introduce a novel gene finding algorithm that incorporates features overcoming the short length of the assembled contigs from environmental data, in-frame stop codons as well as frame shifts contained in bacterial sequences. The results show that by searching for sequence similarities in an environmental sample our algorithm is capable of detecting a high fraction of its gene content, depending on the species composition and the overall size of the sample. The method is valuable for hunting novel unknown genes that may be specific for the habitat where the sample is taken. Finally, we show that our algorithm can even exploit the limited information contained in the short reads generated by 454 technology for the prediction of protein coding genes.


Krause L, Diaz NN, Bartels D, et al. Finding novel genes in bacterial communities isolated from the environment. BIOINFORMATICS. 2006;22(14):e281-e289.
Krause, L., Diaz, N. N., Bartels, D., Edwards, R. A., Pühler, A., Rohwer, F., Meyer, F., et al. (2006). Finding novel genes in bacterial communities isolated from the environment. BIOINFORMATICS, 22(14), e281-e289. doi:10.1093/bioinformatics/btl247
Krause, L., Diaz, N. N., Bartels, D., Edwards, R. A., Pühler, A., Rohwer, F., Meyer, F., and Stoye, J. (2006). Finding novel genes in bacterial communities isolated from the environment. BIOINFORMATICS 22, e281-e289.
Krause, L., et al., 2006. Finding novel genes in bacterial communities isolated from the environment. BIOINFORMATICS, 22(14), p e281-e289.
L. Krause, et al., “Finding novel genes in bacterial communities isolated from the environment”, BIOINFORMATICS, vol. 22, 2006, pp. e281-e289.
Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Pühler, A., Rohwer, F., Meyer, F., Stoye, J.: Finding novel genes in bacterial communities isolated from the environment. BIOINFORMATICS. 22, e281-e289 (2006).
Krause, Lutz, Diaz, Naryttza N., Bartels, Daniela, Edwards, Robert A., Pühler, Alfred, Rohwer, Forest, Meyer, Folker, and Stoye, Jens. “Finding novel genes in bacterial communities isolated from the environment”. BIOINFORMATICS 22.14 (2006): e281-e289.

28 citations in Europe PMC

Data provided by Europe PubMed Central.

Discovering viral genomes in human metagenomic data by predicting unknown protein families.
Barrientos-Somarribas M, Messina DN, Pou C, Lysholm F, Bjerkner A, Allander T, Andersson B, Sonnhammer ELL., Sci Rep 8(1), 2018
PMID: 29311716
Successful heterologous expression of a novel chitinase identified by sequence analyses of the metagenome from a chitin-enriched soil sample.
Stöveken J, Singh R, Kolkenbrock S, Zakrzewski M, Wibberg D, Eikmeyer FG, Pühler A, Schlüter A, Moerschbacher BM., J Biotechnol 201(), 2015
PMID: 25240439
Gene prediction in metagenomic fragments based on the SVM algorithm.
Liu Y, Guo J, Hu G, Zhu H., BMC Bioinformatics 14 Suppl 5(), 2013
PMID: 23735199
Ab initio gene identification in metagenomic sequences.
Zhu W, Lomsadze A, Borodovsky M., Nucleic Acids Res 38(12), 2010
PMID: 20403810
FragGeneScan: predicting genes in short and error-prone reads.
Rho M, Tang H, Ye Y., Nucleic Acids Res 38(20), 2010
PMID: 20805240
Metagenomics approaches in systems microbiology.
Vieites JM, Guazzaroni ME, Beloqui A, Golyshin PN, Ferrer M., FEMS Microbiol Rev 33(1), 2009
PMID: 19054115
Laboratory procedures to generate viral metagenomes.
Thurber RV, Haynes M, Breitbart M, Wegley L, Rohwer F., Nat Protoc 4(4), 2009
PMID: 19300441
Metagenomics: Facts and Artifacts, and Computational Challenges*
Wooley JC, Ye Y., J Comput Sci Technol 25(1), 2009
PMID: 20648230
The microbial ocean from genomes to biomes.
DeLong EF., Nature 459(7244), 2009
PMID: 19444206
Orphelia: predicting genes in metagenomic sequencing reads.
Hoff KJ, Lingner T, Meinicke P, Tech M., Nucleic Acids Res 37(web server issue), 2009
PMID: 19429689
Bioinformatics challenges of new sequencing technology.
Pop M, Salzberg SL., Trends Genet 24(3), 2008
PMID: 18262676
Metagenomics in animal gastrointestinal ecosystem: Potential biotechnological prospects.
Singh B, Gautam SK, Verma V, Kumar M, Singh B., Anaerobe 14(3), 2008
PMID: 18457965
Gene prediction in metagenomic fragments: a large scale machine learning approach.
Hoff KJ, Tech M, Lingner T, Daniel R, Morgenstern B, Meinicke P., BMC Bioinformatics 9(), 2008
PMID: 18442389
Genomic DNA sequence comparison between two inbred soybean cyst nematode biotypes facilitated by massively parallel 454 micro-bead sequencing.
Bekal S, Craig JP, Hudson ME, Niblack TL, Domier LL, Lambert KN., Mol Genet Genomics 279(5), 2008
PMID: 18324416
Identification of new genes in Sinorhizobium meliloti using the Genome Sequencer FLX system.
Mao C, Evans C, Jensen RV, Sobral BW., BMC Microbiol 8(), 2008
PMID: 18454850
The metagenome of a biogas-producing microbial community of a production-scale biogas plant fermenter analysed by the 454-pyrosequencing technology.
Schlüter A, Bekel T, Diaz NN, Dondrup M, Eichenlaub R, Gartemann KH, Krahn I, Krause L, Krömeke H, Kruse O, Mussgnug JH, Neuweger H, Niehaus K, Pühler A, Runte KJ, Szczepanowski R, Tauch A, Tilker A, Viehöver P, Goesmann A., J Biotechnol 136(1-2), 2008
PMID: 18597880
Insight into the plasmid metagenome of wastewater treatment plant bacteria showing reduced susceptibility to antimicrobial drugs analysed by the 454-pyrosequencing technology.
Szczepanowski R, Bekel T, Goesmann A, Krause L, Krömeke H, Kaiser O, Eichler W, Pühler A, Schlüter A., J Biotechnol 136(1-2), 2008
PMID: 18586057
Emerging high-throughput approaches to analyze bioremediation of sites contaminated with hazardous and/or recalcitrant wastes.
Stenuit B, Eyers L, Schuler L, Agathos SN, George I., Biotechnol Adv 26(6), 2008
PMID: 18725284
The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes.
Meyer F, Paarmann D, D'Souza M, Olson R, Glass EM, Kubal M, Paczian T, Rodriguez A, Stevens R, Wilke A, Wilkening J, Edwards RA., BMC Bioinformatics 9(), 2008
PMID: 18803844
The development and impact of 454 sequencing.
Rothberg JM, Leamon JH., Nat Biotechnol 26(10), 2008
PMID: 18846085
A bioinformatician's guide to metagenomics.
Kunin V, Copeland A, Lapidus A, Mavromatis K, Hugenholtz P., Microbiol Mol Biol Rev 72(4), 2008
PMID: 19052320
Updating the metagenomics toolbox.
Gabor E, Liebeton K, Niehaus F, Eck J, Lorenz P., Biotechnol J 2(2), 2007
PMID: 17294408
Miniaturizing chemistry and biology in microdroplets.
Kelly BT, Baret JC, Taly V, Griffiths AD., Chem Commun (Camb) (18), 2007
PMID: 17476389
Get the most out of your metagenome: computational analysis of environmental sequence data.
Raes J, Foerstner KU, Bork P., Curr Opin Microbiol 10(5), 2007
PMID: 17936679



PMID: 16873483