Index-based algorithms for motif search and their integration in a system for differential genome analysis

Beckstette M (2007)
Bielefeld (Germany): Bielefeld University.

Bielefelder E-Dissertation | Englisch
 
Download
OA
Autor*in
Beckstette, Michael
Gutachter*in / Betreuer*in
Giegerich, Robert (Prof. Dr.)
Abstract / Bemerkung
In this thesis, we present new efficient index-based algorithms for searching with position specific scoring matrices (PSSMs for short), a well known motif model, in large sequence sets, and their integration into an interactive system capable for large-scale differential comparative genome analyses. The newly developed and implemented index-based algorithms for searching with PSSMs clearly outperform existing methods in terms of running time. We also demonstrate how index based PSSM searching in combination with a fragment chaining approach can be used for efficient protein family classification, and for speeding up computation intensive database searching with hidden Markov models. With the PoSSuM software distribution, we also provide implementations of the presented algorithms in form of a flexible command line tool. We further integrated our newly developed algorithm possumsearch as a database search method in our integrated high-throughput sequence analysis system GENLIGHT, which is also a contribution of this work. GENLIGHT offers an interactive, biologist compatible, and user friendly environment for a variety of large-scale sequence analysis tasks with a special focus on (differential) comparative genome analyses. It employs a set oriented operational model, that allows to reuse generated results, and to perform complete analysis workflows in an interactive way. The system integrates several widely used sequence analysis methods and databases in a common environment, and is capable to perform analyses on a complete genome or proteome scale by employing a distributed client server approach, even for non index-based analysis methods. We demonstrate the practical usability of GENLIGHT with different case studies in which the system was used and which lead to substantial new scientific findings.
Stichworte
Genomprojekt , Mustervergleich , Algorithmus , Annotation , Bioinformatik , Sequenzanalyse , Vergleichende Genomanalyse , Position specific scoring matrice (PSSM) , Pattern matching , Annotation , Classification , Comparative genomics
Jahr
2007
Page URI
https://pub.uni-bielefeld.de/record/2305575

Zitieren

Beckstette M. Index-based algorithms for motif search and their integration in a system for differential genome analysis. Bielefeld (Germany): Bielefeld University; 2007.
Beckstette, M. (2007). Index-based algorithms for motif search and their integration in a system for differential genome analysis. Bielefeld (Germany): Bielefeld University.
Beckstette, Michael. 2007. Index-based algorithms for motif search and their integration in a system for differential genome analysis. Bielefeld (Germany): Bielefeld University.
Beckstette, M. (2007). Index-based algorithms for motif search and their integration in a system for differential genome analysis. Bielefeld (Germany): Bielefeld University.
Beckstette, M., 2007. Index-based algorithms for motif search and their integration in a system for differential genome analysis, Bielefeld (Germany): Bielefeld University.
M. Beckstette, Index-based algorithms for motif search and their integration in a system for differential genome analysis, Bielefeld (Germany): Bielefeld University, 2007.
Beckstette, M.: Index-based algorithms for motif search and their integration in a system for differential genome analysis. Bielefeld University, Bielefeld (Germany) (2007).
Beckstette, Michael. Index-based algorithms for motif search and their integration in a system for differential genome analysis. Bielefeld (Germany): Bielefeld University, 2007.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Copyright Statement:
Dieses Objekt ist durch das Urheberrecht und/oder verwandte Schutzrechte geschützt. [...]
Volltext(e)
Access Level
OA Open Access
Zuletzt Hochgeladen
2019-09-06T08:57:48Z
MD5 Prüfsumme
ea9da757e9dbe43844e1d4484242f6a7


Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar