A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification

Plötz T, Fink GA (2005)
In: Proc. Workshop Statistical Signal Processing. Bordeaux, France: IEEE.

Konferenzbeitrag | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Plötz, Thomas; Fink, Gernot A.
Abstract / Bemerkung
Currently probabilistic models of protein families, namely HMMs, are the methodology of choice for remote homology analysis. Unfortunately, the topology of such so-called Profile HMMs is rather complex which, despite sophisticated regularization techniques, is problematic for robust model estimation when only little training data is available. We propose a new HMM based protein family modeling method using building blocks which capture the essentials of particular targets only. They are estimated in a fully data-driven and unsupervised procedure. Contrary to current motif detection procedures we use a feature based protein sequence representation we developed earlier. Such small building blocks are automatically combined to global protein family HMMs which can be applied to remote homology analysis tasks. The results of an experimental evaluation on a challenging task of remote homology classification prove that robust models containing substantially smaller amounts of parameters can be estimated using the new modeling approach. The smaller the number of parameters to be trained, the smaller the number of training samples required which is of major importance for e.g.\ drug discovery tasks.
Erscheinungsjahr
2005
Titel des Konferenzbandes
Proc. Workshop Statistical Signal Processing
Page URI
https://pub.uni-bielefeld.de/record/2618342

Zitieren

Plötz T, Fink GA. A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification. In: Proc. Workshop Statistical Signal Processing. Bordeaux, France: IEEE; 2005.
Plötz, T., & Fink, G. A. (2005). A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification. Proc. Workshop Statistical Signal Processing
Plötz, Thomas, and Fink, Gernot A. 2005. “A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification”. In Proc. Workshop Statistical Signal Processing. Bordeaux, France: IEEE.
Plötz, T., and Fink, G. A. (2005). “A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification” in Proc. Workshop Statistical Signal Processing (Bordeaux, France: IEEE).
Plötz, T., & Fink, G.A., 2005. A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification. In Proc. Workshop Statistical Signal Processing. Bordeaux, France: IEEE.
T. Plötz and G.A. Fink, “A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification”, Proc. Workshop Statistical Signal Processing, Bordeaux, France: IEEE, 2005.
Plötz, T., Fink, G.A.: A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification. Proc. Workshop Statistical Signal Processing. IEEE, Bordeaux, France (2005).
Plötz, Thomas, and Fink, Gernot A. “A New Approach for HMM based Protein Sequence Modeling and its Application to Remote Homology Classification”. Proc. Workshop Statistical Signal Processing. Bordeaux, France: IEEE, 2005.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar