Large scale hierarchical clustering of protein sequences

Krause A, Stoye J, Vingron M (2005)
BMC Bioinformatics 6(1): 15.

Journal article | Published | English
 
Abstract / Note
Background: Searching a biological sequence database with a query sequence looking for homologues has become a routine operation in computational biology. In spite of the high degree of sophistication of currently available search routines it is still virtually impossible to identify quickly and clearly a group of sequences that a given query sequence belongs to.
Results: We report on our developments in grouping all known protein sequences hierarchically into superfamily and family clusters. Our graph-based algorithms take into account the topology of the sequence space induced by the data itself to construct a biologically meaningful partitioning. We have applied our clustering procedures to a non-redundant set of about 1,000,000 sequences resulting in a hierarchical clustering which is being made available for querying and browsing at http://systers.molgen.mpg.de/.
Conclusions: Comparisons with other widely used clustering methods on various data sets show the abilities and strengths of our clustering methods in producing a biologically meaningful grouping of protein sequences.
Keywords
Protein Clustering; Clustering
Year of publication
2005
Journal title
BMC Bioinformatics
Volume
6
Issue
1
Page(s)
15
ISSN
1471-2105
Page URI
https://pub.uni-bielefeld.de/record/1775168

Cite

Krause A, Stoye J, Vingron M. Large scale hierarchical clustering of protein sequences. BMC Bioinformatics. 2005;6(1):15.
Krause, A., Stoye, J., & Vingron, M. (2005). Large scale hierarchical clustering of protein sequences. BMC Bioinformatics, 6(1), 15. doi:10.1186/1471-2105-6-15
Krause, A., Stoye, J., and Vingron, M. (2005). Large scale hierarchical clustering of protein sequences. BMC Bioinformatics 6, 15.
Krause, A., Stoye, J., & Vingron, M., 2005. Large scale hierarchical clustering of protein sequences. BMC Bioinformatics, 6(1), p 15.
A. Krause, J. Stoye, and M. Vingron, “Large scale hierarchical clustering of protein sequences”, BMC Bioinformatics, vol. 6, 2005, pp. 15.
Krause, A., Stoye, J., Vingron, M.: Large scale hierarchical clustering of protein sequences. BMC Bioinformatics. 6, 15 (2005).
Krause, Antje, Stoye, Jens, and Vingron, Martin. “Large scale hierarchical clustering of protein sequences”. BMC Bioinformatics 6.1 (2005): 15.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Full text(s)
Access Level
OA Open Access
Last uploaded
2019-09-06T08:48:16Z
MD5 checksum
473a25334a1f7370968ee41ee08aec11

Sources

PMID: 15663796