A novel approach to remote homology detection: jumping alignments

Spang R, Rehmsmeier M, Stoye J (2002)
Journal of Computational Biology 9(5): 747-760.

Journal Article | Published | English
Abstract
We describe a new algorithm for protein classification and the detection of remote homologs. The rationale is to exploit both the vertical and the horizontal information of a multiple alignment in a well-balanced manner. This contrasts with established methods: profiles and profile hidden Markov models focus on vertical information, because they model the columns of the alignment independently, whereas family pairwise search focuses on horizontal information, because it treats the given sequences separately. In our setting, we want to select from a given database of "candidate sequences" those proteins that belong to a given superfamily. To do so, each candidate sequence is tested separately against a multiple alignment of the known members of the superfamily by means of a new jumping alignment algorithm. This algorithm is an extension of the Smith-Waterman algorithm and computes a local alignment of a single sequence and a multiple alignment. In contrast to traditional methods, however, this alignment is not based on a summary of the individual columns of the multiple alignment. Rather, at each position the candidate sequence is aligned to one sequence of the multiple alignment, called the "reference sequence". The reference sequence may change within the alignment, and each such jump is penalized. To evaluate the discriminative quality of the jumping alignment algorithm, we compare it to profiles, profile hidden Markov models, and family pairwise search on a subset of the SCOP database of protein domains. The discriminative quality is assessed by median false positive counts (med-FP-counts). For moderate med-FP-counts, the number of successful searches with our method is considerably higher than with the competing methods.
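
The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a simple match/mismatch score instead of a substitution matrix, a linear gap cost, gap characters written as "-" in the alignment rows, and a fixed jump cost that may be paid between any two dynamic-programming cells. The function name jumping_alignment_score and all parameter names and default values are hypothetical.

```python
def jumping_alignment_score(candidate, alignment,
                            match=2, mismatch=-1, gap=2, jump=1):
    """Best local score of `candidate` against the rows of `alignment`.

    Simplified sketch: D[i][j][k] is the best score of a local alignment
    that ends at candidate position i and alignment column j while row k
    is the current reference sequence. Changing the reference row
    costs `jump` (assumed constant here).
    """
    n, m, rows = len(candidate), len(alignment[0]), len(alignment)
    D = [[[0] * rows for _ in range(m + 1)] for _ in range(n + 1)]
    # B[i][j] caches the maximum of D[i][j][k] over all reference rows k.
    B = [[0] * (m + 1) for _ in range(n + 1)]
    best = 0

    def prev(i, j, k):
        # Either stay on row k, or jump to k from the best row and pay.
        return max(D[i][j][k], B[i][j] - jump)

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            for k in range(rows):
                ref = alignment[k][j - 1]
                if ref == "-":
                    # The reference row has no residue in this column:
                    # skip the column for free, or insert a residue.
                    options = (prev(i, j - 1, k),
                               prev(i - 1, j, k) - gap)
                else:
                    s = match if candidate[i - 1] == ref else mismatch
                    options = (prev(i - 1, j - 1, k) + s,   # (mis)match
                               prev(i, j - 1, k) - gap,     # deletion
                               prev(i - 1, j, k) - gap)     # insertion
                # The floor at 0 makes the alignment local (Smith-Waterman).
                D[i][j][k] = max(0, *options)
            B[i][j] = max(D[i][j])
            best = max(best, B[i][j])
    return best
```

With a single alignment row the sketch reduces to an ordinary Smith-Waterman score, and a very large jump cost effectively restricts each alignment to one reference sequence, which is close in spirit to searching each family member separately.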

Cite this

Spang, Rainer, Rehmsmeier, Marc, and Stoye, Jens. “A novel approach to remote homology detection: jumping alignments”. Journal of Computational Biology 9.5 (2002): 747-760.


14 Citations in Europe PMC

Data provided by Europe PubMed Central.

Pareto optimization in algebraic dynamic programming.
Saule C, Giegerich R., Algorithms Mol Biol 10, 2015
PMID: 26150892
Haploid to diploid alignment for variation calling assessment.
Makinen V, Rahkola J., BMC Bioinformatics 14 Suppl 15, 2013
PMID: 24564537
Probabilistic inference of viral quasispecies subject to recombination.
Topfer A, Zagordi O, Prabhakaran S, Roth V, Halperin E, Beerenwinkel N., J. Comput. Biol. 20(2), 2013
PMID: 23383997
Classification of HIV-1 sequences using profile Hidden Markov Models.
Dwivedi SK, Sengupta S., PLoS ONE 7(5), 2012
PMID: 22623958
TCRep 3D: an automated in silico approach to study the structural properties of TCR repertoires.
Leimgruber A, Ferber M, Irving M, Hussain-Kahn H, Wieckowski S, Derre L, Rufer N, Zoete V, Michielin O., PLoS ONE 6(10), 2011
PMID: 22053188
HIV classification using the coalescent theory.
Bulla I, Schultz AK, Schreiber F, Zhang M, Leitner T, Korber B, Morgenstern B, Stanke M., Bioinformatics 26(11), 2010
PMID: 20400454
Homology and phylogeny and their automated inference.
Fuellen G., Naturwissenschaften 95(6), 2008
PMID: 18288471
jpHMM at GOBICS: a web server to detect genomic recombinations in HIV-1.
Zhang M, Schultz AK, Calef C, Kuiken C, Leitner T, Korber B, Morgenstern B, Stanke M., Nucleic Acids Res. 34(Web Server issue), 2006
PMID: 16845050
A jumping profile Hidden Markov Model and applications to recombination sites in HIV and HCV genomes.
Schultz AK, Zhang M, Leitner T, Kuiken C, Korber B, Morgenstern B, Stanke M., BMC Bioinformatics 7, 2006
PMID: 16716226
Recco: recombination analysis using cost optimization.
Maydt J, Lengauer T., Bioinformatics 22(9), 2006
PMID: 16488909
A sequence sub-sampling algorithm increases the power to detect distant homologues.
Johnston CR, Shields DC., Nucleic Acids Res. 33(12), 2005
PMID: 16006623
A robust method to detect structural and functional remote homologues.
Shachar O, Linial M., Proteins 57(3), 2004
PMID: 15382232



Sources

PMID: 12487762