A general method for fast multiple sequence alignment
Tönges U, Perrey SW, Stoye J, Dress A (1996)
Gene 172(1): GC33-GC41.
Zeitschriftenaufsatz
| Veröffentlicht | Englisch
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Abstract / Bemerkung
We have developed a fast heuristic algorithm for multiple sequence alignment which provides near-to-optimal results for sufficiently homologous sequences. The algorithm makes use of the standard dynamic programming procedure by applying it to all pairs of sequences. The resulting score matrices for pair-wise alignment give rise to secondary matrices containing the additional charges imposed by forcing the alignment path to run through a particular vertex. Such a constraint corresponds to slicing the sequences at the positions defining that vertex, and aligning the remaining pairs of prefix and suffix sequences separately. From these secondary matrices, one can compute - for any given family of sequences - suitable positions for cutting all of these sequences simultaneously, thus reducing the problem of aligning a family of n sequences of average length l in a Divide and Conquer fashion to aligning two families of n sequences of approximately half that length. In this paper, we explain the method for the case of 3 sequences in detail, and we demonstrate its potential and its limits by discussing its behaviour for several test families. A generalization for aligning more than 3 sequences is lined out, and some actual alignments constructed by our algorithm for various user-defined parameters are presented.
Stichworte
Secondary matrix;
Pair-wise sequence alignment;
Divide and conquer;
Multiple sequence alignment;
Dynamic programming
Erscheinungsjahr
1996
Zeitschriftentitel
Gene
Band
172
Ausgabe
1
Seite(n)
GC33-GC41
ISSN
0378-1119
Page URI
https://pub.uni-bielefeld.de/record/1773351
Zitieren
Tönges U, Perrey SW, Stoye J, Dress A. A general method for fast multiple sequence alignment. Gene. 1996;172(1):GC33-GC41.
Tönges, U., Perrey, S. W., Stoye, J., & Dress, A. (1996). A general method for fast multiple sequence alignment. Gene, 172(1), GC33-GC41. https://doi.org/10.1016/0378-1119(96)00123-0
Tönges, Udo, Perrey, Sören W., Stoye, Jens, and Dress, Andreas. 1996. “A general method for fast multiple sequence alignment”. Gene 172 (1): GC33-GC41.
Tönges, U., Perrey, S. W., Stoye, J., and Dress, A. (1996). A general method for fast multiple sequence alignment. Gene 172, GC33-GC41.
Tönges, U., et al., 1996. A general method for fast multiple sequence alignment. Gene, 172(1), p GC33-GC41.
U. Tönges, et al., “A general method for fast multiple sequence alignment”, Gene, vol. 172, 1996, pp. GC33-GC41.
Tönges, U., Perrey, S.W., Stoye, J., Dress, A.: A general method for fast multiple sequence alignment. Gene. 172, GC33-GC41 (1996).
Tönges, Udo, Perrey, Sören W., Stoye, Jens, and Dress, Andreas. “A general method for fast multiple sequence alignment”. Gene 172.1 (1996): GC33-GC41.
Daten bereitgestellt von European Bioinformatics Institute (EBI)
11 Zitationen in Europe PMC
Daten bereitgestellt von Europe PubMed Central.
MISHIMA--a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data.
Kryukov K, Saitou N., BMC Bioinformatics 11(), 2010
PMID: 20298584
Kryukov K, Saitou N., BMC Bioinformatics 11(), 2010
PMID: 20298584
Incorporation of gap characters and lineage-specific regions into phylogenetic analyses of gene families from divergent clades: an example from the kinesin superfamily across eukaryotes
Simmons MarkP, Richardson Dale, Reddy AnireddySN., Cladistics 24(3), 2008
PMID: IND44057158
Simmons MarkP, Richardson Dale, Reddy AnireddySN., Cladistics 24(3), 2008
PMID: IND44057158
The relative sensitivity of different alignment methods and character codings in sensitivity analysis
Simmons MarkP, Müller KaiF, Webb ColleenT., Cladistics 24(6), 2008
PMID: IND44127586
Simmons MarkP, Müller KaiF, Webb ColleenT., Cladistics 24(6), 2008
PMID: IND44127586
Dynamic programming algorithms for comparing multineuronal spike trains via cost-based metrics and alignments.
Victor JD, Goldberg DH, Gardner D., J Neurosci Methods 161(2), 2007
PMID: 17174403
Victor JD, Goldberg DH, Gardner D., J Neurosci Methods 161(2), 2007
PMID: 17174403
Global multiple-sequence alignment with repeats.
Sammeth M, Heringa J., Proteins 64(1), 2006
PMID: 16609972
Sammeth M, Heringa J., Proteins 64(1), 2006
PMID: 16609972
A memory-efficient algorithm for multiple sequence alignment with constraints.
Lu CL, Huang YP., Bioinformatics 21(1), 2005
PMID: 15374876
Lu CL, Huang YP., Bioinformatics 21(1), 2005
PMID: 15374876
Detailed protein sequence alignment based on Spectral Similarity Score (SSS).
Gupta K, Thomas D, Vidya SV, Venkatesh KV, Ramakumar S., BMC Bioinformatics 6(), 2005
PMID: 15850477
Gupta K, Thomas D, Vidya SV, Venkatesh KV, Ramakumar S., BMC Bioinformatics 6(), 2005
PMID: 15850477
MCALIGN: stochastic alignment of noncoding DNA sequences based on an evolutionary model of sequence evolution.
Keightley PD, Johnson T., Genome Res 14(3), 2004
PMID: 14993209
Keightley PD, Johnson T., Genome Res 14(3), 2004
PMID: 14993209
The performance of several multiple-sequence alignment programs in relation to secondary-structure features for an rRNA sequence.
Hickson RE, Simon C, Perrey SW., Mol Biol Evol 17(4), 2000
PMID: 10742045
Hickson RE, Simon C, Perrey SW., Mol Biol Evol 17(4), 2000
PMID: 10742045
Multiple sequence alignment: algorithms and applications.
Gotoh O., Adv Biophys 36(), 1999
PMID: 10463075
Gotoh O., Adv Biophys 36(), 1999
PMID: 10463075
Multiple sequence alignment with the Divide-and-Conquer method.
Stoye J., Gene 211(2), 1998
PMID: 9669886
Stoye J., Gene 211(2), 1998
PMID: 9669886
38 References
Daten bereitgestellt von Europe PubMed Central.
Introduction to Computational Biology
Waterman, 1995
Waterman, 1995
AUTHOR UNKNOWN, 1983
A survey of multiple sequence comparison methods.
Chan SC, Wong AK, Chiu DK., Bull. Math. Biol. 54(4), 1992
PMID: 1591533
Chan SC, Wong AK, Chiu DK., Bull. Math. Biol. 54(4), 1992
PMID: 1591533
An overview of sequence comparison algorithms in molecular biology
Myers, 1991
Myers, 1991
Simultaneous comparison of three protein sequences
Murata, 1985
Murata, 1985
Three-way Needleman-Wunsch algorithm
Murata, 1990
Murata, 1990
The Multiple Sequence Alignment Problem in Biology.
Carrillo H, Lipman D., SIAM J Appl Math 48(5), 1988
PMID: c6805
Carrillo H, Lipman D., SIAM J Appl Math 48(5), 1988
PMID: c6805
Alignment of three sequences in quadratic space
Huang, Appl. Comput. Rev. 1(2), 1993
Huang, Appl. Comput. Rev. 1(2), 1993
A fast algorithm for the optimal alignment of three strings.
Allison L., J. Theor. Biol. 164(2), 1993
PMID: 8246519
Allison L., J. Theor. Biol. 164(2), 1993
PMID: 8246519
A flexible method to align large numbers of biological sequences.
Taylor WR., J. Mol. Evol. 28(1-2), 1988
PMID: 3148736
Taylor WR., J. Mol. Evol. 28(1-2), 1988
PMID: 3148736
CLUSTAL: a package for performing multiple sequence alignment on a microcomputer.
Higgins DG, Sharp PM., Gene 73(1), 1988
PMID: 3243435
Higgins DG, Sharp PM., Gene 73(1), 1988
PMID: 3243435
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
Thompson JD, Higgins DG, Gibson TJ., Nucleic Acids Res. 22(22), 1994
PMID: 7984417
Thompson JD, Higgins DG, Gibson TJ., Nucleic Acids Res. 22(22), 1994
PMID: 7984417
The maximum weight trace problem in multiple sequence alignment
Kececioglu, 1993
Kececioglu, 1993
Approximation algorithms for multiple sequence alignment
Bafna, 1994
Bafna, 1994
Progressive sequence alignment as a prerequisite to correct phylogenetic trees.
Feng DF, Doolittle RF., J. Mol. Evol. 25(4), 1987
PMID: 3118049
Feng DF, Doolittle RF., J. Mol. Evol. 25(4), 1987
PMID: 3118049
MASCOT: multiple alignment system for protein sequences based on three-way dynamic programming.
Hirosawa M, Hoshida M, Ishikawa M, Toya T., Comput. Appl. Biosci. 9(2), 1993
PMID: 8481818
Hirosawa M, Hoshida M, Ishikawa M, Toya T., Comput. Appl. Biosci. 9(2), 1993
PMID: 8481818
A method for the simultaneous alignment of three or more amino acid sequences.
Johnson MS, Doolittle RF., J. Mol. Evol. 23(3), 1986
PMID: 3100815
Johnson MS, Doolittle RF., J. Mol. Evol. 23(3), 1986
PMID: 3100815
Multiple sequence alignment by a pairwise algorithm.
Taylor WR., Comput. Appl. Biosci. 3(2), 1987
PMID: 3453222
Taylor WR., Comput. Appl. Biosci. 3(2), 1987
PMID: 3453222
Motif recognition and alignment for many sequences by comparison of dot-matrices.
Vingron M, Argos P., J. Mol. Biol. 218(1), 1991
PMID: 1900535
Vingron M, Argos P., J. Mol. Biol. 218(1), 1991
PMID: 1900535
Line geometries for sequence comparisons
Waterman, Bull. Math. Biol. 48(), 1984
Waterman, Bull. Math. Biol. 48(), 1984
Alignment of three biological sequences with an efficient traceback procedure.
Gotoh O., J. Theor. Biol. 121(3), 1986
PMID: 3795999
Gotoh O., J. Theor. Biol. 121(3), 1986
PMID: 3795999
A general method applicable to the search for similarities in the amino acid sequence of two proteins.
Needleman SB, Wunsch CD., J. Mol. Biol. 48(3), 1970
PMID: 5420325
Needleman SB, Wunsch CD., J. Mol. Biol. 48(3), 1970
PMID: 5420325
A divide and conquer approach to multiple alignment
Dress, 1995
Dress, 1995
A linear space algorithm for computing maximal common subsequences
Hirschberg, Commun. ACM 18(6), 1975
Hirschberg, Commun. ACM 18(6), 1975
Determination of reliable regions in protein sequence alignments.
Vingron M, Argos P., Protein Eng. 3(7), 1990
PMID: 2217130
Vingron M, Argos P., Protein Eng. 3(7), 1990
PMID: 2217130
On near-optimal alignments of biological sequences
Naor, J. Comp. Biol. 1(4), 1994
Naor, J. Comp. Biol. 1(4), 1994
Weights for data related by a tree.
Altschul SF, Carroll RJ, Lipman DJ., J. Mol. Biol. 207(4), 1989
PMID: 2760928
Altschul SF, Carroll RJ, Lipman DJ., J. Mol. Biol. 207(4), 1989
PMID: 2760928
Weighting aligned protein or nucleic acid sequences to correct for unequal representation.
Sibbald PR, Argos P., J. Mol. Biol. 216(4), 1990
PMID: 2176240
Sibbald PR, Argos P., J. Mol. Biol. 216(4), 1990
PMID: 2176240
Weighting in sequence space: a comparison of methods in terms of generalized sequences
Vingron, 1993
Vingron, 1993
A tool for multiple sequence alignment
Lipman, 1989
Lipman, 1989
Improving the practical space and time efficiency of the shortest-Paths approach to sum-of-pairs multiple sequence alignment
Gupta, J. Comp. Biol. 2(3), 1995
Gupta, J. Comp. Biol. 2(3), 1995
Optimal alignments in linear space.
Myers EW, Miller W., Comput. Appl. Biosci. 4(1), 1988
PMID: 3382986
Myers EW, Miller W., Comput. Appl. Biosci. 4(1), 1988
PMID: 3382986
On the complexity of multiple sequence alignment
Wang, J. Comp. Biol. 1(4), 1994
Wang, J. Comp. Biol. 1(4), 1994
Unified approach to alignment and phylogenies
Hein, 1990
Hein, 1990
Medians in median graphs
Bandelt, Discr. Appl. Math. 8(), 1984
Bandelt, Discr. Appl. Math. 8(), 1984
Frequency of insertion-deletion, transversion and transition in the evolution of 5S ribosomal RNA
Sankoff, J. Mol. Biol. 7(), 1976
Sankoff, J. Mol. Biol. 7(), 1976
Towards integration of multiple alignment and phylogenetic tree reconstruction
Vingron, Arbeitspapiere 852(), 1994
Vingron, Arbeitspapiere 852(), 1994
Export
Markieren/ Markierung löschen
Markierte Publikationen
Web of Science
Dieser Datensatz im Web of Science®Quellen
PMID: 8654965
PubMed | Europe PMC
Suchen in