Comparative homology agreement search: An effective combination of homology-search methods

Alam I, Dress A, Rehmsmeier M, Fuellen G (2004)
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 101(38): 13814-13819.

Download
Es wurde kein Volltext hochgeladen. Nur Publikationsnachweis!
Zeitschriftenaufsatz | Veröffentlicht | Englisch
Autor
; ; ;
Abstract / Bemerkung
Many methods have been developed to search for homologous members of a protein family in databases, and the reliability of results and conclusions may be compromised if only one method is used, neglecting the others. Here we introduce a general scheme for combining such methods. Based on this scheme, we implemented a tool called comparative homology agreement search (CHASE) that integrates different search strategies to obtain a combined "E value." Our results show that a consensus method integrating distinct strategies easily outperforms any of its component algorithms. More specifically, an evaluation based on the Structural Classification of Proteins database reveals that, on average, a coverage of 47% can be obtained in searches for distantly related homologues (i.e., members of the same superfamily but not the same family, which is a very difficult task), accepting only 10 false positives, whereas the individual methods obtain a coverage of 28-38%.
Erscheinungsjahr
Zeitschriftentitel
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
Band
101
Ausgabe
38
Seite(n)
13814-13819
ISSN
eISSN
PUB-ID

Zitieren

Alam I, Dress A, Rehmsmeier M, Fuellen G. Comparative homology agreement search: An effective combination of homology-search methods. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA. 2004;101(38):13814-13819.
Alam, I., Dress, A., Rehmsmeier, M., & Fuellen, G. (2004). Comparative homology agreement search: An effective combination of homology-search methods. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 101(38), 13814-13819. doi:10.1073/pnas.0405612101
Alam, I., Dress, A., Rehmsmeier, M., and Fuellen, G. (2004). Comparative homology agreement search: An effective combination of homology-search methods. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 101, 13814-13819.
Alam, I., et al., 2004. Comparative homology agreement search: An effective combination of homology-search methods. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 101(38), p 13814-13819.
I. Alam, et al., “Comparative homology agreement search: An effective combination of homology-search methods”, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 101, 2004, pp. 13814-13819.
Alam, I., Dress, A., Rehmsmeier, M., Fuellen, G.: Comparative homology agreement search: An effective combination of homology-search methods. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA. 101, 13814-13819 (2004).
Alam, I, Dress, Andreas, Rehmsmeier, Marc, and Fuellen, Georg. “Comparative homology agreement search: An effective combination of homology-search methods”. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 101.38 (2004): 13814-13819.

10 Zitationen in Europe PMC

Daten bereitgestellt von Europe PubMed Central.

Improved performance of sequence search approaches in remote homology detection.
Joshi AG, Raghavender US, Sowdhamini R., F1000Res 2(), 2013
PMID: 25469226
Learning biomarkers of pluripotent stem cells in mouse.
Scheubert L, Schmidt R, Repsilber D, Lustrek M, Fuellen G., DNA Res 18(4), 2011
PMID: 21791477
Bioinformatics for biodefense: challenges and opportunities.
Valdivia-Granda WA., Biosecur Bioterror 8(1), 2010
PMID: 20230234
TransportTP: a two-phase classification approach for membrane transporter prediction and characterization.
Li H, Benedito VA, Udvardi MK, Zhao PX., BMC Bioinformatics 10(), 2009
PMID: 20003433
Homology and phylogeny and their automated inference.
Fuellen G., Naturwissenschaften 95(6), 2008
PMID: 18288471
Identification and preliminary function study of Xenopus laevis DRR1 gene.
Zhao XY, Liang SF, Yao SH, Ma FX, Hu ZG, Yan F, Yuan Z, Ruan XZ, Yang HS, Zhou Q, Wei YQ., Biochem Biophys Res Commun 361(1), 2007
PMID: 17637457
Comparative genome analysis across a kingdom of eukaryotic organisms: specialization and diversification in the fungi.
Cornell MJ, Alam I, Soanes DM, Wong HM, Hedeler C, Paton NW, Rattray M, Hubbard SJ, Talbot NJ, Oliver SG., Genome Res 17(12), 2007
PMID: 17984228

27 References

Daten bereitgestellt von Europe PubMed Central.

Basic local alignment search tool.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ., J. Mol. Biol. 215(3), 1990
PMID: 2231712
Applying motif and profile searches.
Bork P, Gibson TJ., Meth. Enzymol. 266(), 1996
PMID: 8743684
Profile hidden Markov models.
Eddy SR., Bioinformatics 14(9), 1998
PMID: 9918945
Homology detection via family pairwise search.
Grundy WN., J. Comput. Biol. 5(3), 1998
PMID: 9773344
Protein sequence similarity searches using patterns as seeds.
Zhang Z, Schaffer AA, Miller W, Madden TL, Lipman DJ, Koonin EV, Altschul SF., Nucleic Acids Res. 26(17), 1998
PMID: 9705509
Hidden Markov models.
Eddy SR., Curr. Opin. Struct. Biol. 6(3), 1996
PMID: 8804822
The PROSITE database, its status in 1997.
Bairoch A, Bucher P, Hofmann K., Nucleic Acids Res. 25(1), 1997
PMID: 9016539

AUTHOR UNKNOWN, 1999
Finding flexible patterns in unaligned protein sequences.
Jonassen I, Collins JF, Higgins DG., Protein Sci. 4(8), 1995
PMID: 8520485
Efficient discovery of conserved patterns using a pattern graph.
Jonassen I., Comput. Appl. Biosci. 13(5), 1997
PMID: 9367124
The InterPro database, an integrated documentation resource for protein families, domains and functional sites.
Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, Bucher P, Cerutti L, Corpet F, Croning MD, Durbin R, Falquet L, Fleischmann W, Gouzy J, Hermjakob H, Hulo N, Jonassen I, Kahn D, Kanapin A, Karavidopoulou Y, Lopez R, Marx B, Mulder NJ, Oinn TM, Pagni M, Servant F, Sigrist CJ, Zdobnov EM., Nucleic Acids Res. 29(1), 2001
PMID: 11125043
The MetaFam Server: a comprehensive protein family resource.
Silverstein KA, Shoop E, Johnson JE, Kilian A, Freeman JL, Kunau TM, Awad IA, Mayer M, Retzel EF., Nucleic Acids Res. 29(1), 2001
PMID: 11125046
Pcons: a neural-network-based consensus predictor that improves fold recognition.
Lundstrom J, Rychlewski L, Bujnicki J, Elofsson A., Protein Sci. 10(11), 2001
PMID: 11604541
JPred: a consensus secondary structure prediction server.
Cuff JA, Clamp ME, Siddiqui AS, Finlay M, Barton GJ., Bioinformatics 14(10), 1998
PMID: 9927721
Phylogenetic information improves homology detection.
Rehmsmeier M, Vingron M., Proteins 45(4), 2001
PMID: 11746684
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ., Nucleic Acids Res. 25(17), 1997
PMID: 9254694
Combining evidence using p-values: application to sequence homology searches.
Bailey TL, Gribskov M., Bioinformatics 14(1), 1998
PMID: 9520501
The PROSITE database, its status in 2002.
Falquet L, Pagni M, Bucher P, Hulo N, Sigrist CJ, Hofmann K, Bairoch A., Nucleic Acids Res. 30(1), 2002
PMID: 11752303
The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.
Bairoch A, Apweiler R., Nucleic Acids Res. 28(1), 2000
PMID: 10592178
SCOP: a structural classification of proteins database for the investigation of sequences and structures.
Murzin AG, Brenner SE, Hubbard T, Chothia C., J. Mol. Biol. 247(4), 1995
PMID: 7723011

AUTHOR UNKNOWN, 1989

AUTHOR UNKNOWN, 1994
Phase4: automatic evaluation of database search methods.
Rehmsmeier M., Brief. Bioinformatics 3(4), 2002
PMID: 12511063
ProtoMap: automatic classification of protein sequences and hierarchy of protein families.
Yona G, Linial N, Linial M., Nucleic Acids Res. 28(1), 2000
PMID: 10592179

AUTHOR UNKNOWN, 1988

AUTHOR UNKNOWN, 2002

Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Web of Science

Dieser Datensatz im Web of Science®

Quellen

PMID: 15367730
PubMed | Europe PMC

Suchen in

Google Scholar