Orthology Detection Combining Clustering and Synteny for Very Large Datasets

Lechner M, Hernandez-Rosales M, Dörr D, Wieseke N, Thévenin A, Stoye J, Hartmann RK, Prohaska SJ, Stadler PF (2014)
PLoS ONE 9(8).

Journal Article | Published | English
Abstract
The elucidation of orthology relationships is an important step both in gene function prediction and towards understanding patterns of sequence evolution. For large datasets, orthology assignments are usually derived directly from sequence similarities because more exact approaches are computationally too expensive. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the course of this work, FFAdj-MCS, a heuristic that assesses pairwise gene order using adjacencies (a similarity measure related to the breakpoint distance), was adapted to support multiple linear chromosomes and extended to detect duplicated regions. PoFF substantially reduces the number of false positives and enables more fine-grained predictions than purely similarity-based approaches. The extension maintains the low memory requirements and the efficient concurrency options of its basis, Proteinortho, making the software applicable to very large datasets.
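The idea of scoring gene order by adjacencies can be illustrated with a minimal sketch. It is not FFAdj-MCS itself: it assumes each gene occurs exactly once per genome and already carries a shared label across genomes, it ignores gene orientation, and the function names are hypothetical. It only counts neighbouring gene pairs that are conserved between two genomes made of several linear chromosomes.

```python
from typing import FrozenSet, Iterable, List, Set


def adjacencies(chromosomes: Iterable[List[str]]) -> Set[FrozenSet[str]]:
    """Collect unordered pairs of neighbouring genes on linear chromosomes.

    Assumption: chromosomes are linear, so the first and last gene of a
    chromosome are not treated as neighbours.
    """
    pairs: Set[FrozenSet[str]] = set()
    for chrom in chromosomes:
        for left, right in zip(chrom, chrom[1:]):
            pairs.add(frozenset((left, right)))
    return pairs


def conserved_adjacencies(genome_a: Iterable[List[str]],
                          genome_b: Iterable[List[str]]) -> int:
    """Count adjacencies present in both genomes (orientation ignored)."""
    return len(adjacencies(genome_a) & adjacencies(genome_b))


# Toy genomes (hypothetical gene labels), two chromosomes each.
genome_a = [["g1", "g2", "g3"], ["g4", "g5"]]
genome_b = [["g3", "g2", "g1", "g4"], ["g5"]]
print(conserved_adjacencies(genome_a, genome_b))
```

In this toy input the pairs g1-g2 and g2-g3 are neighbours in both genomes, while g4-g5 is broken up in the second genome. A higher count means more conserved gene order; the heuristic described in the abstract additionally has to cope with gene families and duplicated regions, which this sketch does not attempt.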
Financial disclosure
Article Processing Charge funded by the Deutsche Forschungsgemeinschaft and the Open Access Publication Fund of Bielefeld University.
Main File(s)
Access Level
OA Open Access
Last Uploaded
2016-11-18T14:30:03Z

2 Citations in Europe PMC

Data provided by Europe PubMed Central.

Genomic legacy of the African cheetah, Acinonyx jubatus.
Dobrynin P, Liu S, Tamazian G, Xiong Z, Yurchenko AA, Krasheninnikova K, Kliver S, Schmidt-Kuntzel A, Koepfli KP, Johnson W, Kuderna LF, Garcia-Perez R, Manuel Md, Godinez R, Komissarov A, Makunin A, Brukhin V, Qiu W, Zhou L, Li F, Yi J, Driscoll C, Antunes A, Oleksyk TK, Eizirik E, Perelman P, Roelke M, Wildt D, Diekhans M, Marques-Bonet T, Marker L, Bhak J, Wang J, Zhang G, O'Brien SJ, Genome Biol. 16, 2015
PMID: 26653294
An Effective Big Data Supervised Imbalanced Classification Approach for Ortholog Detection in Related Yeast Species.
Galpert D, Del Rio S, Herrera F, Ancede-Gallardo E, Antunes A, Aguero-Chapin G, Biomed Res Int 2015, 2015
PMID: 26605337

Sources

PMID: 25137074