Approximating the DCJ distance of balanced genomes in linear time
Rubert D, Feijão P, Dias Vieira Braga M, Stoye J, Martinez FHV (2017)
Algorithms for Molecular Biology 12(1): 3.
Zeitschriftenaufsatz
| Veröffentlicht | Englisch
Download
Autor*in
Einrichtung
Abstract / Bemerkung
Background
Rearrangements are large-scale mutations in genomes, responsible for complex changes and structural variations. Most rearrangements that modify the organization of a genome can be represented by the double cut and join (DCJ) operation. Given two balanced genomes, i.e., two genomes that have exactly the same number of occurrences of each gene in each genome, we are interested in the problem of computing the rearrangement distance between them, i.e., finding the minimum number of DCJ operations that transform one genome into the other. This problem is known to be NP-hard.
Results
We propose a linear time approximation algorithm with approximation factor O(k) for the DCJ distance problem, where k is the maximum number of occurrences of any gene in the input genomes. Our algorithm works for linear and circular unichromosomal balanced genomes and uses as an intermediate step an O(k)-approximation for the minimum common string partition problem, which is closely related to the DCJ distance problem.
Conclusions
Experiments on simulated data sets show that our approximation algorithm is very competitive both in efficiency and in quality of the solutions.
Stichworte
Double cut and join (DCJ);
Genome rearrangements;
Comparative genomics;
Approximation algorithms
Erscheinungsjahr
2017
Zeitschriftentitel
Algorithms for Molecular Biology
Band
12
Ausgabe
1
Art.-Nr.
3
ISSN
1748-7188
Finanzierungs-Informationen
Open-Access-Publikationskosten wurden durch die Deutsche Forschungsgemeinschaft und die Universität Bielefeld gefördert.
Page URI
https://pub.uni-bielefeld.de/record/2909057
Zitieren
Rubert D, Feijão P, Dias Vieira Braga M, Stoye J, Martinez FHV. Approximating the DCJ distance of balanced genomes in linear time. Algorithms for Molecular Biology. 2017;12(1): 3.
Rubert, D., Feijão, P., Dias Vieira Braga, M., Stoye, J., & Martinez, F. H. V. (2017). Approximating the DCJ distance of balanced genomes in linear time. Algorithms for Molecular Biology, 12(1), 3. https://doi.org/10.1186/s13015-017-0095-y
Rubert, Diego, Feijão, Pedro, Dias Vieira Braga, Marília, Stoye, Jens, and Martinez, Fábio Henrique Viduani. 2017. “Approximating the DCJ distance of balanced genomes in linear time”. Algorithms for Molecular Biology 12 (1): 3.
Rubert, D., Feijão, P., Dias Vieira Braga, M., Stoye, J., and Martinez, F. H. V. (2017). Approximating the DCJ distance of balanced genomes in linear time. Algorithms for Molecular Biology 12:3.
Rubert, D., et al., 2017. Approximating the DCJ distance of balanced genomes in linear time. Algorithms for Molecular Biology, 12(1): 3.
D. Rubert, et al., “Approximating the DCJ distance of balanced genomes in linear time”, Algorithms for Molecular Biology, vol. 12, 2017, : 3.
Rubert, D., Feijão, P., Dias Vieira Braga, M., Stoye, J., Martinez, F.H.V.: Approximating the DCJ distance of balanced genomes in linear time. Algorithms for Molecular Biology. 12, : 3 (2017).
Rubert, Diego, Feijão, Pedro, Dias Vieira Braga, Marília, Stoye, Jens, and Martinez, Fábio Henrique Viduani. “Approximating the DCJ distance of balanced genomes in linear time”. Algorithms for Molecular Biology 12.1 (2017): 3.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Copyright Statement:
Dieses Objekt ist durch das Urheberrecht und/oder verwandte Schutzrechte geschützt. [...]
Volltext(e)
Name
Access Level
Open Access
Zuletzt Hochgeladen
2019-09-06T09:18:43Z
MD5 Prüfsumme
46aa2db63dc99fc0c8116c5eff36c4d9
Daten bereitgestellt von European Bioinformatics Institute (EBI)
1 Zitation in Europe PMC
Daten bereitgestellt von Europe PubMed Central.
Computing the family-free DCJ similarity.
Rubert DP, Hoshino EA, Braga MDV, Stoye J, Martinez FV., BMC Bioinformatics 19(suppl 6), 2018
PMID: 29745861
Rubert DP, Hoshino EA, Braga MDV, Stoye J, Martinez FV., BMC Bioinformatics 19(suppl 6), 2018
PMID: 29745861
15 References
Daten bereitgestellt von Europe PubMed Central.
Efficient sorting of genomic permutations by translocation, inversion and block interchange.
Yancopoulos S, Attie O, Friedberg R., Bioinformatics 21(16), 2005
PMID: 15951307
Yancopoulos S, Attie O, Friedberg R., Bioinformatics 21(16), 2005
PMID: 15951307
AUTHOR UNKNOWN, 0
Inapproximability of (1,2)-exemplar distance.
Bulteau L, Jiang M., IEEE/ACM Trans Comput Biol Bioinform 10(6), 2013
PMID: 24407297
Bulteau L, Jiang M., IEEE/ACM Trans Comput Biol Bioinform 10(6), 2013
PMID: 24407297
An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.
Shao M, Lin Y, Moret BM., J. Comput. Biol. 22(5), 2014
PMID: 25517208
Shao M, Lin Y, Moret BM., J. Comput. Biol. 22(5), 2014
PMID: 25517208
The complexity of calculating exemplar distances
Bryant D., 2000
Bryant D., 2000
AUTHOR UNKNOWN, 0
Approximating the true evolutionary distance between two genomes
Swenson K, Marron M, Earnest-DeYong K, Moret BME., 2005
Swenson K, Marron M, Earnest-DeYong K, Moret BME., 2005
AUTHOR UNKNOWN, 0
Reversal distance for strings with duplicates: linear time approximation using hitting set
Kolman P, Waleń T., 2007
Kolman P, Waleń T., 2007
AUTHOR UNKNOWN, 0
The solution space of sorting by DCJ.
Braga MD, Stoye J., J. Comput. Biol. 17(9), 2010
PMID: 20874401
Braga MD, Stoye J., J. Comput. Biol. 17(9), 2010
PMID: 20874401
Efficient tools for computing the number of breakpoints and the number of adjacencies between two genomes with duplicate genes.
Angibaud S, Fertin G, Rusu I, Thevenin A, Vialette S., J. Comput. Biol. 15(8), 2008
PMID: 18774903
Angibaud S, Fertin G, Rusu I, Thevenin A, Vialette S., J. Comput. Biol. 15(8), 2008
PMID: 18774903
Optimal suffix tree construction with large alphabets
Farach M., 1997
Farach M., 1997
AUTHOR UNKNOWN, 0
A simpler and faster 1.5-approximation algorithm for sorting by transpositions
Hartman T, Shamir R., 2006
Hartman T, Shamir R., 2006
Export
Markieren/ Markierung löschen
Markierte Publikationen
Web of Science
Dieser Datensatz im Web of Science®Quellen
PMID: 28293275
PubMed | Europe PMC
Suchen in