Approximating the DCJ distance of balanced genomes in linear time

Rubert D, Feijão P, Dias Vieira Braga M, Stoye J, Martinez FHV (2017)
Algorithms for Molecular Biology 12(1): 3.

Journal article | Published | English
 
Abstract / Note
Background: Rearrangements are large-scale mutations in genomes, responsible for complex changes and structural variations. Most rearrangements that modify the organization of a genome can be represented by the double cut and join (DCJ) operation. Given two balanced genomes, i.e., two genomes that have exactly the same number of occurrences of each gene in each genome, we are interested in the problem of computing the rearrangement distance between them, i.e., finding the minimum number of DCJ operations that transform one genome into the other. This problem is known to be NP-hard.

Results: We propose a linear time approximation algorithm with approximation factor O(k) for the DCJ distance problem, where k is the maximum number of occurrences of any gene in the input genomes. Our algorithm works for linear and circular unichromosomal balanced genomes and uses as an intermediate step an O(k)-approximation for the minimum common string partition problem, which is closely related to the DCJ distance problem.

Conclusions: Experiments on simulated data sets show that our approximation algorithm is very competitive both in efficiency and in quality of the solutions.
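To make the two input conditions in the abstract concrete, the following minimal sketch checks whether two genomes are balanced and computes the parameter k. It is not the authors' algorithm; the representation (a genome as a list of signed integers, where the sign encodes gene orientation) and the function names `gene_counts`, `is_balanced`, and `max_occurrence` are assumptions made for illustration only.

```python
from collections import Counter


def gene_counts(genome):
    """Count occurrences of each gene, ignoring orientation (the sign).

    Assumption: a genome is a list of non-zero signed integers.
    """
    return Counter(abs(gene) for gene in genome)


def is_balanced(genome_a, genome_b):
    """Return True if every gene occurs equally often in both genomes."""
    return gene_counts(genome_a) == gene_counts(genome_b)


def max_occurrence(genome_a, genome_b):
    """Return k, the maximum number of occurrences of any gene in either genome."""
    counts_a = gene_counts(genome_a)
    counts_b = gene_counts(genome_b)
    return max(
        max(counts_a.values(), default=0),
        max(counts_b.values(), default=0),
    )


# Hypothetical example genomes: genes 1 and 2 occur twice, gene 3 once.
genome_a = [1, -2, 3, 2, 1]
genome_b = [2, 1, -3, -1, 2]

balanced = is_balanced(genome_a, genome_b)
k = max_occurrence(genome_a, genome_b)
```

Both checks take time linear in the genome length, so running them as a precondition would not change the linear running time stated in the abstract.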
Keywords
Double cut and join (DCJ); Genome rearrangements; Comparative genomics; Approximation algorithms
Publication year
2017
Journal title
Algorithms for Molecular Biology
Volume
12
Issue
1
Article number
3
ISSN
1748-7188
Funding information
Open access publication costs were funded by the Deutsche Forschungsgemeinschaft and Bielefeld University.
Page URI
https://pub.uni-bielefeld.de/record/2909057

Cite

Rubert, D., Feijão, P., Dias Vieira Braga, M., Stoye, J., & Martinez, F. H. V. (2017). Approximating the DCJ distance of balanced genomes in linear time. Algorithms for Molecular Biology, 12(1), 3. https://doi.org/10.1186/s13015-017-0095-y
All files available under the following license(s):
Copyright Statement:
This object is protected by copyright and/or related rights. [...]
Full text(s)
Access Level
OA Open Access
Last uploaded
2019-09-06T09:18:43Z
MD5 checksum
46aa2db63dc99fc0c8116c5eff36c4d9


1 citation in Europe PMC

Data provided by Europe PubMed Central.

Computing the family-free DCJ similarity.
Rubert DP, Hoshino EA, Braga MDV, Stoye J, Martinez FV., BMC Bioinformatics 19(suppl 6), 2018
PMID: 29745861

Sources

PMID: 28293275