On the probabilistic longest common subsequence problem for sequences of independent blocks

Torres F (2009)
Bielefeld (Germany): Bielefeld University.

Download
OA
Bielefeld Dissertation | English
Author
Supervisor
Götze, Friedrich (Prof. Dr.)
Abstract
Let X and Y be two binary random strings of length n independent of each other. Let Ln denote the length of the Longest Common Subsequence (LCS) of X and Y. In general the order of magnitude in n of VAR[Ln] is not known. So far, Matzinger and his collaborators had been able to prove that VAR[Ln] has order asymptotic n in few cases depending on the distribution of symbols in the sequences X and Y. In this thesis, we prove that VAR[Ln] has order asymptotic n considering a model which is not low entropy, in comparison with some previous cases which are low entropy models. The model for the distribution we consider here is i.i.d. sequences of blocks, where blocks are words consisting only of one symbol. In the present case all the blocks have length l-1, l or l+1 with probability 1/3 for a given integer parameter l>0. We reduce the problem of proving that VAR[Ln] has order asymptotic n to showing that a function under an entropy constraint does not go below zero. The method which we develop could be used for many other more complex cases wherever one pattern tends to influence the LCS score in a biased way. Also, a natural question is what happens if one has a more realistic situation than i.i.d, like Markov chains of words for instance (DNA models). This thesis partially answers this question since the model considered is a Markov chain, but the more general case where one can have blocks with different lengths (for instance, in the Bernoulli case a block of length j>0 has length 0.5j) is still open for future research though the techniques shown here give us new tools for approaching it.
Year
PUB-ID

Cite this

Torres F. On the probabilistic longest common subsequence problem for sequences of independent blocks. Bielefeld (Germany): Bielefeld University; 2009.
Torres, F. (2009). On the probabilistic longest common subsequence problem for sequences of independent blocks. Bielefeld (Germany): Bielefeld University.
Torres, F. (2009). On the probabilistic longest common subsequence problem for sequences of independent blocks. Bielefeld (Germany): Bielefeld University.
Torres, F., 2009. On the probabilistic longest common subsequence problem for sequences of independent blocks, Bielefeld (Germany): Bielefeld University.
F. Torres, On the probabilistic longest common subsequence problem for sequences of independent blocks, Bielefeld (Germany): Bielefeld University, 2009.
Torres, F.: On the probabilistic longest common subsequence problem for sequences of independent blocks. Bielefeld University, Bielefeld (Germany) (2009).
Torres, Felipe. On the probabilistic longest common subsequence problem for sequences of independent blocks. Bielefeld (Germany): Bielefeld University, 2009.
Main File(s)
File Name
Access Level
OA Open Access

This data publication is cited in the following publications:
This publication cites the following data publications:

Export

0 Marked Publications

Open Data PUB

Search this title in

Google Scholar