TY - JOUR
AB - Suffix arrays are used in various applications and research areas like data compression or computational biology. In this work, our goal is to characterise the combinatorial properties of suffix arrays and their enumeration. For a fixed alphabet size and string length, we divide the set of all strings into equivalence classes of strings that share the same suffix array. For each such equivalence class, we count the number of strings contained in it. We also give exact formulas for computing the number of equivalence classes. Our methods yield a lower bound for the compressibility of suffix arrays and build the foundation for the efficient generation of appropriate test data sets for suffix array based algorithms. We also show that summing up the elements of all equivalence classes forms a particular instance for some summation identities of Eulerian numbers. (c) 2008 Elsevier B.V. All rights reserved.
AU - Schürmann, Klaus-Bernd
AU - Stoye, Jens
ID - 1587078
IS - 2-3
JF - Theoretical Computer Science
KW - permutations
KW - strings
KW - suffix arrays
SN - 0304-3975
TI - Counting suffix arrays and strings
VL - 395
ER -