Representing the Suffix Tree with the CDAWG

dc.contributor.authorBelazzougui, Djamal
dc.contributor.authorCunial, Fabio
dc.date.accessioned2017-07-31T15:34:19Z
dc.date.available2017-07-31T15:34:19Z
dc.date.issued2017-07-04
dc.description.abstractGiven a string T, it is known that its suffix tree can be represented using the compact directed acyclic word graph (CDAWG) with e_T arcs, taking overall O(e_T+e_REV(T)) words of space, where REV(T) is the reverse of T, and supporting some key operations in time between O(1) and O(log(log(n))) in the worst case. This representation is especially appealing for highly repetitive strings, like collections of similar genomes or of version-controlled documents, in which e_T grows sublinearly in the length of T in practice. In this paper we augment such representation, supporting a number of additional queries in worst-case time between O(1) and O(log(n)) in the RAM model, without increasing space complexity asymptotically. Our technique, based on a heavy path decomposition of the suffix tree, enables also a representation of the suffix array, of the inverse suffix array, and of T itself, that takes O(e_T) words of space, and that supports random access in O(log(n)) time. Furthermore, we establish a connection between the reversed CDAWG of T and a context-free grammar that produces T and only T, which might have independent interest.fr_FR
dc.identifier.isbn978-3-95977-039-2fr_FR
dc.identifier.urihttp://dl.cerist.dz/handle/CERIST/895
dc.publisherSchloss Dagstuhl--Leibniz-Zentrum fuer Informatikfr_FR
dc.relation.ispartofseriesAnnual Symposium on Combinatorial Pattern Matching (CPM);78
dc.relation.pages7:1--7:13fr_FR
dc.relation.placeVarsoviefr_FR
dc.structureCalcul pervasif et mobile (Pervasive and Mobile Computing group)fr_FR
dc.subjectCDAWGfr_FR
dc.subjectsuffix treefr_FR
dc.subjectheavy path decompositionfr_FR
dc.subjectmaximal repeatfr_FR
dc.subjectcontext- free grammarfr_FR
dc.titleRepresenting the Suffix Tree with the CDAWGfr_FR
dc.typeConference paper

Files