Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/FOCS.2012.79guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

On-Line Indexing for General Alphabets via Predecessor Queries on Subsets of an Ordered List

Published: 20 October 2012 Publication History

Abstract

The problem of \textit{Text Indexing} is a fundamental algorithmic problem in which one wishes to preprocess a text in order to quickly locate pattern queries within the text. In the ever evolving world of dynamic and on-line data, there is also a need for developing solutions to index texts which arrive on-line, i.e.~a character at a time, and still be able to quickly locate said patterns. In this paper, a new solution for on-line indexing is presented by providing an on-line suffix tree construction in $O(\log \log n + \log\log |\Sigma|)$ worst-case expected time per character, where $n$ is the size of the string, and $\Sigma$ is the alphabet. This improves upon all previously known on-line suffix tree constructions for general alphabets, at the cost of having the run time in expectation. The main idea is to reduce the problem of constructing a suffix tree on-line to an interesting variant of the order maintenance problem, which may be of independent interest. In the famous order maintenance problem, one wishes to maintain a dynamic list $L$ of size $n$ under insertions, deletions, and order queries. In an order query, one is given two nodes from $L$ and must determine which node precedes the other in $L$. In an extension to this problem, named the {\em Predecessor search on Dynamic Subsets of an Ordered Dynamic List problem} (POLP for short), it is also necessary to maintain dynamic subsets $S_1, \cdots, S_k\subseteq L$, such that given some $u\in L$ it will be possible to quickly locate the predecessor of $u$ in $S_i$, for any integer $1\leq i \leq k$. This paper provides an efficient data structure capable of locating the predecessor of $u$ in $S_i$ in $O(\log \log n)$ worst-case time and answering order queries on $L$ in $O(1)$ worst-case time, while allowing updates to $L$ in $O(1)$ worst-case expected time and updates to the subsets in $O(\log \log n)$ worst-case expected time. This improves over a previous data structure which may be implicitly obtained from Dietz~\cite{Dietz89}, in which the updates to the sets and $L$ are done in $O(\log \log n)$ \textit{amortized} expected time. In addition, the bounds shown here match the currently best known bounds for predecessor search in the RAM model. Furthermore, this paper improves or simplifies bounds for several additional applications, including fully-persistent arrays, the monotonic list labeling problem, and the Order-Maintenance Problem.

Cited By

View all
  • (2022)Dynamic suffix array with polylogarithmic queries and updatesProceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing10.1145/3519935.3520061(1657-1670)Online publication date: 9-Jun-2022
  • (2022)Bamboo Trimming RevisitedProceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3490148.3538580(411-417)Online publication date: 11-Jul-2022
  • (2021)Randomized cup game algorithms against strong adversariesProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458187(2059-2077)Online publication date: 10-Jan-2021
  • Show More Cited By

Index Terms

  1. On-Line Indexing for General Alphabets via Predecessor Queries on Subsets of an Ordered List

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    FOCS '12: Proceedings of the 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science
    October 2012
    770 pages
    ISBN:9780769548746

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 20 October 2012

    Author Tags

    1. data structures
    2. order-maintenance
    3. pattern matching
    4. predecessor
    5. suffix tree

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Dynamic suffix array with polylogarithmic queries and updatesProceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing10.1145/3519935.3520061(1657-1670)Online publication date: 9-Jun-2022
    • (2022)Bamboo Trimming RevisitedProceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3490148.3538580(411-417)Online publication date: 11-Jul-2022
    • (2021)Randomized cup game algorithms against strong adversariesProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458187(2059-2077)Online publication date: 10-Jan-2021
    • (2021)How asymmetry helps buffer management: achieving optimal tail size in cup gamesProceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing10.1145/3406325.3451033(1248-1261)Online publication date: 15-Jun-2021
    • (2020)Predecessor SearchACM Computing Surveys10.1145/340937153:5(1-35)Online publication date: 28-Sep-2020
    • (2019)Achieving optimal backlog in multi-processor cup gamesProceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing10.1145/3313276.3316342(1148-1157)Online publication date: 23-Jun-2019
    • (2018)Optimal dynamic stringsProceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3174304.3175404(1509-1528)Online publication date: 7-Jan-2018
    • (2018)Online Labeling: Algorithms, Lower Bounds and Open QuestionsComputer Science – Theory and Applications10.1007/978-3-319-90530-3_3(23-28)Online publication date: 6-Jun-2018
    • (2017)File maintenanceProceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3039686.3039784(1503-1522)Online publication date: 16-Jan-2017
    • (2017)Full-Fledged Real-Time Indexing for Constant Size AlphabetsAlgorithmica10.1007/s00453-016-0199-779:2(387-400)Online publication date: 1-Oct-2017
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media