Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

An Optimal Ancestry Labeling Scheme with Applications to XML Trees and Universal Posets

Published: 12 February 2016 Publication History

Abstract

In this article, we solve the ancestry-labeling scheme problem, which aims at assigning the shortest possible labels (bit strings) to nodes of rooted trees, so ancestry queries between any two nodes can be answered by inspecting their assigned labels only. This problem was introduced more than 20 years ago by Kannan et al. [1988] and is among the most well-studied problems in the field of informative labeling schemes. We construct an ancestry-labeling scheme for n-node trees with label size log 2n + O(log log n) bits, thus matching the log 2n + Ω(log log n) bits lower bound given by Alstrup et al. [2003]. Our scheme is based on a simplified ancestry scheme that operates extremely well on a restricted set of trees. In particular, for the set of n-node trees with a depth of at most d, the simplified ancestry scheme enjoys label size of log 2n + 2log2d + O(1) bits. Since the depth of most XML trees is at most some small constant, such an ancestry scheme may be of practical use. In addition, we also obtain an adjacency-labeling scheme that labels n-node trees of depth d with labels of size log 2n + 3log 2d + O(1) bits. All our schemes assign the labels in linear time, and guarantee that any query can be answered in constant time.
Finally, our ancestry scheme finds applications to the construction of small universal partially ordered sets (posets). Specifically, for any fixed integer k, it enables the construction of a universal poset of size Õ(nk) for the family of n-element posets with a tree dimension of at most k. Up to lower-order terms, this bound is tight thanks to a lower bound of nko(1) by to Alon and Scheinerman [1988].

References

[1]
S. Abiteboul, S. Alstrup, H. Kaplan, T. Milo, and T. Rauhe. 2006. Compact labeling schemes for ancestor queries. SIAM J. Comput. 35 (2006), 1295--1309.
[2]
S. Abiteboul, P. Buneman, and D. Suciu. 1999. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann, Burlington, MA.
[3]
S. Abiteboul, H. Kaplan, and T. Milo. 2001. Compact labeling schemes for ancestor queries. In Proc. 12th ACM-SIAM Symp. on Discrete Algorithms (SODA).
[4]
D. Adjiashvili and N. Rotbart. 2014. Labeling schemes for bounded degree graphs. In ICALP. 375--386.
[5]
S. Alstrup, P. Bille, and T. Rauhe. 2003. Labeling schemes for small distances in trees. In Proc. 14th ACM-SIAM Symp. on Discrete Algorithms (SODA).
[6]
S. Alstrup, C. Gavoille, H. Kaplan, and T. Rauhe. 2004. Nearest common ancestors: A survey and a new distributed algorithm. Theor. Comput. Syst. 37 (2004), 441--456.
[7]
S. Alstrup, S. Dahlgaard, M. Baek, and T. Knudsen. 2015. Optimal induced universal graphs and adjacency labeling for trees. In 56th Annual Symposium on Foundations of Computer Science (FOCS). 1311--1326.
[8]
S. Alstrup, E. B. Halvorsen, and K. G. Larsen. 2014. Near-optimal labeling schemes for nearest common ancestors. In Proc. 25th ACM-SIAM Symposium on Discrete Algorithms (SODA). 972--982.
[9]
S. Alstrup, H. Kaplan, M. Thorup, and U. Zwick. 2015. Adjacency labeling schemes and induced-universal graphs. In 47th Annual Symposium on the Theory of Computing (STOC).
[10]
S. Alstrup and T. Rauhe. 2002. Small induced-universal graphs and compact implicit graph representations. In Proc. 43rd IEEE Symp. on Foundations of Computer Science (FOCS).
[11]
S. Alstrup and T. Rauhe. 2002. Improved labeling scheme for ancestor queries. In Proc. 13th ACM-SIAM Symposium on Discrete Algorithms (SODA). 947--953.
[12]
N. Alon and E. R. Scheinerman. 1988. Degrees of freedom versus dimension for containment orders. Order 5 (1988), 11--16.
[13]
G. Behrendt. 1993. On trees and tree dimension of ordered sets. Order 10, 2, (1993), 153--160.
[14]
E. Cohen, H. Kaplan, and T. Milo. 2002. Labeling dynamic XML trees. In Proc. 21st ACM Symp. on Principles of Database Systems (PODS).
[15]
S. Dahlgaard, M. Baek, T. Knudsen, and Noy Rotbart. 2015. A simple and optimal ancestry labeling scheme for trees. In Proc. 42nd International Colloquium on Automata, Languages, and Programming (ICALP). 564--574.
[16]
L. Denoyer. 2009. Personal Communication.
[17]
L. Denoyer and P. Gallinari. 2006. The wikipedia XML corpus. SIGIR Forum Newsletter 40, 1 (2006), 64--69.
[18]
A. Deutsch, M. Fernndez, D. Florescu, A. Levy, and D. Suciu. 1999. A query language for XML. Computer Networks 31 (1999), 1155--1169.
[19]
R. Fraisse. 1953. Theory of Relations North-Holland, Amsterdam.
[20]
P. Fraigniaud and C. Gavoille. 2001. Routing in trees. In Proc. 28th Int. Colloq. on Automata, Languages, and Programing (ICALP), LNCS 2076. Springer, Berlin, 757--772.
[21]
P. Fraigniaud and A. Korman. 2009. On randomized representations of graphs using short labels. In Proc. Proc. of the 21st Ann. ACM Symp. on Parallel Algorithms and Architectures (SPAA), 131--137.
[22]
P. Fraigniaud and A. Korman. 2010a. Compact ancestry labeling schemes for XML trees. In Proc. 21st ACM-SIAM Symp. on Discrete Algorithms (SODA), 458--466.
[23]
P. Fraigniaud and A. Korman. 2010b. An optimal ancestry-labeling scheme and small universal posets. In Proc. of the 42nd ACM Symp. on Theory of Computing (STOC), 611--620.
[24]
C. Gavoille and A. Labourel. 2007. Shorter implicit representation for planar graphs and bounded treewidth graphs. In 15th Annual European Symposium on Algorithms (ESA), 582--593.
[25]
C. Gavoille and C. Paul. 2001. Split decomposition and distance labelling: An optimal scheme for distance hereditary graphs. In Proc. European Conf. on Combinatorics, Graph Theory and Applications.
[26]
C. Gavoille, D. Peleg, S. Pérennes, and R. Raz. 2001. Distance labeling in graphs. In Proc. 12th ACM-SIAM Symp. on Discrete Algorithms (SODA), 210--219.
[27]
T. Hsu and H. Lu. 2009. An optimal labeling for node connectivity. In Proc. 20th International Symp. on Algorithms and Computation (ISAAC). 303--310.
[28]
J. Hubicka and J. Neetril. 2005. Universal partial order represented by means of oriented trees and other simple graphs. Eur. J. Combinatorics 26, 5 (2005), 765--778.
[29]
B. Jonson. 1956. Universal relational systems. Math Scan.4, 193--208.
[30]
J. B. Johnston. 1957. Universal infinite partially ordered sets. Proc. Am. Math. Soc. 7 (1957), 507--514.
[31]
M. Katz, N. A. Katz, A. Korman, and D. Peleg. 2004. Labeling schemes for flow and connectivity. SIAM J. Comput. 34 (2004), 23--40.
[32]
A. Korman. 2010. Labeling schemes for vertex connectivity. ACM Transact. Algorithm. 6, 2 (2010).
[33]
A. Korman and S. Kutten. 2007. Distributed verification of minimum spanning trees. Distrib. Comput. 20, 4 (2007), 253--266.
[34]
H. Kaplan and T. Milo. 2001. Short and simple labels for small distances and other functions. In Workshop on Algorithms and Data Structures.
[35]
H. Kaplan, T. Milo, and R. Shabo. 2002. A comparison of labeling schemes for ancestor queries. In Proc. 19th ACM-SIAM Symp. on Discrete Algorithms (SODA).
[36]
S. Kannan, M. Naor, and S. Rudich. 1992. Implicit representation of graphs. SIAM J. Discr. Math 5, (1992), 596--603.
[37]
A. Korman, D. Peleg, and Y. Rodeh. 2010. Constructing labeling schemes through universal matrices. Algorithmica 57(4) (2010), 641--652.
[38]
L. Mignet, D. Barbosa, and P. Veltri. 2005. Studying the XML Web: Gathering Statistics from an XML Sample. World Wide Web 8, 4 (2005), 413--438.
[39]
I. Mlynkova, K. Toman, and J. Pokorny. 2006. Statistical analysis of real XML data collections. In Proc. 13th Int. Conf. on Management of Data (2006), 20--31.
[40]
D. Peleg. 2005. Informative labeling schemes for graphs. Theor. Comput. Sci. 340, 3, (2005), 577--593.
[41]
D. D. Sleator and R. E. Tarjan. 1983. A data structure for dynamic trees. J. Comp. Sys. Sci. 26, 3 (1983), 362--391.
[42]
M. Thorup. 2004. Compact oracles for reachability and approximate distances in planar digraphs. J. ACM 51, (2004), 993--1024.
[43]
M. Thorup and U. Zwick. 2001. Compact routing schemes. In Proc. 13th ACM Symp. on Parallel Algorithms and Architecture (SPAA). 1--10.
[44]
W. T. Trotter. 1992. Newblock Combinatorics and Partially Ordered Sets: Dimension Theory. The Johns Hopkins University Press, Baltimore, MD.
[45]
W. T. Trotter and J. I. Moore. 1977. The dimension of planar posets J. Comb. Theory B 22 (1977), 54--67.
[46]
E. S. Wolk. 1962. The comparability graph of a tree. Proc. Am. Math. Soc. 3 (1962), 789--795.

Cited By

View all
  • (2024)Hierarchical Indexing for Interactive Zooming of Document ClustersWeb Information Systems Engineering – WISE 202410.1007/978-981-96-0579-8_21(286-303)Online publication date: 29-Nov-2024
  • (2022)Shorter Labeling Schemes for Planar GraphsSIAM Journal on Discrete Mathematics10.1137/20M133046436:3(2082-2099)Online publication date: 31-Aug-2022
  • (2022)The Implicit Graph Conjecture is False2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS)10.1109/FOCS54457.2022.00109(1134-1137)Online publication date: Oct-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Journal of the ACM
Journal of the ACM  Volume 63, Issue 1
March 2016
353 pages
ISSN:0004-5411
EISSN:1557-735X
DOI:10.1145/2893450
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 February 2016
Accepted: 01 June 2015
Revised: 01 May 2015
Received: 01 October 2011
Published in JACM Volume 63, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Database
  2. XML search engine
  3. graph decomposition
  4. labeling scheme
  5. poset
  6. tree

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • INRIA project GANG
  • ANR project DISPLEXITY

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Hierarchical Indexing for Interactive Zooming of Document ClustersWeb Information Systems Engineering – WISE 202410.1007/978-981-96-0579-8_21(286-303)Online publication date: 29-Nov-2024
  • (2022)Shorter Labeling Schemes for Planar GraphsSIAM Journal on Discrete Mathematics10.1137/20M133046436:3(2082-2099)Online publication date: 31-Aug-2022
  • (2022)The Implicit Graph Conjecture is False2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS)10.1109/FOCS54457.2022.00109(1134-1137)Online publication date: Oct-2022
  • (2022)Decentralized Graph Processing for Reachability QueriesAdvanced Data Mining and Applications10.1007/978-3-031-22064-7_36(505-519)Online publication date: 24-Nov-2022
  • (2017)Optimal Induced Universal Graphs and Adjacency Labeling for TreesJournal of the ACM10.1145/308851364:4(1-22)Online publication date: 17-Aug-2017

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media