Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Logical definability and query languages over ranked and unranked trees

Published: 01 April 2007 Publication History

Abstract

We study relations on trees defined by first-order constraints over a vocabulary that includes the tree extension relation TT′ (holding if and only if every branch of T extends to a branch of T′), unary node-tests, and a binary relation checking whether the domains of two trees are equal. We consider both ranked and unranked trees. These are trees with and without a restriction on the number of children of nodes. We adopt the model-theoretic approach to tree relations and study relations definable over the structure consisting of the set of all trees and the aforementioned predicates. We relate definability of sets and relations of trees to computability by tree automata. We show that some natural restrictions correspond to familiar logics in the more classical setting where every tree is a structure over a fixed vocabulary, and to logics studied in the context of XML pattern languages. We then look at relational calculi over collections of trees, and obtain quantifier-restriction results that give us bounds on the expressive power and complexity. As unrestricted relational calculi can express problems that are complete for each level of the polynomial hierarchy, we look at their restrictions, corresponding to the restricted logics over the family of all unranked trees, and find several calculi with low (NC1) data complexity which still express properties important for database and document applications. We also give normal forms for safe queries in the calculus.

References

[1]
Abiteboul, S., Hull, R., and Vianu, V. 1995. Foundations of Databases. Addison-Wesley.
[2]
Aiken, A. 1994. Set constraints: Results, applications, and future directions. In PPCP: Proceedings of the 2nd International Workshop on Principles and Practice of Constraint Programming. Lecture Notes in Computer Science, vol. 874. Springer Verlag. 326--335.
[3]
Angluin, D. and Hoover, D. N. 1984. Regular prefix relations. Math. Syst. Theory 17, 3, 167--191.
[4]
Anthony, M. and Biggs, N. 1992. Computational Learning Theory. Cambridge University Press.
[5]
Benedikt, M. and Libkin, L. 2000. Relational queries over interpreted structures. J. ACM 47, 4, 644--680.
[6]
Benedikt, M., Libkin, L., Schwentick, T., and Segoufin, L. 2003. Definable relations and first-order query languages over strings. J. ACM 50, 5, 694--751.
[7]
Blumensath, A. and Gräel, E. 2000. Automatic structures. In LICS: Proceedings of the 15th Annual IEEE Symposium on Logic in Computer Science. 51.
[8]
Börger, E., Grädel, E., and Gurevich, Y. 1997. The Classical Decision Problem. Springer Verlag.
[9]
Bray, T., Paoli, J., Sperberg-McQueen, C. M., and Maler, E. 2000. Extensible Markup Language (XML) 1.0, 2nd ed. W3C recommendation. http://www.w3.org/XML.
[10]
Brüggemann-Klein, A., Murata, M., and Wood, D. 2001. Regular tree and regular hedge languages over unranked alphabets: Version 1. Tech. Rep. HKUST-TCSC-2001-0, The Hongkong University of Science and Technology.
[11]
Bruyère, V., Hansel, G., Michaux, C., and Villemaire, R. 1994. Logic and p-recognizable sets of integers. Bull. Belgian Math. Soc. 1, 191--238.
[12]
Cardelli, L. and Ghelli, G. 2004. TQL: A query language for semistructured data based on the ambient logic. Math. Structures Comput. Sci. 14, 3, 285--327.
[13]
Chamberlin, D., Clark, J., Florescu, D., Robie, J., Simeon, J., and Stefanascu, M. 2002. XQuery 1.0: An XML query language. http://www.w3.org/TR/xquery/.
[14]
Clark, J. 1999. XSL transformations version 1.0. http://www.w3.org/TR/WD-xslt.
[15]
Clark, J. and DeRose, S. 1999. XML path language (XPath). http://www.w3.org/TR/xpath.
[16]
Comon, H., Dauchet, M., Gilleron, R., Jacquemard, F., Lugiez, D., Tison, S., and Tommasi, M. 1997. Tree automata techniques and applications. http://www.grappa.univ-lille3.fr/tata.
[17]
Dantsin, E. and Voronkov, A. 2000. Expressive power and data complexity of nonrecursive query languages for lists and trees (extended abstract). In PODS: Proceedings of the 19th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. ACM Press, New York. 157--165.
[18]
Doner, J. 1970. Tree acceptors and some of their applications. J. Comput. Syst. Sci. 4, 406--451.
[19]
Dörre, J. 1991. Feature logic with weak subsumption constraints. In Proceedings of the 29th Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 256--263.
[20]
Ebbinghaus, H.-D. and Flum, J. 1999. Finite Model Theory, 2nd ed. Springer Verlag.
[21]
Elgaard, J., Klarlund, N., and Møller, A. 1998. Mona 1.x: New techniques for ws1s and ws2s. In CAV: Proceedings of the 10th International Conference on Computer Aided Verification, A. J. Hu and M. Y. Vardi, eds. Lecture Notes in Computer Science, vol. 1427. Springer Verlag. 516--520.
[22]
Ésik, Z. 1998. Axiomatizing the equational theory of regular tree languages (extended anstract). In STACS: Proceedings of the 15th Annual Symposium on Theoretical Aspects of Computer Science, M. Morvan et al. eds. Lecture Notes in Computer Science, vol. 1373. Springer Verlag. 455--465.
[23]
Flum, J. and Ziegler, M. 1999. Pseudo-Finite homogeneity and saturation. J. Symb. Logic 64, 1689--1699.
[24]
Gottlob, G., Koch, C., Pichler, R., and Segoufin, L. 2005. The complexity of Xpath query evaluation and XML typing. J. ACM 52, 2, 284--335.
[25]
Hodges, W. 1993. Model Theory. Cambridge University Press, New York.
[26]
Hodgson, B. 1983. Décadabilité par automate fini. Ann. Sci. Math. Québec 7, 39--57.
[27]
Immerman, N. 1998. Descriptive Complexity. Springer Verlag.
[28]
Khoussainov, B. and Nerode, A. 1995. Automatic presentations of structures. In LCC: Logical and Computational Complexity, D. Leivant, ed. Lecture Notes in Computer Science, vol. 960. Springer Verlag. 367--392.
[29]
Kolb, H.-P. and Mönnich, U. 1999. The Mathematics of Syntactic Structure: Trees and Their Logics. Walter De Gruyter, Hawthorne, NY.
[30]
Kupferman, O., Safra, S., and Vardi, M. Y. 1996. Relating word and tree automata. In LICS: Proceedings of the 11th Annual IEEE Symposium on Logic in Computer Science. 322.
[31]
Laskowski, M. C. 1992. Vapnik-Chervonenkis classes of definable sets. J. London Math. Soc. 45, 377--384.
[32]
Lohrey, M. 2001. On the parallel complexity of tree automata. In RTA: Proceedings of the 12th International Conference on Rewriting Techniques and Applications, A. Middeldorp, ed. Lecture Notes in Computer Science, vol. 2051. Springer Verlag. 201--215.
[33]
Marx, M. 2004. Conditional Xpath, the first order complete Xpath dialect. In PODS: Proceedings of the 23rd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. ACM Press, New York. 13--22.
[34]
Müller, M. and Niehren, J. 2000. Ordering constraints over feature trees expressed in second-order monadic logic. Inf. Comput. 159, 1--2, 22--58.
[35]
Müller, M., Niehren, J., and Podelski, A. 2000. Ordering constraints over feature trees. Constraints 5, 1--2, 7--41.
[36]
Müller, M., Niehren, J., and Treinen, R. 2001. The first-order theory of ordering constraints over feature trees. Discrete Math. Theor. Comput. Sci. 4, 2, 193--234.
[37]
Neven, F. and Schwentick, T. 2000. Expressive and efficient pattern languages for tree-structured data (extended abstract). In PODS: Proceedings of the 19th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. ACM Press, New York. 145--156.
[38]
Neven, F. and Schwentick, T. 2002. Query automata on finite trees. Theor. Comput. Sci. 275, 633--674.
[39]
Niwinski, D. and Walukiewicz, I. 1998. Relating hierarchies of word and tree automata. In STACS: Proceedings of the 15th Annual Symposium on Theoretical Aspects of Computer Science, M. Morvan et al. eds. Lecture Notes in Computer Science, vol. 1373. Springer Verlag. 320--331.
[40]
Potthoff, A. and Thomas, W. 1993. Regular tree languages without unary symbols are star-free. In FCT: Proceedings of the 9th International Symposium on Fundamentals of Computation Theory, Z. Ésik, ed. Lecture Notes in Computer Science, vol. 710. Springer Verlag. 396--405.
[41]
Rabin, M. 1969. Decidability of second-order theories and automata on infinite trees. Trans. Amer. Math. Soci. 141, 1--35.
[42]
Smolka, G. 1995. The OZ programming model (extended abstract). In Euro-Par: Proceedings of the 1st International Euro-Par Conference on Parallel Processing, S. Haridi and P. Magnusson, eds. Lecture Notes in Computer Science, vol. 966. Springer Verlag. 5--8.
[43]
Smolka, G. and Treinen, R. 1994. Records for logic programming. J. Logic Program. 18, 229--258.
[44]
Stolboushkin, A. and Taitslin, M. 1999. Finite queries do not have effective syntax. Inf. Comput. 153, 99--116.
[45]
Su, Z., Aiken, A., Niehren, J., Priesnitz, T., and Treinen, R. 2002. The first-order theory of subtyping constraints. In POPL: Proceedings of the 29th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. ACM Press, 203--216.
[46]
Suciu, D. 2002. Typechecking for semistructured data. In DBPL: Revised Papers from the 8th International Workshop on Database Programming Languages, G. Ghelli and G. Grahne, eds. Lecture Notes in Computer Science, vol. 2397. Springer Verlag. 1--20.
[47]
Thatcher, J. and Wright, J. 1968. Generalized finite automata theory with an application to a decision problem of second-order logic. Math. Syst. Theory 2, 1, 57--81.
[48]
Thomas, W. 1984. Logical aspects in the study of tree languages. In Proceedings of the 9th International Colloquium on Trees in Algebra and Programming, B. Courcelle, ed. Cambridge University Press, New York. 31--50.
[49]
Thomas, W. 1987. On chain logic, path logic, and first-order logic over infinite trees. In Symposium on Logic in Computer Science. 245--256.
[50]
Thomas, W. 1997. Languages, automata, and logic. In Handbook of Formal Languages, vol. 3. G. Rozenberg and A. Salomaa, eds. Springer Verlag. 389--456.
[51]
Vollmer, H. 1999. Introduction to Circuit Complexity. Springer Verlag.
[52]
Vorobyov, S. and Voronkov, A. 1998. Complexity of nonrecursive logic programs with complex values. In PODS: Proceedings of the 17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. ACM Press, New York. 244--253.
[53]
Wilke, T. 1996. An algebraic characterization of frontier testable tree languages. Theor. Comput. Sci. 154, 1, 85--106.

Cited By

View all
  • (2023)Expressive Completeness of Two-Variable First-Order Logic with Counting for First-Order Logic Queries on Rooted Unranked Trees2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS)10.1109/LICS56636.2023.10175828(1-13)Online publication date: 26-Jun-2023
  • (2018)Regular Languages of Nested Words: Fixed Points, Automata, and SynchronizationTheory of Computing Systems10.1007/s00224-010-9292-549:3(639-670)Online publication date: 25-Dec-2018
  • (2011)Weak Kripke structures and LTLProceedings of the 22nd international conference on Concurrency theory10.5555/2040235.2040272(419-433)Online publication date: 6-Sep-2011
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Computational Logic
ACM Transactions on Computational Logic  Volume 8, Issue 2
April 2007
196 pages
ISSN:1529-3785
EISSN:1557-945X
DOI:10.1145/1227839
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 April 2007
Published in TOCL Volume 8, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Ranked trees
  2. model theory
  3. query languages
  4. tree automata
  5. unranked trees

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Expressive Completeness of Two-Variable First-Order Logic with Counting for First-Order Logic Queries on Rooted Unranked Trees2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS)10.1109/LICS56636.2023.10175828(1-13)Online publication date: 26-Jun-2023
  • (2018)Regular Languages of Nested Words: Fixed Points, Automata, and SynchronizationTheory of Computing Systems10.1007/s00224-010-9292-549:3(639-670)Online publication date: 25-Dec-2018
  • (2011)Weak Kripke structures and LTLProceedings of the 22nd international conference on Concurrency theory10.5555/2040235.2040272(419-433)Online publication date: 6-Sep-2011
  • (2011)Queries on Xml streams with bounded delay and concurrencyInformation and Computation10.1016/j.ic.2010.08.003209:3(409-442)Online publication date: 1-Mar-2011
  • (2011)Weak Kripke Structures and LTLCONCUR 2011 – Concurrency Theory10.1007/978-3-642-23217-6_28(419-433)Online publication date: 2011
  • (2009)Hyperset Approach to Semi-structured DatabasesProceedings of the 26th British National Conference on Databases: Dataspace: The Final Frontier10.1007/978-3-642-02843-4_10(78-90)Online publication date: 30-Jun-2009
  • (2008)Recurrent Reachability Analysis in Regular Model CheckingProceedings of the 15th International Conference on Logic for Programming, Artificial Intelligence, and Reasoning10.1007/978-3-540-89439-1_15(198-213)Online publication date: 22-Nov-2008
  • (2007)Regular languages of nested wordsProceedings of the 34th international conference on Automata, Languages and Programming10.5555/2394539.2394642(888-900)Online publication date: 9-Jul-2007

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media