This volume contains the proceedings of the Twenty-first ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS 2002), held at Madison, Wisconsin on June 3-5, 2002 in conjunction with the 2002 ACM SIGMOD International Conference on Management of Data. It consists of a paper based on the invited talk by Rajeev Motwani, two papers based on the invited tutorials by Maurizio Lenzerini and Dennis Shasha, and 24 contributed papers that were selected by the program committee for presentation at the symposium.The contributed papers were selected from 109 submissions. Most of the papers are "extended abstracts" and are preliminary reports on work in progress. While they have been read by program committee members, they have not been formally refereed. It is expected that much of the research described in these papers will be publihsed in detail in computer science journals.The program committee selected "Monadic Datalog over Trees and the Expressive Power of Languages for Web Information Extraction" by Georg Gottlob and Christoph Koch for the PODS 2002 Best Paper Award and "From Discrepancy to Declustering: Near optimal multidimensional declustering strategies for range queries" by Chung-Min Chen and Christine T. Cheng for the PODS 2002 Best Newcomer Award. Warmest congratulations to the authors of these papers.
Models and issues in data stream systems
In this overview paper we motivate the need for and research issues arising from a new model of data processing. In this model, data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data ...
Monadic datalog and the expressive power of languages for web information extraction
Research on information extraction from Web pages (wrapping) has seen much activity in recent times (particularly systems implementations), but little work has been done on formally studying the expressiveness of the formalisms proposed or on the ...
From discrepancy to declustering: near-optimal multidimensional declustering strategies for range queries
Declustering schemes allocate data blocks among multiple disks to enable parallel retrieval. Given a declustering scheme D, its response time with respect to a query Q, rt(Q), is defined to be the maximum number of disk blocks of the query stored by the ...
Algorithmics and applications of tree and graph searching
Modern search engines answer keyword-based queries extremely efficiently. The impressive speed is due to clever inverted index structures, caching, a domain-independent knowledge of strings, and thousands of machines. Several research efforts have ...
Validating streaming XML documents
This paper investigates the on-line validation of streaming XML documents with respect to a DTD, under memory constraints. We first consider validation using constant memory, formalized by a finite-state automaton (FSA). We examine two flavors of the ...
Containment and equivalence for an XPath fragment
XPath is a simple language for navigating an XML document and selecting a set of element nodes. XPath expressions are used to query XML data, describe key constraints, express transformations, and reference elements in remote documents. This paper ...
On the power of walking for querying tree-structured data
XSLT is the prime example of an XML query language based on tree-walking. Indeed, stripped down, XSLT is just a tree-walking tree-transducer equipped with registers and look-ahead. Motivated by this connection, we want to pinpoint the computational ...
A normal form for XML documents
This paper takes a first step towards the design and normalization theory for XML documents. We show that, like relational databases, XML documents may contain redundant information, and may be prone to update anomalies. Furthermore, such problems are ...
Distributed computation of web queries using automata
We introduce and investigate a distributed computation model for querying the Web. Web queries are computed by interacting automata running at different nodes in the Web. The automata which we are concerned with can be viewed as register automata ...
Conjunctive selection conditions in main memory
We consider the fundamental operation of applying a conjunction of selection conditions to a set of records. With large main memories available cheaply, systems may choose to keep the data entirely in main memory, in order to improve query and/or update ...
Efficient aggregation over objects with extent
We examine the problem of efficiently computing sum/count/avg aggregates over objects with non-zero extent. Recent work on computing multi-dimensional aggregates has concentrated on objects with zero extent (points) on a multi-dimensional grid, or one-...
How to evaluate multiple range-sum queries progressively
Users of decision support system typically submit batches of range-sum queries simultaneously rather than issuing individual, unrelated queries. We propose a wavelet based technique that exploits T/O sharing across a query batch to evaluate the set of ...
On the computation of relational view complements
Views as a means to describe parts of a given data collection play an important role in many database applications. In dynamic environments, where data is updated, not only information provided by views, but also information provided by data sources but ...
On propagation of deletions and annotations through views
We study two classes of view update problems in relational databases. We are given a source database S, a monotone query Q, and the view Q(S) generated by the query. The first problem that we consider is the classical view deletion problem where we wish ...
The view-selection problem has an exponential-time lower bound for conjunctive queries and views
The view-selection problem is to design and materialize a set of views over a database schema, such that the choice of views minimizes the cost of evaluating the selected workload of queries, and the combined size of the materialized views does not ...
OLAP dimension constraints
In multidimensional data models intended for online analytic processing (OLAP), data are viewed as points in a multidimensional space. Each dimension has structure, described by a directed graph of categories, a set of members for each category, and a ...
Fast algorithms for hierarchical range histogram construction
Data Warehousing and OLAP applications typically view data an having multiple logical dimensions (e.g., product, location) with natural hierarchies defined on each dimension. OLAP queries usually involve hierarchical selections on some of the dimensions,...
On moving object queries: (extended abstract)
Database applications for moving objects pose new challenges in modeling, querying, and maintenance of objects whose locations are rapidly changing over time. Previous work on modeling and querying spatio-temporal databases and constraint databases ...
Knowledge compilation = query rewriting + view synthesis
In Knowledge Compilation (KC) an intractable deduction problem KB ⊨ f is split into two phases: 1) KB is preprocessed, thus obtaining a data structure DKB; 2) the problem is efficiently solved using DKB and f. Our goal is to study KC in the context of ...
Answering queries using views with arithmetic comparisons
We consider the problem of answering queries using views, where queries and views are conjunctive queries with arithmetic comparisons (CQACs) over dense orders. Previous work only considered limited variants of this problem, without giving a complete ...
Characterizing memory requirements for queries over continuous data streams
We consider conjunctive queries with arithmetic comparisons over multiple continuous data streams. We specify an algorithm for determining whether or not a query can be evaluated using a bounded amount of memory for all possible instances of the data ...
Data integration: a theoretical perspective
Data integration is the problem of combining data residing at different sources, and providing the user with a unified view of these data. The problem of designing data integration systems is important in current real world applications, and is ...
Lossless regular views
If the only information we have on a certain database is through a set of views, the question arises of whether this is sufficient to answer completely a given query. We say that the set of views is lossless with respect to the query, if, no matter what ...
On verifying consistency of XML specifications
XML specifications often consist of a type definition (typically, a DTD) and a set of integrity constraints. It has been shown previously that such specifications can be inconsistent, and thus it is often desirable to check consistency at compile-time. ...
Labeling dynamic XML trees
We present algorithms to label the nodes of an XML tree which is subject to insertions and deletions of nodes. The labeling is done such that (1) we label each node immediately when it is inserted and this label remains unchanged, and (2) from a pair of ...
On the complexity of approximate query optimization
In this work, we study the complexity of the problem of approximate query optimization. We show that, for any δ > 0, the problem of finding a join order sequence whose cost is within a factor 2Θ(log1-δ(K)) of K, where K is the cost of the optimal join ...
Least expected cost query optimization: what can we expect?
A standard assumption in the database query optimization literature is that it suffices to optimize for the "typical" case---that is, the case in which various parameters (e.g., the amount of available memory, the selectivities of predicates, etc.) take ...
Cited By
-
Zhou Q, Arulraj J, Navathe S, Harris W and Wu J (2022). SPES: A Symbolic Approach to Proving Query Equivalence Under Bag Semantics 2022 IEEE 38th International Conference on Data Engineering (ICDE), 10.1109/ICDE53745.2022.00250, 978-1-6654-0883-7, (2735-2748)
-
Tao F, Sui F, Liu A, Qi Q, Zhang M, Song B, Guo Z, Lu S and Nee A (2018). Digital twin-driven product design framework, International Journal of Production Research, 10.1080/00207543.2018.1443229, 57:12, (3935-3953), Online publication date: 18-Jun-2019.
Index Terms
- Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Recommendations
Acceptance Rates
Year | Submitted | Accepted | Rate |
---|---|---|---|
PODS '19 | 87 | 29 | 33% |
PODS '17 | 101 | 29 | 29% |
PODS '16 | 94 | 31 | 33% |
PODS '15 | 80 | 25 | 31% |
PODS '14 | 67 | 22 | 33% |
PODS '13 | 97 | 24 | 25% |
PODS '12 | 101 | 26 | 26% |
PODS '11 | 113 | 25 | 22% |
PODS '10 | 113 | 27 | 24% |
PODS '09 | 97 | 26 | 27% |
PODS '08 | 159 | 28 | 18% |
PODS '07 | 187 | 28 | 15% |
PODS '06 | 185 | 35 | 19% |
PODS '03 | 136 | 27 | 20% |
PODS '02 | 109 | 24 | 22% |
PODS '01 | 99 | 26 | 26% |
PODS '00 | 119 | 26 | 22% |
PODS '99 | 116 | 32 | 28% |
PODS '98 | 119 | 28 | 24% |
PODS '97 | 118 | 23 | 19% |
PODS '96 | 84 | 22 | 26% |
PODS '95 | 94 | 25 | 27% |
PODS '94 | 117 | 28 | 24% |
PODS '93 | 115 | 26 | 23% |
Overall | 2,707 | 642 | 24% |