Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1316689dlproceedingsBook PagePublication PagesvldbConference Proceedingsconference-collections
VLDB '04: Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
2004 Proceeding
Publisher:
  • VLDB Endowment
Conference:
Toronto Canada 31 August 2004- 3 September 2004
ISBN:
978-0-12-088469-8
Published:
31 August 2004
Sponsors:
VLDB Endowment

Reflects downloads up to 03 Oct 2024Bibliometrics
Abstract

No abstract available.

Article
Databases in a wireless world
Page 3

The traditional view of distributed databases is based on a number of database servers with regular communication. Today information is stored not only in these central databases, but on a myriad of computers and computer-based devices in addition to ...

Article
Structures, semantics and statistics
Pages 4–6

At a fundamental level, the key challenge in data integration is to reconcile the semantics of disparate data sets, each expressed with a different database structure. I argue that computing statistics over a large number of structures offers a powerful ...

Article
Whither data mining?
Page 9

The last decade has witnessed tremendous advances in data mining. We take a retrospective look at these developments, focusing on association rules discovery, and discuss the challenges and opportunities ahead.

Article
Compressing large boolean matrices using reordering techniques
Pages 13–23

Large boolean matrices are a basic representational unit in a variety of applications, with some notable examples being interactive visualization systems, mining large graph structures, and association rule mining. Designing space and time efficient ...

Article
On the performance of bitmap indices for high cardinality attributes
Pages 24–35

It is well established that bitmap indices are efficient for read-only attributes with low attribute cardinalities. For an attribute with a high cardinality, the size of the bitmap index can be very large. To overcome this size problem, specialized ...

Article
Practical suffix tree construction
Pages 36–47

Large string datasets are common in a number of emerging text and biological database applications. Common queries over such datasets include both exact and approximate string matches. These queries can be evaluated very efficiently by using a suffix ...

Article
Answering xpath queries over networks by sending minimal views
Pages 48–59

When a client submits a set of XPath queries to a XML database on a network, the set of answer sets sent back by the database may include redundancy in two ways: some elements may appear in more than one answer set, and some elements in some answer sets ...

Article
A framework for using materialized XPath views in XML query processing
Pages 60–71

XML languages, such as XQuery, XSLT and SQL/XML, employ XPath as the search and extraction language. XPath expressions often define complicated navigation, resulting in expensive query processing, especially when executed over large collections of ...

Article
Schema-free XQuery
Pages 72–83

The widespread adoption of XML holds out the promise that document structure can be exploited to specify precise database queries. However, the user may have only a limited knowledge of the XML structure, and hence may be unable to produce a correct ...

Article
Client-based access control management for XML documents
Pages 84–95

The erosion of trust put in traditional database servers and in Database Service Providers, the growing interest for different forms of data dissemination and the concern for protecting children from suspicious Internet content are different factors ...

Article
Secure XML publishing without information leakage in the presence of data inference
Pages 96–107

Recent applications are seeing an increasing need that publishing XML documents should meet precise security requirements. In this paper, we consider data-publishing applications where the publisher specifies what information is sensitive and should be ...

Article
Limiting disclosure in hippocratic databases
Pages 108–119

We present a practical and efficient approach to incorporating privacy policy enforcement into an existing application and database environment, and we explore some of the semantic tradeoffs introduced by enforcing these privacy policy rules at cell-...

Article
On testing satisfiability of tree pattern queries
Pages 120–131

XPath and XQuery (which includes XPath as a sublanguage) are the major query languages for XML. An important issue arising in efficient evaluation of queries expressed in these languages is satisfiability, i.e., whether there exists a database, ...

Article
Containment of nested XML queries
Pages 132–143

Query containment is the most fundamental relationship between a pair of database queries: a query Q is said to be contained in a query Q′ if the answer for Q is always a subset of the answer for Q′, independent of the current state of the database. ...

Article
Efficient XML-to-SQL query translation: where to add the intelligence?
Pages 144–155

We consider the efficiency of queries generated by XML to SQL translation. We first show that published XML-to-SQL query translation algorithms are suboptimal in that they often translate simple path expressions into complex SQL queries even when much ...

Article
Taming XPath queries by minimizing wildcard steps
Pages 156–167

This paper presents a novel and complementary technique to optimize an XPath query by minimizing its wildcard steps. Our approach is based on using a general composite axis called the layer axis, to rewrite a sequence of XPath steps (all of which are ...

Article
The NEXT framework for logical XQuery optimization
Pages 168–179

Classical logical optimization techniques rely on a logical semantics of the query language. The adaptation of these techniques to XQuery is precluded by its definition as a functional language with operational semantics. We introduce Nested XML ...

Article
Detecting change in data streams
Pages 180–191

Detecting changes in a data stream is an important area of research with many applications. In this paper, we present a novel method for the detection and estimation of change. In addition to providing statistical guarantees on the reliability of ...

Article
Stochastic consistency, and scalable pull-based caching for erratic data stream sources
Pages 192–203

We introduce the notion of stochastic consistency, and propose a novel approach to achieving it for caches of highly erratic data. Erratic data sources, such as stock prices, sensor data, are common and important in practice. However, their erratic ...

Article
False positive or false negative: mining frequent itemsets from high speed transactional data streams
Pages 204–215

The problem of finding frequent items has been recently studied over high speed data streams. However, mining frequent itemsets from transactional data streams has not been well addressed yet in terms of its bounds of memory consumption. The main ...

Article
Indexing temporal XML documents
Pages 216–227

Different models have been proposed recently for representing temporal data, tracking historical information, and recovering the state of the document as of any given time, in XML documents. We address the problem of indexing temporal XML documents. In ...

Article
Schema-based scheduling of event processors and buffer minimization for queries on structured data streams
Pages 228–239

We introduce an extension of the XQuery language, FluX, that supports event-based query processing and the conscious handling of main memory buffers. Purely event-based queries of this language can be executed on streaming XML data in a very direct way. ...

Article
Bloom histogram: path selectivity estimation for XML data with updates
Pages 240–251

Cost-based XML query optimization calls for accurate estimation of the selectivity of path expressions. Some other interactive and internet applications can also benefit from such estimations. While there are a number of estimation techniques proposed ...

Article
XQuery on SQL hosts
Pages 252–263

Relational database systems may be turned into efficient XML and XPath processors if the system is provided with a suitable relational tree encoding. This paper extends this relational XML processing stack and shows that an RDBMS can also serve as a ...

Article
ROX: relational over XML
Pages 264–275

An increasing percentage of the data needed by business applications is being generated in XML format. Storing the XML in its native format will facilitate new applications that exchange business objects in XML format and query portions of XML documents ...

Article
From XML view updates to relational view updates: old solutions to a new problem
Pages 276–287

This paper addresses the question of updating relational databases through XML views. Using query trees to capture the notions of selection, projection, nesting, grouping, and heterogeneous sets found throughout most XML query languages, we show how XML ...

Article
XWAVE: optimal and approximate extended wavelets
Pages 288–299

Wavelet synopses have been found to be of interest in query optimization and approximate query answering. Recently, extended wavelets were proposed by Deligiannakis and Roussopoulos for data sets containing multiple measures. Extended wavelets optimize ...

Article
REHIST: relative error histogram construction algorithms
Pages 300–311

Histograms and Wavelet synopses provide useful tools in query optimization and approximate query answering. Traditional histogram construction algorithms, such as V-Optimal, optimize absolute error measures for which the error in estimating a true value ...

Article
Distributed set-expression cardinality estimation
Pages 312–323

We consider the problem of estimating set-expression cardinality in a distributed streaming environment where rapid update streams originating at remote sites are continually transmitted to a central processing system. At the core of our algorithmic ...

Article
Memory-limited execution of windowed stream joins
Pages 324–335

We address the problem of computing approximate answers to continuous sliding-window joins over data streams when the available memory may be insufficient to keep the entire join state. One approximation scenario is to provide a maximum subset of the ...

Contributors
  • Khoury College of Computer Sciences
  • University of Waterloo
  • Microsoft Research
  • Northeastern University
  • Microsoft Corporation

Recommendations