- article, June 1998
Memory management during run generation in external sorting
If replacement selection is used in an external mergesort to generate initial runs, individual records are deleted and inserted in the sort operation's workspace. Variable-length records introduce the need for possibly complex memory management and ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Efficient transparent application recovery in client-server information systems
Database systems recover persistent data, providing high database availability. However, database applications, typically residing on client or “middle-tier” application-server machines, may lose work because of a server failure. This prevents the ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Wavelet-based histograms for selectivity estimation
Query optimization is an integral part of relational database management systems. One important task in query optimization is selectivity estimation, that is, given a query P, we need to estimate the fraction of records in the database that satisfy P. ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Random sampling for histogram construction: how much is enough?
Random sampling is a standard technique for constructing (approximate) histograms for query optimization. However, any real implementation in commercial products requires solving the hard problem of determining “How much sampling is enough?” We address ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Approximate medians and other quantiles in one pass and with limited memory
We present new algorithms for computing approximate quantiles of large datasets in a single pass. The approximation guarantees are explicit, and apply for arbitrary value distributions and arrival distributions of the dataset. The main memory ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Bitmap index design and evaluation
Bitmap indexing has been touted as a promising approach for processing complex adhoc queries in read-mostly environments, like those of decision support systems. Nevertheless, only few possible bitmap schemes have been proposed in the past and very ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Integrating association rule mining with relational database systems: alternatives and implications
Data mining on large data warehouses is becoming increasingly important. In support of this trend, we consider a spectrum of architectural alternatives for coupling mining with database systems. These alternatives include: loose-coupling through a SQL ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
New sampling-based summary statistics for improving approximate query answers
In large data recording and warehousing environments, it is often advantageous to provide fast, approximate answers to queries, whenever possible. Before DBMSs providing highly-accurate approximate answers can become a reality, many new techniques for ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Cost-based optimization of decision support queries using transient-views
Next generation decision support applications, besides being capable of processing huge amounts of data, require the ability to integrate and reason over data from multiple, heterogeneous data sources. Often, these data sources differ in a variety of ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Enhanced hypertext categorization using hyperlinks
A major challenge in indexing unstructured hypertext databases is to automatically extract meta-data that enables structured search using topic taxonomies, circumvents keyword ambiguity, and improves the quality of search and profile-based routing and ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Extracting schema from semistructured data
Semistructured data is characterized by the lack of any fixed and rigid schema, although typically the data has some implicit structure. While the lack of fixed schema makes extracting semistructured data fairly easy and an attractive goal, presenting ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
NoDoSE—a tool for semi-automatically extracting structured and semistructured data from text documents
Often interesting structured or semistructured data is not in database systems but in HTML pages, text files, or on paper. The data in these formats is not usable by standard query processing engines and hence users need a way of extracting data from ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Simultaneous optimization and evaluation of multiple dimensional queries
Database researchers have made significant progress on several research issues related to multidimensional data analysis, including the development of fast cubing algorithms, efficient schemes for creating and maintaining precomputed group-bys, and the ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Caching multidimensional queries using chunks
Caching has been proposed (and implemented) by OLAP systems in order to reduce response times for multidimensional queries. Previous work on such caching has considered table level caching and query level caching. Table level caching is more suitable ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
An alternative storage organization for ROLAP aggregate views based on cubetrees
The Relational On-Line Analytical Processing (ROLAP) is emerging as the dominant approach in data warehousing with decision support applications. In order to enhance query performance, the ROLAP approach relies on selecting and materializing in summary ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Incremental distance join algorithms for spatial databases
Two new spatial join operations, distance join and distance semi-join, are introduced where the join output is ordered by the distance between the spatial attribute values of the joined tuples. Incremental algorithms are presented for computing these ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Similarity query processing using disk arrays
Similarity queries are fundamental operations that are used extensively in many modern applications, whereas disk arrays are powerful storage media of increasing importance. The basic trade-off in similarity query processing in such a system is that ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Integration of heterogeneous databases without common domains using queries based on textual similarity
Most databases contain “name constants” like course numbers, personal names, and place names that correspond to entities in the real world. Previous work in integration of heterogeneous databases has assumed that local name constants can be mapped into ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Using schematically heterogeneous structures
Schematic heterogeneity arises when information that is represented as data under one schema, is represented within the schema (as metadata) in another. Schematic heterogeneity is an important class of heterogeneity that arises frequently in integrating ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955
- article, June 1998
Optimal multi-step k-nearest neighbor search
For an increasing number of modern database applications, efficient support of similarity search becomes an important task. Along with the complexity of the objects such as images, molecules and mechanical parts, also the complexity of the similarity ...
Also Published in:
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data: ISBN 0897919955