SIGMOD: Vol 25, No 2

Volume 25, Issue 2June 1996

Volume 25, Issue 2

June 1996

Editor:

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:0163-5808

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

article

Free

Mining quantitative association rules in large relational tables

Pages 1–12https://doi.org/10.1145/235968.233311

We introduce the problem of mining association rules in large relational tables containing both quantitative and categorical attributes. An example of such an association might be "10% of married people between age 50 and 60 have at least 2 cars". We ...

article

Free

Data mining using two-dimensional optimized association rules: scheme, algorithms, and visualization

Pages 13–23https://doi.org/10.1145/235968.233313

We discuss data mining based on association rules for two numeric attributes and one Boolean attribute. For example, in a database of bank customers, "Age" and "Balance" are two numeric attributes, and "CardLoan" is a Boolean attribute. Taking the pair (...

article

Free

IDEA: interactive data exploration and analysis

Pages 24–34https://doi.org/10.1145/235968.233315

The analysis of business data is often an ill-defined task characterized by large amounts of noisy data. Because of this, business data analysis must combine two kinds of intertwined tasks: exploration and analysis. Exploration is the process of finding ...

article

Free

Rapid bushy join-order optimization with Cartesian products

Pages 35–46https://doi.org/10.1145/235968.233317

Query optimizers often limit the search space for join orderings, for example by excluding Cartesian products in subplans or by restricting plan trees to left-deep vines. Such exclusions are widely assumed to reduce optimization effort while minimally ...

article

Free

SQL query optimization: reordering for a general class of queries

Pages 47–56https://doi.org/10.1145/235968.233318

The strength of commercial query optimizers like DB2 comes from their ability to select an optimal order by generating all equivalent reorderings of binary operators. However, there are no known methods to generate all equivalent reorderings for a SQL ...

article

Free

Fundamental techniques for order optimization

Pages 57–67https://doi.org/10.1145/235968.233320

Decision support applications are growing in popularity as more business data is kept on-line. Such applications typically include complex SQL queries that can test a query optimizer's ability to produce an efficient access plan. Many access plan ...

article

Free

A Teradata content-based multimedia object manager for massively parallel architectures

Pages 68–78https://doi.org/10.1145/235968.233321

The Teradata Multimedia Object Manager is a general-purpose content analysis multimedia server designed for symmetric multiprocessing and massively parallel processing environments. The Multimedia Object Manager defines and manipulates user-defined ...

article

Free

Fault-tolerant architectures for continuous media servers

Pages 79–90https://doi.org/10.1145/235968.233322

Continuous media servers that provide support for the storage and retrieval of continuous media data (e.g., video, audio) at guaranteed rates are becoming increasingly important. Such servers, typically, rely on several disks to service a large number ...

article

Free

Optimizing queries over multimedia repositories

Pages 91–102https://doi.org/10.1145/235968.233323

Repositories of multimedia objects having multiple types of attributes (e.g., image, text) are becoming increasingly common. A selection on these attributes will typically produce not just a set of objects, as in the traditional relational query model (...

article

Free

BIRCH: an efficient data clustering method for very large databases

Pages 103–114https://doi.org/10.1145/235968.233324

Finding useful patterns in large datasets has attracted considerable interest recently, and one of the most widely studied problems in this area is the identification of clusters, or densely populated regions, in a multi-dimensional dataset. Prior work ...

article

Free

On-line reorganization of sparsely-populated B+-trees

Pages 115–124https://doi.org/10.1145/235968.233325

In this paper, we present an efficient method to do online reorganization of sparsely-populated B⁺-trees. It reorganizes the leaves first, compacting in short operations groups of leaves with the same parent. After compacting, optionally, the new leaves ...

article

Free

Two techniques for on-line index modification in shared nothing parallel databases

Pages 125–136https://doi.org/10.1145/235968.233326

Whenever data is moved across nodes in the parallel database system, the indexes need to be modified too. Index modification overhead can be quite severe because there can be a large number of indexes on a relation. In this paper, we study two ...

article

Free

Query caching and optimization in distributed mediator systems

Pages 137–146https://doi.org/10.1145/235968.233327

Query processing and optimization in mediator systems that access distributed non-proprietary sources pose many novel problems. Cost-based query optimization is hard because the mediator does not have access to source statistics information and ...

article

Free

Performance tradeoffs for client-server query processing

Pages 149–160https://doi.org/10.1145/235968.233328

The construction of high-performance database systems that combine the best aspects of the relational and object-oriented approaches requires the design of client-server architectures that can fully exploit client and server resources in a flexible ...

article

Free

Data access for the masses through OLE DB

José A. Blakeley

Pages 161–172https://doi.org/10.1145/235968.233329

This paper presents an overview of OLE DB, a set of interfaces being developed at Microsoft whose goal is to enable applications to have uniform access to data stored in DBMS and non-DBMS information containers. Applications will be able to take ...

article

Free

The dangers of replication and a solution

Pages 173–182https://doi.org/10.1145/235968.233330

Update anywhere-anytime-anyway transactional replication has unstable behavior as the workload scales up: a ten-fold increase in nodes and traffic gives a thousand fold increase in deadlocks or reconciliations. Master copy replication (primary copy) ...

article

Free

Hot mirroring: a method of hiding parity update penalty and degradation during rebuilds for RAID5

Pages 183–194https://doi.org/10.1145/235968.233331

This paper proposes a storage management scheme for disk arrays, named hot mirroring. In this scheme, storage space is partitioned into two regions. One is the mirrored region, which is characterized by high performance and low storage efficiency. The ...

article

Free

Random I/O scheduling in online tertiary storage systems

Pages 195–204https://doi.org/10.1145/235968.233332

New database applications that require the storage and retrieval of many terabytes of data are reaching the limits for disk-based storage systems, in terms of both cost and scalability. These limits provide a strong incentive for the development of ...

article

Free

Implementing data cubes efficiently

Pages 205–216https://doi.org/10.1145/235968.233333

Decision support applications involve complex queries on very large databases. Since response times should be small, query optimization is critical. Users typically view the data as multidimensional data cubes. Each cell of the data cube is a view ...

article

Free

Providing better support for a class of decision support queries

Pages 217–227https://doi.org/10.1145/235968.233334

Relational database systems do not effectively support complex queries containing quantifiers (quantified queries) that are increasingly becoming important in decision support applications. Generalized quantifiers provide an effective way of expressing ...

article

Free

A query language for multidimensional arrays: design, implementation, and optimization techniques

Pages 228–239https://doi.org/10.1145/235968.233335

While much recent research has focussed on extending databases beyond the traditional relational model, relatively little has been done to develop database tools for querying data organized in (multidimensional) arrays. The scientific computing ...

article

Free

A super scalar sort algorithm for RISC processors

Ramesh C. Agarwal

Pages 240–246https://doi.org/10.1145/235968.233336

The compare and branch sequences required in a traditional sort algorithm can not efficiently exploit multiple execution units present in currently available high performance RISC processors. This is because of the long latency of the compare ...

article

Free

Spatial hash-joins

Pages 247–258https://doi.org/10.1145/235968.233337

We examine how to apply the hash-join paradigm to spatial joins, and define a new framework for spatial hash-joins. Our spatial partition functions have two components: a set of bucket extents and an assignment function, which may map a data item into ...

article

Free

Partition based spatial-merge join

Pages 259–270https://doi.org/10.1145/235968.233338

This paper describes PBSM (Partition Based Spatial-Merge), a new algorithm for performing spatial join operation. This algorithm is especially effective when neither of the inputs to the join have an index on the joining attribute. Such a situation ...

article

Free

Bifocal sampling for skew-resistant join size estimation

Pages 271–281https://doi.org/10.1145/235968.233340

This paper introduces bifocal sampling, a new technique for estimating the size of an equi-join of two relations. Bifocal sampling classifies tuples in each relation into two groups, sparse and dense, based on the number of tuples with the same join ...

article

Free

Estimating alphanumeric selectivity in the presence of wildcards

Pages 282–293https://doi.org/10.1145/235968.233341

Success of commercial query optimizers and database management systems (object-oriented or relational) depend on accurate cost estimation of various query reordering [BGI]. Estimating predicate selectivity, or the fraction of rows in a database that ...

article

Free

Improved histograms for selectivity estimation of range predicates

Pages 294–305https://doi.org/10.1145/235968.233342

Many commercial database systems maintain histograms to summarize the contents of relations and permit efficient estimation of query result sizes and access plan costs. Although several types of histograms have been proposed in the past, there has never ...

article

Free

Structures for manipulating proposed updates in object-oriented databases

Pages 306–317https://doi.org/10.1145/235968.233344

Support for virtual states and deltas between them is useful for a variety of database applications, including hypothetical database access, version management, simulation, and active databases. The Heraclitus paradigm elevates delta values to be "first-...

article

Free

Safe and efficient sharing of persistent objects in Thor

Pages 318–329https://doi.org/10.1145/235968.233346

Thor is an object-oriented database system designed for use in a heterogeneous distributed environment. It provides highly-reliable and highly-available persistent storage for objects, and supports safe sharing of these objects by applications written ...

article

Free

An open abstract-object storage system

Pages 330–340https://doi.org/10.1145/235968.233348

Database systems must become more open to retain their relevance as a technology of choice and necessity. Openness implies not only databases exporting their data, but also exporting their services. This is as true in classical application areas as in ...

Sections

Save to Binder

Subjects

Comments