Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Optimizing chain queries in a distributed database system.

Published: 01 February 1984 Publication History
  • Get Citation Alerts
  • Abstract

    No abstract available.

    Cited By

    View all
    • (2008)Scalable multi-query optimization for exploratory queries over federated scientific databasesProceedings of the VLDB Endowment10.14778/1453856.14538641:1(16-27)Online publication date: 1-Aug-2008
    • (2008)BioScoutProceedings of the 11th international conference on Extending database technology: Advances in database technology10.1145/1353343.1353437(730-734)Online publication date: 25-Mar-2008
    • (1997)Adaptive Join Algorithms in Dynamic Distributed DatabasesDistributed and Parallel Databases10.1023/A:10086197050795:1(5-30)Online publication date: 1-Jan-1997
    • Show More Cited By

    Recommendations

    Reviews

    Mark Robbin Brown

    The paper considers the problem of evaluating a relational query consisting of the join of N distinct relations, projected to the columns of the first relation. The main result of the paper is an efficient algorithm that, given such a query, finds the lowest-cost program for it that consists of a sequence of semijoins. The semijoin of two relations is just the join of the two relations, projected to the columns of the first relation. Semijoins may be interesting in distributed systems because to compute a semijoin, only the joining column of the second relation needs to be communicated to the site of the first relation. Perhaps the most surprising result of the paper is that in some cases the optimal program is quite long (containing roughly N :9I 2/2 semijoins). It is quite difficult to produce mathematical results that are relevant to the problem of computing distributed queries on a real database. The impact of this paper is lessened in several ways, some of which are mentionaled by the authors in their concluding section. The paper focuses on queries that perform joins in order to select data from a single relation, but most programs perform joins in order to combine data from several tables. The paper focuses on semijoin programs, so the optimizer produces programs that do not use joins even when this would reduce the total cost. The cost measure is an estimate of total communication time, and it does not account for processing and disk I/O time or for potential parallel communication. Relations are assumed to be stored at distinct sites. Finally, the optimization algorithm requires knowledge of the database that is expensive to obtain and is invalidated by database updates. This need for detailed information may be inherent in algorithms that optimize performance against an arbitrary database, as opposed to those which optimize in some average sense given incomplete information about the database or those which optimize per instance over a restricted set of databases.

    Access critical reviews of Computing literature here

    Become a reviewer for Computing Reviews.

    Comments

    Information & Contributors

    Information

    Published In

    cover image SIAM Journal on Computing
    SIAM Journal on Computing  Volume 13, Issue 1
    Feb. 1984
    37 pages

    Publisher

    Society for Industrial and Applied Mathematics

    United States

    Publication History

    Published: 01 February 1984

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2008)Scalable multi-query optimization for exploratory queries over federated scientific databasesProceedings of the VLDB Endowment10.14778/1453856.14538641:1(16-27)Online publication date: 1-Aug-2008
    • (2008)BioScoutProceedings of the 11th international conference on Extending database technology: Advances in database technology10.1145/1353343.1353437(730-734)Online publication date: 25-Mar-2008
    • (1997)Adaptive Join Algorithms in Dynamic Distributed DatabasesDistributed and Parallel Databases10.1023/A:10086197050795:1(5-30)Online publication date: 1-Jan-1997
    • (1997)Adaptive Algorithms for Join Processing in Distributed Database SystemsDistributed and Parallel Databases10.1023/A:10086179119925:3(233-269)Online publication date: 1-Jul-1997
    • (1996)On the Complexity of Distributed Query OptimizationIEEE Transactions on Knowledge and Data Engineering10.1109/69.5362568:4(650-662)Online publication date: 1-Aug-1996
    • (1994)A Graph Theoretical Approach to Determine a Join Reducer Sequence in Distributed Query ProcessingIEEE Transactions on Knowledge and Data Engineering10.1109/69.2730346:1(152-165)Online publication date: 1-Feb-1994
    • (1994)Database placement in communication networks for minimizing the overall transmission costMathematical and Computer Modelling: An International Journal10.1016/0895-7177(94)90111-219:1(7-19)Online publication date: 1-Jan-1994
    • (1993)Combining Joint and Semi-Join Operations for Distributed Query ProcessingIEEE Transactions on Knowledge and Data Engineering10.1109/69.2242055:3(534-542)Online publication date: 1-Jun-1993
    • (1992)Interleaving a Join Sequence with Semijoins in Distributed Query ProcessingIEEE Transactions on Parallel and Distributed Systems10.1109/71.1590443:5(611-621)Online publication date: 1-Sep-1992
    • (1992)A Parallel Execution Method for Minimizing Distributed Query Response TimeIEEE Transactions on Parallel and Distributed Systems10.1109/71.1392063:3(325-333)Online publication date: 1-May-1992
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media