
Combining Join and Semi-Join Operations for Distributed Query Processing

Published: 01 June 1993

Abstract

    The application of a combination of join and semi-join operations to minimize the amount of data transmission required for distributed query processing is discussed. Specifically, two important concepts that arise when join operations are used as reducers in query processing, namely gainful semi-joins and pure join attributes, are used. Some semi-joins, though not profitable themselves, may benefit the execution of subsequent join operations as reducers. Such a semi-join is termed a gainful semi-join. In addition, join attributes that are not part of the output attributes are referred to as pure join attributes. The authors exploit the usefulness of gainful semi-joins and use the removability of pure join attributes to reduce the amount of data transmission required for query processing. Heuristic searches are developed to determine a sequence of join and semi-join reducers for query processing. Results indicate the importance of combining joins and semi-joins for distributed query processing.
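
To make the terminology concrete, the following is a minimal Python sketch of a semi-join used as a reducer and of projecting away a pure join attribute. It is a toy illustration, not the authors' heuristic: the relations `R` and `S`, the attribute names, and the cost model (one unit per attribute value shipped between sites) are all assumptions introduced here.

```python
# Toy illustration only. Relations are lists of dicts; the "cost" of shipping
# a relation is assumed to be the number of attribute values it contains.

def project(relation, attrs):
    """Project onto attrs, removing duplicate tuples (set semantics)."""
    seen = set()
    out = []
    for row in relation:
        key = tuple(row[a] for a in attrs)
        if key not in seen:
            seen.add(key)
            out.append(dict(zip(attrs, key)))
    return out

def semi_join(r, s, attr):
    """R semi-join S on attr: keep the tuples of R that have a match in S."""
    values = {row[attr] for row in s}
    return [row for row in r if row[attr] in values]

def join(r, s, attr):
    """Join R and S on the single shared attribute attr (nested loops)."""
    return [{**x, **y} for x in r for y in s if x[attr] == y[attr]]

def cost(relation):
    """Assumed transmission cost: total number of attribute values sent."""
    return sum(len(row) for row in relation)

# Hypothetical data: R is stored at site 1 and S at site 2. The query outputs
# (name, item), so the join attribute "id" is a pure join attribute.
R = [{"id": i, "name": f"n{i}"} for i in range(10)]
S = [{"id": 2, "item": "a"}, {"id": 3, "item": "b"}, {"id": 3, "item": "c"}]

# Plan A: ship all of R to site 2 and join there.
cost_a = cost(R)

# Plan B: ship only the join column of S to site 1, reduce R with a
# semi-join, then ship the reduced R to site 2.
s_ids = project(S, ["id"])
r_reduced = semi_join(R, s_ids, "id")
cost_b = cost(s_ids) + cost(r_reduced)

# "id" is not an output attribute, so it can be dropped after the join,
# which shrinks any result that still has to be transmitted.
joined = join(r_reduced, S, "id")
result = project(joined, ["name", "item"])

print(cost_a, cost_b, cost(joined), cost(result))
```

In this toy setting the semi-join pays for itself directly. A gainful semi-join in the abstract's sense is one that would not pay for itself in isolation but still lowers the cost of a later join used as a reducer; showing that requires at least three relations and is left out of the sketch.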




        Published In

        IEEE Transactions on Knowledge and Data Engineering  Volume 5, Issue 3
        June 1993
        176 pages

        Publisher

        IEEE Educational Activities Department

        United States

        Author Tags

        1. data transmission
        2. distributed databases
        3. distributed query processing
        4. gainful semi-joins
        5. heuristic searches
        6. join operations
        7. join attributes
        8. query processing
        9. reducers
        10. semijoin operations

        Qualifiers

        • Research-article


        Cited By

        • (2023) INEv: In-Network Evaluation for Event Stream Processing. Proceedings of the ACM on Management of Data 1:1 (1-26). DOI: 10.1145/3588955. Online publication date: 30-May-2023
        • (2014) A multi-colony ant algorithm for optimizing join queries in distributed database systems. Knowledge and Information Systems 39:1 (175-206). DOI: 10.1007/s10115-012-0608-4. Online publication date: 1-Apr-2014
        • (2010) Distributed stream join query processing with semijoins. Distributed and Parallel Databases 27:3 (211-254). DOI: 10.1007/s10619-010-7062-7. Online publication date: 1-Jun-2010
        • (2005) Query Processing in a Mobile Computing Environment. IEEE Transactions on Knowledge and Data Engineering 17:7 (982-996). DOI: 10.1109/TKDE.2005.115. Online publication date: 1-Jul-2005
        • (2002) Processing Distributed Mobile Queries with Interleaved Remote Mobile Joins. IEEE Transactions on Computers 51:10 (1182-1195). DOI: 10.1109/TC.2002.1039844. Online publication date: 1-Oct-2002
        • (2001) Distributed Query Processing in the Internet. Proceedings of the 21st International Conference on Distributed Computing Systems. DOI: 10.5555/876878.879319. Online publication date: 16-Apr-2001
        • (1996) On the Complexity of Distributed Query Optimization. IEEE Transactions on Knowledge and Data Engineering 8:4 (650-662). DOI: 10.1109/69.536256. Online publication date: 1-Aug-1996
        • (1996) Optimization of Parallel Execution for Multi-Join Queries. IEEE Transactions on Knowledge and Data Engineering 8:3 (416-428). DOI: 10.1109/69.506709. Online publication date: 1-Jun-1996
        • (1995) PERF join. Proceedings of the fourth international conference on Information and knowledge management (137-144). DOI: 10.1145/221270.221360. Online publication date: 2-Dec-1995
        • (1994) A Graph Theoretical Approach to Determine a Join Reducer Sequence in Distributed Query Processing. IEEE Transactions on Knowledge and Data Engineering 6:1 (152-165). DOI: 10.1109/69.273034. Online publication date: 1-Feb-1994
