Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/645442.652671guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Distributed Query Processing on the Grid

Published: 18 November 2002 Publication History

Abstract

Distributed query processing (DQP) has been widely used in data intensive applications where data of relevance to users is stored in multiple locations. This paper argues: (i) that DQP can be important in the Grid, as a means of providing high-level, declarative languages for integrating data access and analysis; and (ii) that the Grid provides resource management facilities that are useful to developers of DQP systems. As well as discussing and illustrating how DQP technologies can be deployed within the Grid, the paper describes a prototype implementation of a DQP system running over Globus.

References

[1]
R.G.G. Cattell and D.K. Barry. The Object Database Standard: ODMG 3.0 . Morgan Kaufmann, 2000.
[2]
M. Cornell, N.W. Paton, S. Wu, C.A. Goble, C.J. Miller, P. Kirby, K. Eilbeck, A. Brass, A. Hayes, and S.G. Oliver. GIMS - A Data Warehouse for Storage and Analysis of Genome Sequence and Functional Data. In Proc. 2nd IEEE Symposium on Bioinformatics and Bioengineering (BIBE) , pages 15-22. IEEE Press, 2001.
[3]
P. Dinda and B. Plale. A unified relational approachto grid information services. Technical Report GWD-GIS-012-1, Global Grid Forum, 2001.
[4]
L. Fegaras and D. Maier. Optimizing object queries using an effective calculus. ACM Transactions on Database Systems , 24(4):457-516, December 2000.
[5]
I Foster and N. T. Karonis. A Grid-Enabled MPI: Message Passing in Heterogeneous Distributed Computing Systems. In Proc. Supercomputing (SC) . IEEE Computer Society, 1998. Online at: http://www.supercomp.org/sc98/proceedings/.
[6]
I. Foster, C. Kesselman, J. Nick, and S. Tuecke. Grid Services for Distributed System Integration. IEEE Computer , 35:37-46, 2002.
[7]
G. Graefe. Encapsulation of parallelism in the Volcano query processing system. In ACM SIGMOD , pages 102-111, 1990.
[8]
G. Graefe. Query evaluation techniques for large databases. ACM Computing Surveys , 25(2):73-170, June 1993.
[9]
L. Haas, D. Kossmann, E.L. Wimmers, and J. Yang. Optimizing Queries Across Diverse Data Sources. In Proc. VLDB , pages 276-285. Morgan-Kaufmann, 1997.
[10]
W. Hasan and R. Motwani. Coloring away communication in parallel query optimization. In Proceedings of the 21th VLDB Conference , 1995.
[11]
D. Hsiao. Tutorial on Federated Databases and Systems. The VLDB Journal , 1(1):127-179, 1992.
[12]
D. Kossmann. The State of the Art in Distributed Query Processing. ACM Computing Surveys , 32(4):422-469, 2000.
[13]
M.T. Ozsu and P. Valduriez, editors. Principles of Distributed Database Systems (Second Edition) . Prentice-Hall, 1999.
[14]
E. Rahm and R. Marek. Dynamic multi-resource load balancing in parallel database systems. In Proc. 21st VLDB Conf. , pages 395-406, 1995.
[15]
S.F.M. Sampaio, J. Smith, N.W. Paton, and P. Watson. An Experimental Performance Evaluation of Join Algorithms for Parallel Object Databases. In R. Sakellariou et al., editors, Proc. 7th Intl. Euro-Par Conference , pages 280-291. Springer-Verlag, 2001.
[16]
J. Smith, S. F. M. Sampaio, P. Watson, and N. W. Paton. Polar: An architecture for a parallel ODMG compliant object database. In Conference on Information and Knowledge Management (CIKM) , pages 352-359. ACM press, 2000.
[17]
M. Snir, S. Otto, S. Huss-Lederman, D. Walker, and J. Dongarra. MPI - The Complete Reference. The MIT Press, Cambridge, Massachusetts, 1998. ISBN: 0-262-69215-5.
[18]
A. Szalay, P. Z. Kunszt, A. Thakar, J. Gray, and D. R. Slut. Designing and mining multi-terabyte astronomy archives: The sloan digital sky survey. In Proc. ACM SIGMOD , pages 451-462. ACM Press, 2000.
[19]
G. von Laszewski, I. Foster, J. Gawor, and P. Lane. A Java Commodity Grid Kit. Concurrency and Computation: Practice and Experience , 13(8-9):643-662, 2001.
[20]
P. Watson. Databases and the Grid. Technical Report CS-TR-755, University of Newcastle, 2001.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
GRID '02: Proceedings of the Third International Workshop on Grid Computing
November 2002
316 pages
ISBN:3540001336

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 18 November 2002

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2013)Resource Allocation for Query Optimization in Data Grid SystemsProceedings of the 17th East European Conference on Advances in Databases and Information Systems - Volume 813310.1007/978-3-642-40683-6_24(316-329)Online publication date: 1-Sep-2013
  • (2012)Case for dynamic deployment in a grid-based distributed query processorFuture Generation Computer Systems10.1016/j.future.2011.05.01828:1(171-183)Online publication date: 1-Jan-2012
  • (2011)Quality of experience in distributed databasesDistributed and Parallel Databases10.1007/s10619-011-7083-x29:5-6(361-396)Online publication date: 1-Oct-2011
  • (2009)A service-oriented system for distributed data querying and integration on GridsFuture Generation Computer Systems10.1016/j.future.2008.11.00925:5(511-524)Online publication date: 1-May-2009
  • (2008)A monitoring service for large-scale dynamic query optimisation in a grid environmentInternational Journal of Web and Grid Services10.1504/IJWGS.2008.0188894:2(222-246)Online publication date: 1-Jun-2008
  • (2008)Service-based data integration using OGSA-DQP and OGSA-WebDBProceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing10.1109/GRID.2008.4662795(160-167)Online publication date: 29-Sep-2008
  • (2008)Data Transformation Services over Grids with Real-Time Bound ConstraintsProceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:10.1007/978-3-540-88871-0_60(852-869)Online publication date: 9-Nov-2008
  • (2008)QoS-Oriented Reputation-Aware Query Scheduling in Data GridsProceedings of the 14th international Euro-Par conference on Parallel Processing10.1007/978-3-540-85451-7_53(489-498)Online publication date: 26-Aug-2008
  • (2006)Adding dynamism to OGSA-DQPProceedings of the CoreGRID 2006, UNICORE Summit 2006, Petascale Computational Biology and Bioinformatics conference on Parallel processing10.5555/1765606.1765611(22-33)Online publication date: 29-Aug-2006
  • (2006)Data access and integration in the ISPIDER proteomics gridProceedings of the Third international conference on Data Integration in the Life Sciences10.1007/11799511_3(3-18)Online publication date: 20-Jul-2006
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media