Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2213836.2213953acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Query optimization in microsoft SQL server PDW

Published: 20 May 2012 Publication History

Abstract

In recent years, Massively Parallel Processors have increasingly been used to manage and query vast amounts of data. Dramatic performance improvements are achieved through distributed execution of queries across many nodes. Query optimization for such system is a challenging and important problem.
In this paper we describe the Query Optimizer inside the SQL Server Parallel Data Warehouse product (PDW QO). We leverage existing QO technology in Microsoft SQL Server to implement a cost-based optimizer for distributed query execution. By properly abstracting metadata we can readily reuse existing logic for query simplification, space exploration and cardinality estimation. Unlike earlier approaches that simply parallelize the best serial plan, our optimizer considers a rich space of execution alternatives, and picks one based on a cost-model for the distributed execution environment. The result is a high-quality, effective query optimizer for distributed query processing in an MPP.

References

[1]
Aster Data. http://www.asterdata.com/.
[2]
Greenplum. http://www.greenplum.com/.
[3]
M. M. Astrahan, M. W. Blasgen, D. D. Chamberlin, K. P. Eswaran, J. Gray, P. P. Griffiths, W. F. K. III, R. A. Lorie, P. R. McJones, J. W. Mehl, G. R. Putzolu, I. L. Traiger, B. W. Wade, and V. Watson. System R: Relational approach to database management. ACM Trans. Database Syst., 1(2):97--137, 1976.
[4]
M. Elhemali and L. Giakoumakis. Unit-testing query transformation rules. In DBTest, pages 1--6, 2008.
[5]
G. Graefe. The Cascades framework for query optimization. Data Engineering Bulletin, 18(3), 1995.
[6]
G. Graefe and W. J. McKenna. The Volcano optimizer generator: Extensibility and efficient search. In ICDE, 1993.
[7]
J. R. Haritsa. The picasso database query optimizer visualizer. PVLDB, 3(2):1517--1520, 2010.
[8]
W. Hong and M. Stonebraker. Optimization of parallel query execution plans in XPRS. Technical Report UCB/ERL M91/50, EECS Department, University of California, Berkeley, 1991.
[9]
Microsoft Corporation. Microsoft SQL Server PDW. riptsize http://www.microsoft.com/sqlserver/en/us/solutions-technologies/ data-warehousing/pdw.aspx.
[10]
R. V. Nehme and N. Bruno. Automated partitioning design in parallel database systems. In SIGMOD, 2011.

Cited By

View all
  • (2024)Presto's History-Based Query OptimizerProceedings of the VLDB Endowment10.14778/3685800.368582817:12(4077-4089)Online publication date: 1-Aug-2024
  • (2024)Unified Query Optimization in the Fabric Data WarehouseCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653369(18-30)Online publication date: 9-Jun-2024
  • (2022)Redundancy Elimination in Distributed Matrix ComputationProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3517877(573-586)Online publication date: 10-Jun-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '12: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
May 2012
886 pages
ISBN:9781450312479
DOI:10.1145/2213836
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 May 2012

Permissions

Request permissions for this article.

Check for updates

Author Tag

  1. query optimization

Qualifiers

  • Research-article

Conference

SIGMOD/PODS '12
Sponsor:

Acceptance Rates

SIGMOD '12 Paper Acceptance Rate 48 of 289 submissions, 17%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)99
  • Downloads (Last 6 weeks)10
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Presto's History-Based Query OptimizerProceedings of the VLDB Endowment10.14778/3685800.368582817:12(4077-4089)Online publication date: 1-Aug-2024
  • (2024)Unified Query Optimization in the Fabric Data WarehouseCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653369(18-30)Online publication date: 9-Jun-2024
  • (2022)Redundancy Elimination in Distributed Matrix ComputationProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3517877(573-586)Online publication date: 10-Jun-2022
  • (2020)Generalized sub-query fusion for eliminating redundant I/O from big-data queriesProceedings of the 14th USENIX Conference on Operating Systems Design and Implementation10.5555/3488766.3488778(209-224)Online publication date: 4-Nov-2020
  • (2020)POLARISProceedings of the VLDB Endowment10.14778/3415478.341554513:12(3204-3216)Online publication date: 14-Sep-2020
  • (2019)A New Statistics Collecting Method with Adaptive StrategyDatabase Systems for Advanced Applications10.1007/978-3-030-18590-9_59(425-429)Online publication date: 24-Apr-2019
  • (2018)Fusion insight librAProceedings of the VLDB Endowment10.14778/3229863.322987011:12(1822-1834)Online publication date: 1-Aug-2018
  • (2018)Rapid Adoption of Cloud Data Warehouse Technology Using Datometry Hyper-QProceedings of the 2018 International Conference on Management of Data10.1145/3183713.3190652(825-839)Online publication date: 27-May-2018
  • (2018)An Automatic Verification Assembly Line System for Electricity Meter2018 Eighth International Conference on Instrumentation & Measurement, Computer, Communication and Control (IMCCC)10.1109/IMCCC.2018.00190(899-904)Online publication date: Jul-2018
  • (2018)A Strategy of Efficient and Accurate Cardinality Estimation Based on Query ResultXibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University10.1051/jnwpu/2018364076836:4(768-777)Online publication date: 24-Oct-2018
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media