- short-paper, May 2020
Starling: A Scalable Query Engine on Cloud Functions
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, Pages 131–141, https://doi.org/10.1145/3318464.3380609
Much like on-premises systems, the natural choice for running database analytics workloads in the cloud is to provision a cluster of nodes to run a database instance. However, analytics workloads are often bursty or low volume, leaving clusters idle ...
- panel, June 2013
We are drowning in a sea of least publishable units (LPUs)
SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, Pages 921–922, https://doi.org/10.1145/2463676.2465345
Our field is drowning in a sea of conference submissions. We assert that the sheer number of papers has begun to seriously hurt the quality of the work that the field is doing and that the field is going to implode unless we take action to remedy the ...
- research-article, June 2013
Query processing on smart SSDs: opportunities and challenges
SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, Pages 1221–1230, https://doi.org/10.1145/2463676.2465295
Data storage devices are getting "smarter." Smart Flash storage devices (a.k.a. "Smart SSD") are on the horizon and will package CPU processing and DRAM storage inside a Smart SSD, and make that available to run user programs inside a Smart SSD. The ...
- research-article, June 2013
Split query processing in polybase
- David J. DeWitt,
- Alan Halverson,
- Rimma Nehme,
- Srinath Shankar,
- Josep Aguilar-Saborit,
- Artin Avanes,
- Miro Flasza,
- Jim Gramling
SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, Pages 1255–1266, https://doi.org/10.1145/2463676.2463709
This paper presents Polybase, a feature of SQL Server PDW V2 that allows users to manage and query data stored in a Hadoop cluster using the standard SQL query language. Unlike other database systems that provide only a relational view over HDFS-...
- research-article, May 2012
Query optimization in microsoft SQL server PDW
- Srinath Shankar,
- Rimma Nehme,
- Josep Aguilar-Saborit,
- Andrew Chung,
- Mostafa Elhemali,
- Alan Halverson,
- Eric Robinson,
- Mahadevan Sankara Subramanian,
- David DeWitt,
- César Galindo-Legaria
SIGMOD '12: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, Pages 767–776, https://doi.org/10.1145/2213836.2213953
In recent years, Massively Parallel Processors have increasingly been used to manage and query vast amounts of data. Dramatic performance improvements are achieved through distributed execution of queries across many nodes. Query optimization for such ...
- research-article, June 2011
Turbocharging DBMS buffer pool using SSDs
SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, Pages 1113–1124, https://doi.org/10.1145/1989323.1989442
Flash solid-state drives (SSDs) are changing the I/O landscape, which has largely been dominated by traditional hard disk drives (HDDs) for the last 50 years. In this paper we propose and systematically explore designs for using an SSD to improve the ...
A comparison of approaches to large-scale data analysis
- Andrew Pavlo,
- Erik Paulson,
- Alexander Rasin,
- Daniel J. Abadi,
- David J. DeWitt,
- Samuel Madden,
- Michael Stonebraker
SIGMOD '09: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, Pages 165–178, https://doi.org/10.1145/1559845.1559865
There is currently considerable enthusiasm around the MapReduce (MR) paradigm for large-scale data analysis [17]. Although the basic control flow of this framework has existed in parallel SQL database management systems (DBMS) for over 20 years, some ...
- Article, June 2007
How to barter bits for chronons: compression and bandwidth trade offs for database scans
SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, Pages 389–400, https://doi.org/10.1145/1247480.1247525
Two trends are converging to make the CPU cost of a table scan a more important component of database performance. First, table scans are becoming a larger fraction of the query processing workload, and second, large memories and compression are making ...
- Article, June 2006
Database support for matching: limitations and opportunities
SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data, Pages 85–96, https://doi.org/10.1145/1142473.1142484
We define a match join of R and S with predicate θ to be a subset of the θ-join of R and S such that each tuple of R and S contributes to at most one result tuple. Match joins and their generalizations belong to a broad class of matching problems that ...
- Article, June 2005
Proactive re-optimization with Rio
SIGMOD '05: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, Pages 936–938, https://doi.org/10.1145/1066157.1066294
Traditional query optimizers rely on the accuracy of estimated statistics of intermediate subexpressions to choose good query execution plans. This design often leads to suboptimal plan choices for complex queries since errors in estimates grow ...
- Article, June 2005
Proactive re-optimization
SIGMOD '05: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, Pages 107–118, https://doi.org/10.1145/1066157.1066171
Traditional query optimizers rely on the accuracy of estimated statistics to choose good execution plans. This design often leads to suboptimal plan choices for complex queries, since errors in estimates for intermediate subexpressions grow ...
- Article, June 2005
Incognito: efficient full-domain K-anonymity
SIGMOD '05: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, Pages 49–60, https://doi.org/10.1145/1066157.1066164
A number of organizations publish microdata for purposes such as public health and demographic research. Although attributes that clearly identify individuals, such as Name and Social Security Number, are generally removed, these databases can sometimes ...
- Article, June 2004
- proceeding, June 2002
SIGMOD '02: Proceedings of the 2002 ACM SIGMOD international conference on Management of data
The 2002 ACM SIGMOD International Conference on Management of Data was held June 4-6, 2002 at the spectacular Frank Lloyd Wright-designed Monona Terrace conference center in Madison, Wisconsin. The SIGMOD conference has long held its status as a leading ...
- Article, May 2001
On supporting containment queries in relational database management systems
SIGMOD '01: Proceedings of the 2001 ACM SIGMOD international conference on Management of data, Pages 425–436, https://doi.org/10.1145/375663.375722
Virtually all proposals for querying XML include a class of query we term “containment queries”. It is also clear that in the foreseeable future, a substantial amount of XML data will be stored in relational database systems. This raises the question of ...
Also Published in:
ACM SIGMOD Record: Volume 30 Issue 2
- Article, May 2000
NiagaraCQ: a scalable continuous query system for Internet databases
SIGMOD '00: Proceedings of the 2000 ACM SIGMOD international conference on Management of data, Pages 379–390, https://doi.org/10.1145/342009.335432
Continuous queries are persistent queries that allow users to receive new results when they become available. While continuous query systems can transform a passive web into an active environment, they need to be able to support millions of queries due ...
Also Published in:
ACM SIGMOD Record: Volume 29 Issue 2
- Article, June 1998
Efficient mid-query re-optimization of sub-optimal query execution plans
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data, Pages 106–117, https://doi.org/10.1145/276304.276315
For a number of reasons, even the best query optimizers can very often produce sub-optimal query execution plans, leading to a significant degradation of performance. This is especially true in databases used for complex decision support queries and/or ...
Also Published in:
ACM SIGMOD Record: Volume 27 Issue 2
- Article, June 1997
The BUCKY object-relational benchmark
- Michael J. Carey,
- David J. DeWitt,
- Jeffrey F. Naughton,
- Mohammad Asgarian,
- Paul Brown,
- Johannes E. Gehrke,
- Dhaval N. Shah
SIGMOD '97: Proceedings of the 1997 ACM SIGMOD international conference on Management of data, Pages 135–146, https://doi.org/10.1145/253260.253283
According to various trade journals and corporate marketing machines, we are now on the verge of a revolution—the object-relational database revolution. Since we believe that no one should face a revolution without appropriate armaments, this paper ...
Also Published in:
ACM SIGMOD Record: Volume 26 Issue 2
- Article, June 1996
Partition based spatial-merge join
SIGMOD '96: Proceedings of the 1996 ACM SIGMOD international conference on Management of data, Pages 259–270, https://doi.org/10.1145/233269.233338
This paper describes PBSM (Partition Based Spatial-Merge), a new algorithm for performing spatial join operation. This algorithm is especially effective when neither of the inputs to the join have an index on the joining attribute. Such a situation ...
Also Published in:
ACM SIGMOD Record: Volume 25 Issue 2
- Article, May 1995
The SPIFFI scalable video-on-demand system
SIGMOD '95: Proceedings of the 1995 ACM SIGMOD international conference on Management of data, Pages 352–363, https://doi.org/10.1145/223784.223851
This paper presents a simulation study of a video-on-demand system. We present video server algorithms for real-time disk scheduling, prefetching, and buffer pool management. The performance of these algorithms is compared against the performance of ...
Also Published in:
ACM SIGMOD Record: Volume 24 Issue 2