research-article

Efficient bulk insertion into a distributed ordered table

Authors:

Adam Silberstein,

Brian F. Cooper,

Utkarsh Srivastava,

Ramana Yerneni,

Raghu RamakrishnanAuthors Info & Claims

SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data

Pages 765 - 778

https://doi.org/10.1145/1376616.1376693

Published: 09 June 2008 Publication History

Abstract

We study the problem of bulk-inserting records into tables in a system that horizontally range-partitions data over a large cluster of shared-nothing machines. Each table partition contains a contiguous portion of the table's key range, and must accept all records inserted into that range. Examples of such systems include BigTable[8] at Google, and PNUTS [15] at Yahoo! During bulk inserts into an existing table, if most of the inserted records end up going into a small number of data partitions, the obtained throughput may be very poor due to ineffective use of cluster parallelism. We propose a novel approach in which a planning phase is invoked before the actual insertions. By creating new partitions and intelligently distributing partitions across machines, the planning phase ensures that the insertion load will be well-balanced. Since there is a tradeoff between the cost of moving partitions and the resulting throughput gain, the planning phase must minimize the sum of partition movement time and insertion time. We show that this problem is a variation of NP-hard bin-packing, reduce it to a problem of packing vectors, and then give a solution with provable approximation guarantees. We evaluate our approach on a prototype system deployed on a cluster of 50 machines, and show that it yields significant improvements over more naïve techniques.

References

[1]

GridFTP. http://www.globus.org/grid_software/data/gridftp.php.

[2]

Hacmp for system p servers. http://www-03.ibm.com/systems/p/software/hacmp/index.html.

[3]

Oracle real application clusters 11g. http://www.oracle.com/technology/products/database/clustering/index.html.

[4]

Scalability and performance with oracle 11g database. http://www.oracle.com/technology/deploy/performance/pdf/db_perfscale_11gr1_twp.pdf, 2007.

[5]

M. K. Aguilera, A. Merchant, M. Shah, A. Veitch, and C. Karamanolis. Sinfonia: A new paradigm for building scalable distributed systems. In Proc. SOSP, October 2007.

Digital Library

[6]

P. Bernstein, N. Dani, B. Khessib, R. Manne, and D. Shutt. Data management issues in supporting large-scale web services. IEEE Data Engineering Bulletin, December 2006.

[7]

L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker. Web caching and zipf-like distributions: Evidence and implications. In Proc. INFOCOM, 1999.

[8]

F. Chang et al. Bigtable: A distributed storage system for structured data. In OSDI, 2006.

Digital Library

[9]

C. Chekuri and S. Khanna. A polynomial time approximation scheme for the multiple knapsack problem. SIAM Journal on Computing, 35(3):713--728, 2003.

Digital Library

[10]

B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H. Jacobsen, N. Puz, D. Weaver, and R. Yerneni. PNUTS: Yahoo!'s hosted data serving platform. Technical report, Yahoo! Research, 2008.

[11]

J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In Proc. OSDI, December 2004.

Digital Library

[12]

G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. Dynamo: Amazon's highly available key-value store. In Proc. SOSP, October 2007.

Digital Library

[13]

D. J. DeWitt, J. F. Naughton, and D. A. Schneider. Parallel sorting on a shared-nothing architecture using probabilistic splitting. In Proc. Conference on Parallel and Distributed Information Systems, pages 280--291, 1991.

Digital Library

[14]

G. Graefe. B-tree indexes for high update rates. SIGMOD Record, 35(1):39--44, March 2006.

Digital Library

[15]

C. S. Group. Community systems research at yahoo! SIGMOD Record, 36(3):47--54, September 2007.

Digital Library

[16]

S. Hung and J. Fisk. An algorithm for 0-1 multiple knapsack problems. In Naval Res. Logist. Quart., pages 571--579, 1978.

[17]

H. Jagadish, P. Narayan, S. Seshadri, S. Sudarshan, and R. Kanneganti. Incremental organization for data recording and warehousing. In Proc. Very Large Databases, August 1997.

Digital Library

[18]

R. Loulou and E. Michaelides. New greedy-like heuristics for the multidimensional 0-1 knapsack problem. Operations Research, 27:1101--1114, 1979.

Digital Library

[19]

S. Martello and P. Toth. Knapsack Problems: Algorithms and Computer Implementations. John Wiley and Sons, 1990.

Digital Library

[20]

S. Mishra. Loading bulk data into a partitioned table. http://www.microsoft.com/technet/prodtechnol/sql/bestpractice/loading_bulk_data_partitioned_table.mspx, September 2006.

[21]

C. Mohan and I. Narang. Algorithms for creating indexes for very large tables without quiescing updates. In Proc. SIGMOD, June 1992.

Digital Library

[22]

P. O'Neil, E. Cheng, D. Gawlick, and E. O'Neil. The log-sructured merge-tree. In Acta Informatica, volume 33, pages 351--385, 1996.

Digital Library

[23]

S. Senju and Y. Toyoda. An approach to linear programming with 0-1 variables. Management Science, 15(4):B196--B207, 1968.

Digital Library

[24]

S. Seshadri and J. Naughton. Sampling issues in parallel database systems. In Proc. Extending Database Technology, March 1992.

Digital Library

[25]

J. van den Bercken, B. Seeger, and P. Widmayer. A generic approach to bulk loading multidimensional index structures. In Proc. Very Large Databases, August 1997.

Digital Library

[26]

M. Vasquez and J. Hao. A hybrid approach for the 01 multidimensional knapsack problem. In IJCAI, pages 328--333, 2001.

Digital Library

[27]

S. A. Weil, S. A. Brandt, E. L. Miller, D. D. E. Long, and C. Maltzahn. Ceph: A scalable, high-performance distributed file system. In Proc. OSDI, 2006.

Digital Library

[28]

J. Wiener and J. Naughton. Oodb bulk loading revisited: The partitioned-list approach. In Proc. Very Large Databases, September 1995.

Digital Library

Cited By

He XCai PZhou XZhou A(2021)Continuously Bulk Loading over Range Partitioned Tables for Large Scale Historical Data2021 IEEE 37th International Conference on Data Engineering (ICDE)10.1109/ICDE51399.2021.00088(960-971)Online publication date: Apr-2021
https://doi.org/10.1109/ICDE51399.2021.00088
Chen GVo HWu SOoi BÖzsu M(2020)A Framework for supporting DBMS-like indexes in the cloudProceedings of the VLDB Endowment10.14778/3402707.34027114:11(702-713)Online publication date: 3-Jun-2020
https://dl.acm.org/doi/10.14778/3402707.3402711
Cooper BNarayan PRamakrishnan RSrivastava USilberstein ABohannon PJacobsen HPuz NWeaver DYerneni R(2019)PNUTS to SherpaProceedings of the VLDB Endowment10.14778/3352063.335214612:12(2300-2307)Online publication date: 1-Aug-2019
https://dl.acm.org/doi/10.14778/3352063.3352146
Show More Cited By

Index Terms

Efficient bulk insertion into a distributed ordered table
1. Information systems
  1. Data management systems
    1. Database management system engines
      1. Parallel and distributed DBMSs

Recommendations

Bulk construction of dynamic clustered metric trees

Repositories of complex data types, such as images, audio, video and free text, are becoming increasingly frequent in various fields. A general searching approach for such data types is that of similarity search, where the search is for similar objects ...
The Intersection of Insertion and Deletion Balls
2021 IEEE Information Theory Workshop (ITW)
This paper studies the intersections of insertion and deletion balls. The t-insertion, t-deletion ball of a sequence x is the set of all sequences received by t insertions, deletions to x, respectively. While the intersection of either deletion balls or ...
P Systems with Insertion and Deletion Exo-Operations
Theory that Counts: To Oscar Ibarra on His 70th Birthday

It is known that insertion-deletion (P) systems with two symbols context-free insertion and deletion rules are not computationally complete. It is thus interesting to consider conditions that would allow such systems to reach computational completeness. ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data

June 2008

1396 pages

ISBN:9781605581026

DOI:10.1145/1376616

General Chairs:
Laks V. S. Lakshmanan
University of British Columbia, Canada
,
Raymond T. Ng
University of British Columbia, Canada
,
Dennis Shasha
New York University, USA

Copyright © 2008 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2008

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SIGMOD/PODS '08

Sponsor:

SIGMOD/PODS '08: SIGMOD/PODS '08 - International Conference on Management of Data

June 9 - 12, 2008

Vancouver, Canada

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

30
Total Citations
View Citations
924
Total Downloads

Downloads (Last 12 months)6
Downloads (Last 6 weeks)2

Reflects downloads up to 24 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

He XCai PZhou XZhou A(2021)Continuously Bulk Loading over Range Partitioned Tables for Large Scale Historical Data2021 IEEE 37th International Conference on Data Engineering (ICDE)10.1109/ICDE51399.2021.00088(960-971)Online publication date: Apr-2021
https://doi.org/10.1109/ICDE51399.2021.00088
Chen GVo HWu SOoi BÖzsu M(2020)A Framework for supporting DBMS-like indexes in the cloudProceedings of the VLDB Endowment10.14778/3402707.34027114:11(702-713)Online publication date: 3-Jun-2020
https://dl.acm.org/doi/10.14778/3402707.3402711
Cooper BNarayan PRamakrishnan RSrivastava USilberstein ABohannon PJacobsen HPuz NWeaver DYerneni R(2019)PNUTS to SherpaProceedings of the VLDB Endowment10.14778/3352063.335214612:12(2300-2307)Online publication date: 1-Aug-2019
https://dl.acm.org/doi/10.14778/3352063.3352146
Kieseberg PSchrittwieser SWeippl E(2019)Fingerprinting using database steganography2019 International Conference on Software Security and Assurance (ICSSA)10.1109/ICSSA48308.2019.00009(16-20)Online publication date: Jul-2019
https://doi.org/10.1109/ICSSA48308.2019.00009
Wang LCai RFu THe JLu ZWinslett MZhang Z(2018)Waterwheel: Realtime Indexing and Temporal Range Query Processing over Massive Data Streams2018 IEEE 34th International Conference on Data Engineering (ICDE)10.1109/ICDE.2018.00033(269-280)Online publication date: Apr-2018
https://doi.org/10.1109/ICDE.2018.00033
Cai RLu ZWang LZhang ZFur TWinslett M(2017)DITIRProceedings of the VLDB Endowment10.14778/3137765.313779510:12(1865-1868)Online publication date: 1-Aug-2017
https://dl.acm.org/doi/10.14778/3137765.3137795
Berber FYahyapour R(2017)A High-Performance Persistent Identifier Management Protocol2017 International Conference on Networking, Architecture, and Storage (NAS)10.1109/NAS.2017.8026839(1-10)Online publication date: Aug-2017
https://doi.org/10.1109/NAS.2017.8026839
Kachemir RKellett BBehara KAli MNewsam SRenz MTrajcevski GRavada S(2016)GloriaProceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/2996913.2997013(1-10)Online publication date: 31-Oct-2016
https://dl.acm.org/doi/10.1145/2996913.2997013
Dede ESendir BKuzlu PWeachock JGovindaraju MRamakrishnan L(2016)Processing Cassandra Datasets with Hadoop-Streaming Based ApproachesIEEE Transactions on Services Computing10.1109/TSC.2015.24448389:1(46-58)Online publication date: 1-Jan-2016
https://doi.org/10.1109/TSC.2015.2444838
Athanassoulis MChen SAilamaki AGibbons PStoica R(2015)Online Updates on Data Warehouses via Judicious Use of Solid-State StorageACM Transactions on Database Systems10.1145/269948440:1(1-42)Online publication date: 25-Mar-2015
https://dl.acm.org/doi/10.1145/2699484
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents