Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/191839.191884acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article
Free access

AlphaSort: a RISC machine sort

Published: 24 May 1994 Publication History

Abstract

A new sort algorithm, called AlphaSort, demonstrates that commodity processors and disks can handle commercial batch workloads. Using Alpha AXP processors, commodity memory, and arrays of SCSI disks, AlphaSort runs the industry-standard sort benchmark in seven seconds. This beats the best published record on a 32-cpu 32-disk Hypercube by 8:1. On another benchmark, AlphaSort sorted more than a gigabyte in a minute.
AlphaSort is a cache-sensitive memory-intensive sort algorithm. It uses file striping to get high disk bandwidth. It uses QuickSort to generate runs and uses replacement-selection to merge the runs. It uses shared memory multiprocessors to break the sort into subsort chores.
Because startup times are becoming a significant part of the total time, we propose two new benchmarks: (1) Minutesort: how much can you sort in a minute, and (2) DollarSort: how much can you sort for a dollar.

References

[1]
Anon-Et-A1. (1985). "A Measure of Transaction Processing Power." Datamation. V.31(7): PP. 112-118. also in Readings in Database Systems, M.J. Stonebraker ed., Morgan Kaufmann, San Mateo, 1989.
[2]
Baer, J.L., Lin, Y.B., "Improving Quicksort Performance with Codeword Data Structure", IEEE Trans. on Software Engineering, 15(5). May 1989. pp. 622-631.
[3]
Baugsto, B.A.W., Greipsland, J.F., "Parallel Sorting Methods for Large Data Volumes on a Hypercube Database Computer", Proc. 6th int. Workshop on Database Machines, Deauville France, Springer Verlag Lecture Notes No. 368, June 1989, pp.: 126-141.
[4]
Baugsto, B.A.W., Greipsland, J.F., Kamerbeek, J. "Sorting Large Data Files on POMA," Proc. CONPAR-90VAPP IV, Springer Verlag Lecture Notes No. 357, Sept. 1990, pp.: 536- 547.
[5]
Cvetanovic, Z., D. Bhandarkar, "Characterization of Alpha AMP Performance Using TP and SPEC Workloads", to appear in Proc. Int.Symposium on Computer Architecture, April 1994.
[6]
Bitton, D., Design, Analysis and Implementation of Parallel External Sorting Algorithms, Ph.D. Thesis, U. Wisconsin, Madison, WI, 1981
[7]
Beck, M., Bitton, D., Wilkenson, W.K., "Sorting Large Files on a Backend Multiprocessor", IEEE Transactions on Computers, V. 37(7), pp. 769-778, July 1988.
[8]
Conner, W.M., Offset Value Coding, IBM Technical Disclosure Bulletin, V 20(7), Dec. 1977, pp. 2832-2837
[9]
DeWitt, D.J., Naughton, J.F., Schneider, D.A. "Parallel Sorting on a Shared-Nothing Architecture Using Probabilistic Splitting", Proc. First Int Conf. on Parallel and Distributed Info Systems, IEEE Press, Jan 1992, pp. 280-291
[10]
Filgate, Bruce, "SCSI 3.5" 1.05 GB Disk Comparative Performance", Digital Storage Labs, Nov. 10 1992
[11]
Graefe, G., "Parallel external sorting in Volcano," U. Colorado Comp. Sci. Tech. Report 459, June 1990.
[12]
Graefe, G, S.S. Thakkar, "Tuning a Parallel Sort Algorithm on a Shared-Memory Multiprocessor", Software Practice and Experience, 22(7), July 1992, pp. 495.
[13]
Gray, J. (ed.), The Benchmark Handbook for Database and Transaction Processing Systems, Morgan Kaufmann, San Mateo, 1991.
[14]
Kaivalya, D., The SPEC Benchmark Suite, Chapter 6 of The Benchmark Handbook for Database and Transaction Processing Systems, Second Edition, Chapter 6, Morgan Kaufmann, San Mateo, 1993.
[15]
Kitsuregawa, M., Yang, W., Fushimi, S. "Evaluation of an 18- stage Pipeline Hardware Sorter", Proc. 6th Int. Workshop on Database Machines, Deauville France, Springer Verlag Lecture Notes No. 368, June 1989, pp. 142-155.
[16]
Kim. M.Y., "Synchronized Disk Interleaving," IEEE TOCS, V. 35(11), Nov. 1986, pp978-988.
[17]
Knuth, E.E., Sorting and Searching, The Art of Computer Programming, Addison Wesley, Reading, Ma., 1973.
[18]
Lorie, R.A., and Young, H. C., "A Low Communications Sort Algorithm for a Parallel Database Machine," Proc. Fifteenth VLDB, Amsterdam, 1989, pp. 125-134.
[19]
Lorin, H. Sorting, Addison Wesley, Englewood Cliffs, NJ, 1974.
[20]
Salzberg, B., et al., "FastSort- An External Sort Using Parallel Processing", Proc. SIGMOD 1990, pp. 88-101.
[21]
Tsukerman, A., "FastSort- An External Sort Using Parallel Processing" Tandem Systems Review, V 3(4), Dec. 1986, pp. 57-72.
[22]
Weinberger, PJ., Private communication 1986.
[23]
Yamane, Y., Take, R. "Parallel Partition Sort for Database Machines", Database Machines and Knowledge Based Machines, Kitsuregawa and Tanaka eds., pp.: 1117-130. Klwar Academic Publishers, 1988.

Cited By

View all
  • (2022)The benefits of general-purpose on-NIC memoryProceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3503222.3507711(1130-1147)Online publication date: 28-Feb-2022
  • (2022)KVSTL: An Application Support to LSM-Tree Based Key-Value Store via Shingled Translation Layer Data ManagementIEEE Transactions on Computers10.1109/TC.2021.309896171:7(1598-1611)Online publication date: 1-Jul-2022
  • (2020)XIndexProceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3332466.3374547(308-320)Online publication date: 19-Feb-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '94: Proceedings of the 1994 ACM SIGMOD international conference on Management of data
May 1994
525 pages
ISBN:0897916395
DOI:10.1145/191839
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 May 1994

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

SIGMOD/PODS94

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)166
  • Downloads (Last 6 weeks)20
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2022)The benefits of general-purpose on-NIC memoryProceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3503222.3507711(1130-1147)Online publication date: 28-Feb-2022
  • (2022)KVSTL: An Application Support to LSM-Tree Based Key-Value Store via Shingled Translation Layer Data ManagementIEEE Transactions on Computers10.1109/TC.2021.309896171:7(1598-1611)Online publication date: 1-Jul-2022
  • (2020)XIndexProceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3332466.3374547(308-320)Online publication date: 19-Feb-2020
  • (2020)External Merge Sort for Top-K QueriesProceedings of the 2020 ACM SIGMOD International Conference on Management of Data10.1145/3318464.3389729(2423-2437)Online publication date: 11-Jun-2020
  • (2019)Improved Programming-Language Independent MapReduce on Shared-Memory SystemsBig Data Analytics and Knowledge Discovery10.1007/978-3-030-27520-4_15(206-220)Online publication date: 26-Aug-2019
  • (2017)Building Efficient Key-Value Stores via a Lightweight Compaction TreeACM Transactions on Storage10.1145/313992213:4(1-28)Online publication date: 24-Nov-2017
  • (2017)WiscKeyACM Transactions on Storage10.1145/303327313:1(1-28)Online publication date: 2-Mar-2017
  • (2016)WiscKeyProceedings of the 14th Usenix Conference on File and Storage Technologies10.5555/2930583.2930594(133-148)Online publication date: 22-Feb-2016
  • (2015)Optimizing Sort in Hadoop Using Replacement SelectionAdvances in Databases and Information Systems10.1007/978-3-319-23135-8_25(365-379)Online publication date: 15-Aug-2015
  • (2014)Ultra-low-leakage chip multiprocessor design with hybrid FinFET logic stylesACM Journal on Emerging Technologies in Computing Systems10.1145/262957611:1(1-16)Online publication date: 6-Oct-2014
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media