Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2851613.2851741acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

Collaborative processing of data-intensive algorithms with CPU, intelligent SSD, and GPU

Published: 04 April 2016 Publication History

Abstract

The graphic processing unit (GPU) is a computing resource to process graphics-related applications. The intelligent SSD (iSSD) is a solid state device (SSD) that is provided with data processing power. These days, CPU, GPU, and SSD are equipped together in most processing environment. If SSD is replaced with iSSD later on, we have a new processing environment where three computing resources collaborate one another to process a huge volume of data (so called big data) quite effectively. In this paper, we address how to exploit all these computing resources for efficient processing of data-intensive algorithms.Through extensive experiment, we verify the effectiveness and potential of the proposed collaborative processing environment by processing data concurrently with multiple computing resources. The results reveal that processing in the our environment outperforms that in the traditional one by up to 3.5 times.

References

[1]
D. Bae et al., "Intelligent SSD: A Turbo for Big Data Mining," In Proc. of ACM Int'l Conf. on Information and Knowledge Management, ACM CIKM, pp. 1553--1556, 2013.
[2]
N. Gov et al., "GPUTeraSort: High Performance Graphics Coprocessor Sorting for Large Database Management," In Proc. ACM Int'l. Conf. on Management of Data, ACM SIGMOD, pp. 325--336, 2006.
[3]
J. Fung and S. Mann, "Using Graphics Devices in Reverse: GPU-based Image Processing and Computer Vision," In Proc. IEEE Int'l Conf. Multimedia and Expo, pp. 9--12, 2008.
[4]
S. Ryoo et al., "Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA," In Proc. ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, PPoPP, pp. 73--82, 2008
[5]
S. Kim et al., "Fast, Energy Efficient Scan inside Flash Memory SSDs," In Proc. Int'l Workshop on Accelerating Data Management Systems using Modern Processor and Storage Architectures, ADMS, 2011.
[6]
Y. Jo et al., "On Running Data-Intensive Algorithms with Intelligent SSD and Host CPU: A Collaborative Approach," In Proc. Int'l Conf. on ACM/SIGAPP Symposium On Applied Computing, ACM SAC, pp. 2060--2065, 2015.
[7]
S. Pabst, A. Koch, and W. Straber, "Fast and Scalable CPU=GPU Collision Detection for Rigid and Deformable Surfaces," Computer Graphics Forum, Vol. 29, No. 5, pp. 1605--1612, 2010.
[8]
H. Oh and S. Ha, "A Static Scheduling Heuristic for Heterogeneous Processors," In Proc. Int'l Conf. Euro-Par Parallel Processing, pp. 573--577, 1996.
[9]
H. Topcuoglu, S. Hariri and M. Wu, "Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing," IEEE Transactions on Parallel and Distributed Systems, Vol. 3, No. 3, pp. 260--274. 2002.
[10]
G. Sih and E. Lee, "A Compile-Time Scheduling Heuristic for Interconnection-Constrained Heterogeneous Processor Architectures," IEEE Transactions on Parallel and Distributed Systems, Vol. 4, No. 2, pp. 175--187. 1993.
[11]
Y. Kwok and I. Ahmad, "Static Scheduling Algorithms for Allocating Directed Task Graphs to Multiprocessors," ACM Computing Surveys, Vol. 31, No. 4, 1999.
[12]
N. Bell and M. Garland, Efficient Sparse Matrix-Vector Multiplication on CUDA, NVIDIA Technical Report, NVIDIA Corporation, 2008.
[13]
E. Lee and D. Messerschmitt, "Synchronous data flow," Proceedings of the IEEE, Vol. 75, No. 9, pp. 1235--1245, 1987.
[14]
J. MacQueen et al., "Some Methods for Classification and Analysis of Multivariate Observations," In Proc. of Berkeley Symp. on Mathematical Statistics and Probability, pp. 281--297, 1967.
[15]
L. Page et al., The PageRank Citation Ranking: Bringing Order to the Web, Technical Report, Stanford University, 1999.
[16]
G. Jeh and J. Widom, "SimRank: a measure of structural-context similarity," In Proc. of ACM Int'l. Conf. on Knowledge discovery and data mining, ACM SIGKDD, pp. 538--543, 2002.
[17]
Intel, Intel VTune Amplifier, https://software.intel.com/en-us/node/529213, 2014.

Cited By

View all
  • (2022)GIRAF: General Purpose In-Storage Resistive Associative FrameworkIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2021.306544833:2(276-287)Online publication date: 1-Feb-2022
  • (2019)Computational storage: an efficient and scalable platform for big data and HPC applicationsJournal of Big Data10.1186/s40537-019-0265-56:1Online publication date: 15-Nov-2019
  • (2018)PRINS: Processing-in-Storage Acceleration of Machine LearningIEEE Transactions on Nanotechnology10.1109/TNANO.2018.279987217:5(889-896)Online publication date: Sep-2018
  • Show More Cited By

Index Terms

  1. Collaborative processing of data-intensive algorithms with CPU, intelligent SSD, and GPU

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SAC '16: Proceedings of the 31st Annual ACM Symposium on Applied Computing
    April 2016
    2360 pages
    ISBN:9781450337397
    DOI:10.1145/2851613
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 April 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. GPU
    2. SSD
    3. collaborative processing
    4. heterogeneous
    5. scheduling

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SAC 2016
    Sponsor:
    SAC 2016: Symposium on Applied Computing
    April 4 - 8, 2016
    Pisa, Italy

    Acceptance Rates

    SAC '16 Paper Acceptance Rate 252 of 1,047 submissions, 24%;
    Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

    Upcoming Conference

    SAC '25
    The 40th ACM/SIGAPP Symposium on Applied Computing
    March 31 - April 4, 2025
    Catania , Italy

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 23 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)GIRAF: General Purpose In-Storage Resistive Associative FrameworkIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2021.306544833:2(276-287)Online publication date: 1-Feb-2022
    • (2019)Computational storage: an efficient and scalable platform for big data and HPC applicationsJournal of Big Data10.1186/s40537-019-0265-56:1Online publication date: 15-Nov-2019
    • (2018)PRINS: Processing-in-Storage Acceleration of Machine LearningIEEE Transactions on Nanotechnology10.1109/TNANO.2018.279987217:5(889-896)Online publication date: Sep-2018
    • (2017)From Processing-in-Memory to Processing-in-StorageSupercomputing Frontiers and Innovations: an International Journal10.14529/jsfi1703074:3(99-116)Online publication date: 15-Sep-2017

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media