Scalability and Parallel Execution of OmpSs-OpenCL Tasks on Heterogeneous CPU-GPU Environment

Elangovan, Vinoth Krishnan; Badia, Rosa. M.; Ayguadé, Eduard

doi:10.1007/978-3-319-07518-1_9

Vinoth Krishnan Elangovan^18,19,
Rosa. M. Badia^18,20 &
Eduard Ayguadé^18,19

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8488))

Included in the following conference series:

International Supercomputing Conference

2794 Accesses
2 Citations

Abstract

With heterogeneous computing becoming mainstream, researchers and software vendors have been trying to exploit the best of the underlying architectures like GPUs or CPUs to enhance performance. Parallel programming models play a crucial role in achieving this enhancement. One such model is OpenCL, a parallel computing API for cross platform computations targeting heterogeneous architectures. However, OpenCL is a low-level programming language, therefore it can be time consuming to directly develop OpenCL code. To address this shortcoming, OpenCL has been integrated with OmpSs, a task-based programming model to provide abstraction to the user thereby reducing programmer effort. OmpSs-OpenCL programming model deals with a single OpenCL device either a CPU or a GPU. In this paper, we upgrade OmpSs-OpenCL programming model by supporting parallel execution of tasks across multiple CPU-GPU heterogeneous platforms. We discuss the design of the programming model along with its asynchronous runtime system. We investigated scalability of four OmpSs-OpenCL benchmarks across 4 GPUs gaining speedup of up to 4x. Further, in order to achieve effective utilization of the computing resources, we present static and work-stealing scheduling techniques. We show results of parallel execution of applications using OmpSs-OpenCL model and use heterogeneous workloads to evaluate our scheduling techniques on a heterogeneous CPU-GPU platform.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

OmpSs-OpenCL Programming Model for Heterogeneous Systems

Accelerating Scientific Applications on Heterogeneous Systems with HybridOMP

Enhancing MPI+OpenMP Task Based Applications for Heterogeneous Architectures with GPU Support

References

AMD. Amd sdk examples, http://developer.amd.com/tools-and-sdks/heterogeneous-computing/
Augonnet, C., Thibault, S., Namyst, R., Wacrenier, P.-A.: Starpu: a unified platform for task scheduling on heterogeneous multicore architectures. Concurrency and Computation: Practice and Experience 23(2), 187–198 (2011)
Article Google Scholar
Ayguadé, E., Badia, R.M., Igual, F.D., Labarta, J., Mayo, R., Quintana-Ortí, E.S.: An extension of the starss programming model for platforms with multiple gpus. In: Sips, H., Epema, D., Lin, H.-X. (eds.) Euro-Par 2009. LNCS, vol. 5704, pp. 851–862. Springer, Heidelberg (2009)
Chapter Google Scholar
Barcelona Supercomputing Center. The nanos group site: The mercurium compiler, http://nanos.ac.upc.edu/mcxx
Danalis, A., Marin, G., McCurdy, C., Meredith, J.S., Roth, P.C., Spafford, K., Tipparaju, V., Vetter, J.S.: The scalable heterogeneous computing (shoc) benchmark suite. In: Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, pp. 63–74. ACM (2010)
Google Scholar
Dolbeau, R., Bihan, S., Bodin, F.: Hmpp: A hybrid multi-core parallel programming environment. In: Workshop on General Purpose Processing on Graphics Processing Units (GPGPU 2007) (2007)
Google Scholar
Duran, A., Ayguadé, E., Badia, R.M., Labarta, J., Martinell, L., Martorell, X., Planas, J.: Ompss: a proposal for programming heterogeneous multi-core architectures. Parallel Processing Letters 21(2), 173–193 (2011)
Article MathSciNet Google Scholar
Elangovan, V.K., Badia, R.M., Parra, E.A.: OmpSs-openCL programming model for heterogeneous systems. In: Kasahara, H., Kimura, K. (eds.) LCPC 2012. LNCS, vol. 7760, pp. 96–111. Springer, Heidelberg (2013)
Chapter Google Scholar
Gregg, C., Hazelwood, K.: Where is the data? why you cannot debate cpu vs. gpu performance without the answer. In: 2011 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 134–144. IEEE (2011)
Google Scholar
Grewe, D., Wang, Z., O’Boyle, M.F.P.: Opencl task partitioning in the presence of gpu contention
Google Scholar
Khronos OpenCL Working Group et al.: The opencl specification. In: Munshi, A. (ed.) (2008)
Google Scholar
OpenACC Working Group et al. The openacc application programming interface (2011)
Google Scholar
Kim, J., Seo, S., Lee, J., Nah, J., Jo, G., Lee, J.: Snucl: an opencl framework for heterogeneous cpu/gpu clusters. In: Proceedings of the 26th ACM International Conference on Supercomputing, pp. 341–352. ACM (2012)
Google Scholar
McCalpin, J.D.: A survey of memory bandwidth and machine balance in current high performance computers. IEEE TCCA Newsletter, 19–25 (1995)
Google Scholar
Munshi, A., Gaster, B., Mattson, T.G., Ginsburg, D.: OpenCL programming guide. Pearson Education (2011)
Google Scholar
CUDA Nvidia. Programming guide (2008)
Google Scholar
Webber, R.: Clutil-making opencl as easy to use as cuda (website)
Google Scholar

Download references

Author information

Authors and Affiliations

Barcelona Supercomputing Center, Spain
Vinoth Krishnan Elangovan, Rosa. M. Badia & Eduard Ayguadé
Universitat Politècnica de Catalunya, Spain
Vinoth Krishnan Elangovan & Eduard Ayguadé
Artificial Intelligence Research Institute (IIIA), Spanish National Research Council (CSIC), Spain
Rosa. M. Badia

Authors

Vinoth Krishnan Elangovan
View author publications
You can also search for this author in PubMed Google Scholar
Rosa. M. Badia
View author publications
You can also search for this author in PubMed Google Scholar
Eduard Ayguadé
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

MIN Faculty, Department of Informatics Scientific Computing, University of Hamburg, Bundestraße 45a, 20146, Hamburg, Germany
Julian Martin Kunkel
Deutsches Klimarechenzentrum, Bundesstraße 45a, 20146, Hamburg, Germany
Thomas Ludwig
Germany and Prometeus GmbH, University of Mannheim, Fliederstraße 2, 74915, Waibstadt, Germany
Hans Werner Meuer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Elangovan, V.K., Badia, R.M., Ayguadé, E. (2014). Scalability and Parallel Execution of OmpSs-OpenCL Tasks on Heterogeneous CPU-GPU Environment. In: Kunkel, J.M., Ludwig, T., Meuer, H.W. (eds) Supercomputing. ISC 2014. Lecture Notes in Computer Science, vol 8488. Springer, Cham. https://doi.org/10.1007/978-3-319-07518-1_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-07518-1_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07517-4
Online ISBN: 978-3-319-07518-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Scalability and Parallel Execution of OmpSs-OpenCL Tasks on Heterogeneous CPU-GPU Environment

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

OmpSs-OpenCL Programming Model for Heterogeneous Systems

Accelerating Scientific Applications on Heterogeneous Systems with HybridOMP

Enhancing MPI+OpenMP Task Based Applications for Heterogeneous Architectures with GPU Support

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Scalability and Parallel Execution of OmpSs-OpenCL Tasks on Heterogeneous CPU-GPU Environment

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

OmpSs-OpenCL Programming Model for Heterogeneous Systems

Accelerating Scientific Applications on Heterogeneous Systems with HybridOMP

Enhancing MPI+OpenMP Task Based Applications for Heterogeneous Architectures with GPU Support

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation