Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/IPDPS.2011.129guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A Waterfall Model to Achieve Energy Efficient Tasks Mapping for Large Scale GPU Clusters

Published: 16 May 2011 Publication History

Abstract

High energy consumption has become a critical problem for supercomputer systems. GPU clusters are becoming an increasingly popular architecture for building supercomputers because of its great improvement in performance. In this paper, we first formulate the tasks mapping problem as a mini-mal energy consumption problem with deadline constraint. Its optimizing object is very different from the traditional mapping problem which often aims at minimizing make span or minimizing response time. Then a Waterfall Energy Consumption Model, which abstracts the energy consumption of one GPU cluster system into several levels from high to low, is proposed to achieve an energy efficient tasks mapping for large scale GPU clusters. Based on our Waterfall Model, a new task mapping algorithm is developed which tries to apply different energy saving strategies to keep the system remaining at lower energy levels. Our mapping algorithm adopts the Dynamic Voltage Scaling, Dynamic Resource Scaling and $beta$-migration for GPU sub-task to significantly reduce the energy consumption and achieve a better load balance for GPU clusters. A task generator based on the real task traces is developed and the simulation results show that our mapping algorithm based on the Waterfall Model can reduce nearly 50% energy consumption compared with traditional approaches which can only run at a high energy level. Not only the task deadline can be satisfied, but also the task execution time of our mapping algorithm can be reduced.

Cited By

View all
  • (2021)Characterization and prediction of deep learning workloads in large-scale GPU datacentersProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3458817.3476223(1-15)Online publication date: 14-Nov-2021
  • (2020)A Taxonomy and Survey of Power Models and Power Modeling for Cloud ServersACM Computing Surveys10.1145/340620853:5(1-41)Online publication date: 28-Sep-2020
  • (2019)GreenMMProceedings of the ACM International Conference on Supercomputing10.1145/3330345.3330373(308-318)Online publication date: 26-Jun-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
IPDPSW '11: Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum
May 2011
2107 pages
ISBN:9780769545776

Publisher

IEEE Computer Society

United States

Publication History

Published: 16 May 2011

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Characterization and prediction of deep learning workloads in large-scale GPU datacentersProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3458817.3476223(1-15)Online publication date: 14-Nov-2021
  • (2020)A Taxonomy and Survey of Power Models and Power Modeling for Cloud ServersACM Computing Surveys10.1145/340620853:5(1-41)Online publication date: 28-Sep-2020
  • (2019)GreenMMProceedings of the ACM International Conference on Supercomputing10.1145/3330345.3330373(308-318)Online publication date: 26-Jun-2019
  • (2019)A real-time and reliable dynamic migration model for concurrent taskflow in a GPU clusterCluster Computing10.1007/s10586-018-2866-822:2(585-599)Online publication date: 1-Jun-2019
  • (2018)A collaborative CPU–GPU approach for principal component analysis on mobile heterogeneous platformsJournal of Parallel and Distributed Computing10.1016/j.jpdc.2018.05.006120:C(44-61)Online publication date: 1-Oct-2018
  • (2017)Energy Efficient Job Scheduling with DVFS for CPU-GPU Heterogeneous SystemsProceedings of the Eighth International Conference on Future Energy Systems10.1145/3077839.3077855(1-11)Online publication date: 16-May-2017
  • (2016)Ai BCSMicroprocessors & Microsystems10.1016/j.micpro.2016.05.00847:PA(121-132)Online publication date: 1-Nov-2016
  • (2016)Green computing on graphics processing unitsConcurrency and Computation: Practice & Experience10.1002/cpe.369228:16(4305-4325)Online publication date: 1-Nov-2016
  • (2015)A Survey of CPU-GPU Heterogeneous Computing TechniquesACM Computing Surveys10.1145/278839647:4(1-35)Online publication date: 21-Jul-2015
  • (2014)A Survey of Methods for Analyzing and Improving GPU Energy EfficiencyACM Computing Surveys10.1145/263634247:2(1-23)Online publication date: 25-Aug-2014
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media