Energy-Aware Scheduling for High-Performance Computing Systems: A Survey
Abstract
:1. Introduction
- types of systems in the context of their heterogeneity as well as compute device types including multi- and many-core CPUs and accelerators such as GPUs [1];
2. Related Review Works
3. Problem Formulation
3.1. Optimization Goals
3.2. Mechanisms Allowing Performance Energy Configurations
3.2.1. DPM
3.2.2. DVFS
3.2.3. Power Capping
3.3. Energy-Aware Scheduling for Particular System Types
3.3.1. CPU
3.3.2. GPU
3.3.3. Heterogeneous Environments
3.4. Time and Power/Energy Measurements
4. Energy-Aware Scheduling Algorithms
4.1. Randomized Algorithms
4.2. Machine Learning
4.3. Dynamic Programming
4.4. Fuzzy Logic
4.5. Integer Programming
4.6. Evolutionary Algorithms
4.7. Constraint Programming
4.8. Other Algorithms
Algorithm Type | Algorithm | Optimization Goals/Metrics (O—Optimized, C—Constrained) | System Type | Energy-Aware Mechanism | Workload Type |
---|---|---|---|---|---|
Randomized algorithms | UEJS with H-PSO [69] | EC (O), ExecT (C), Utilization (C) | Heterogeneous cluster | Independent tasks | |
Particle Swarm optimized greedy algorithm [70] | EDP (O) | Homogeneous cluster | Independent tasks | ||
LP relaxation [71] | Average weighted completion time (O), EC (O, C) | Heterogeneous cluster | Independent tasks | ||
Machine learning | RL based scheduler [72] | EC (O), Weighted ExecT (O) | Homogeneous cluster | Independent tasks | |
ML approach based on supervised learning [46] | EC (O), Machines usage (O), ExecT (C) | Heterogeneous cluster | Independent tasks | ||
DRL [73] | EC (O) | Heterogeneous cluster | Independent tasks | ||
ML Classifiers [74] | Energy efficiency (amount of work completed per unit of energy) (O), Power (C) | Homogeneous cluster | DVFS | Single application | |
Scheduling-based Power Capping using CP and ML [49] | Wait time (O), ExecT (O), Power (C) | Heterogeneous cluster | Independent tasks | ||
Dynamic programming | Accelerated Search [77] | EC (O), Probability of execution (C), ExecT (C) | Heterogeneous multicore | DVFS | Workflow |
Fuzzy logic | Important inherent program analysis [78] | Branch Transition Rate (O), Cache efficiency (O), Issue width (O) | Heterogeneous multicore | Independent tasks | |
Integer programming | Search space design for search unrestricted, crown, bookshelf, and pipe schedulers [81] | Complex objective function (ExecT, number of cores, frequency) (O), ExecT (C) | Homogeneous multicore | DVFS | Independent tasks |
MaxJobPerf [82] | Wait time (O), Frequency (O), EC (C) | Homogeneous cluster | DVFS | Independent tasks | |
XInt-SQP [83] | Cooling cost (O, C) | Homogeneous cluster | Independent tasks, Online scheduling | ||
HILP [80] | Utilization (O), ExecT (O, C), EDF (O) | Homogeneous multicore/cluster | DVFS | Independent tasks | |
RNRA, RIRA [143] | EC (O) | Heterogeneous cluster | DVFS | Independent tasks | |
Evolutionary algorithms | XInt-GA [83] | Cooling cost (O, C) | Homogeneous cluster | Independent tasks, Online scheduling | |
Plain GA [84] | EC (O) | Heterogeneous cluster | Workflow | ||
Plain GA, CA + GA [84] | EC (O) | Heterogeneous cluster | Workflow | ||
Parallel bi-objective hybrid genetic algorithm. [85] | Pareto front (O) | Heterogeneous virtualized cluster | DVFS | Workflow | |
NSGA-II, MOCell, IBEA [87] | Pareto front (O) | Heterogeneous cluster | DVFS | Workflow | |
GA with elitist or struggle replacement mechanisms [91] | EC (O), ExecT (O) | Heterogeneous computational grid | DVFS | Independent tasks | |
1pX-W, OX-W, MX-W, and noX-W GA with power constraints. [92] | ExecT (O), Power (C) | Homogeneous multicore | Independent tasks | ||
Constraint programming | Hybrid dispatcher [49] | Power (C) | Heterogeneous cluster | Independent tasks | |
Other | Power-aware scheduler [95] | Power (C) | Heterogeneous cluster | Independent tasks | |
EAH [84] | EC (O) | Heterogeneous cluster | Workflow | ||
S-PSM, DPM [94] | EC (O), ExecT (C) | Homogeneous cluster | DVFS | Workflow | |
ETF [96] | EC (O), ExecT (C) | Homogeneous cluster | DVFS | Workflow | |
20 algorithms based on list heuristics [97] | EC (O), ExecT (O) | Heterogeneous computational grid | Independent tasks | ||
Greedy algorithm for knapsack problem with power constraints [28] | ExecT (O), Power (C) | Heterogeneous cluster | Single application | ||
Heuristics with continuous frequency scaling [98] | EC (O), ExecT (O, C), Reliability (O, C) | Homogeneous cluster | DVFS | Workflow | |
Greedy policy, 0-1 knapsack policy [99] | Electricity cost (bills) (O) | Homogeneous cluster | Independent tasks | ||
Prediction and planning with a regression model. [138] | EDP (O) | Heterogeneous multicore | Online scheduling | ||
PRB [49] | ExecT (O) | Heterogeneous cluster | Independent tasks | ||
E-FIFO, E-BFF, E-BBF [111] | EC (O), ExecT (C) | Homogeneous cluster | DPM | Independent tasks | |
EMRSA, EMRSA-I, EMRSA-II [112,113] | EC (O), ExecT (C) | Heterogeneous cluster | MapReduce jobs | ||
EDL [114] | EC, ExecT (C) | Heterogeneous clusters | DVFS | Independent tasks, Online scheduling | |
Extended EAS [115] | EC (O), ExecT (O) | Heterogeneous clusters | Independent tasks | ||
EAMM [102] | EC (O), ExecT (O) | Heterogeneous cluster | Independent tasks | ||
EAMD [105] | EC (O), ExecT (O) | Heterogeneous cluster | Workflow | ||
CPU/GPU partitioning [100] | EC (O), EDP (O) | CPU + GPU node | Independent tasks | ||
MMF-DVFS [16] | ExecT (C), EC (O) | Heterogeneous cluster | DVFS | Workflow | |
EASLA [117], Improved EASLA [118] | EC (O), ExecT (C) | Heterogeneous clusters | DVFS | Workflow | |
QHA [134] | EC (O, C), ExecT (O, C) | Heterogeneous cluster | DVFS | Workflow | |
EADAGS [119] | EC (O), ExecT (O) | Heterogeneous cluster | DVFS | Workflow | |
eFLS [121] | EC (O), ExecT (C) | Heterogeneous cluster | DVFS | Workflow | |
VHEST, EASA [122] | Utilization (O), ExecT (O) | Homogeneous virtualized cluster | Workflow | ||
EDLS [126] | EC (O), ExecT (O) | Heterogeneous cluster | DVFS | Workflow | |
LESA [125] | ExecT (O), EC (C) | Heterogeneous cluster | DVFS | Workflow | |
Spatio-temporal thermal-aware scheduling [128] | ExecT (O), Temperature (C) | Homogeneous cluster | DVFS | Independent tasks, Online scheduling | |
PAAS [129] | EC (O), ExecT (O) | Homogeneous cluster | DVFS | Independent tasks | |
EED, EEND [109] | EC (O), ExecT (C) | Homogeneous cluster | DVFS | Workflow | |
RSMECC [130] | AST (O), AFT (O), ExecT (O), EC (C) | Heterogeneous cluster | DVFS | Workflow | |
ECS [86], ECS + idle [131] | EC (O), ExecT (O) | Heterogeneous cluster | DVFS | Workflow | |
ESPA [133] | EC (O), ExecT (O) | Heterogeneous cluster | DVFS, DPM | Workflow | |
GACSM [135] | EC (O), ExecT (C) | Heterogeneous cluster | DVFS | Workflow | |
LS [124] | EDP (O), EC (O, C), ExecT (O, C) | Homogeneous cluster | Independent tasks | ||
ESTS [136] | EC (O, C), ExecT (O, C) | Heterogeneous cluster | DVFS | Independent tasks | |
EAD, PEBD [108] | EC (O), ExecT (O) | Homogeneous cluster | Workflow | ||
Phase_EDP [139] | EDP (O) | Heterogeneous multicore | Independent tasks, Online scheduling | ||
ALEPH [140] | Pareto front (O) | Homogeneous cluster | DVFS | Single application | |
HEPOPTA [141] | Pareto front (O) | Heterogeneous processors | Single application | ||
LBOPA-TE [142] | Pareto front (O) | Heterogeneous processors | Single application |
5. Conclusions, Open Problems, and Areas for Further Research
- We conclude that there is a variety of problem formulations and corresponding algorithm types that tackle the problem of energy-aware scheduling for HPC systems, including machine learning (with reinforcement and supervised learning), dynamic programming, fuzzy logic, integer programming, randomized algorithms, evolutionary algorithms, constraint programming, and others.
- Most optimization goals involve metrics such as execution time/makespan, energy, and power, either in functions such as EDP or EDS or optimizing some while putting constraints on others, e.g., minimization of execution time under power limit. A limited number of works specifically consider cooling costs and temperatures.
- Application models include mostly a bag or stream of incoming independent or periodic tasks or a workflow (DAG) composed of tasks in nodes of the graph with edges denoting dependencies.
- System types targeted include mostly heterogeneous but also homogeneous clusters and homogeneous and heterogeneous multicore environments.
- Most works use DFVS as a mechanism for controlling the power/energy of compute devices, some combine DVFS and DPM (including turning off machines), and a limited number of works use explicit power capping.
- Which optimization goals shall be considered for what purposes e.g., consideration of EDP and EDS as ones involving time and energy (relative coefficients such as in EDS might depend on current electricity costs), pure energy or in some cases or areas minimization of execution time under power limit (across time) seems to become more important due to the risk of blackouts, etc.
- Analysis of the impact of frequency of data monitoring on both accuracy (specifically referring to power/energy monitoring) as well as consideration of parameters applicable to data (power, load, etc.) filtering (such as low pass filters [9]) such as running averages which impacts the latency of energy-aware monitoring and correspondingly the scheduling algorithm vs. the possibility to deal with highly changing application and/or system load.
- Following the discussion on measurement accuracy in Section 3.4, the accuracy of APIs such as Intel RAPL and NVIDIA NVML requires constant assessment in view of new generations of CPUs and GPUs and new APIs’ versions. Additionally, another topic for investigation is use cases and conditions (possibly involving usage of data filtering) for which the aforementioned APIs provide reliable results compared to ground truth hardware meters.
- Some works present other approaches to energy-/power-aware aspects of the computations, such as system efficiency or thermal-awareness in scheduling [14]. It is important to consider and analyze power under load for parallel applications by components such as fans and other cooling components, power supplies, and power distribution units (PDU) depending on their types and classes. Moreover, further consideration of temperatures, along with performance and power/energy consumption, in the context of power required for cooling/air conditioning can have a significant influence on HPC systems. Specifically, power capping might affect not only execution times and power/application energy consumption but can lower the temperature.
- The problem of finding (hyper)parameters for auto-tuning of algorithms for energy-aware scheduling. This might involve parameters of the scheduling algorithm, i.e., how to find parameters for algorithms finding desired performance-energy configurations automatically such as monitoring/tuning windows [10] but also system and application parameters [144] for thorough optimization of a run.
- Consideration of (mixed) precision vs. energy and (mixed) precision vs. performance combined with energy trade-offs. This is especially important for ML applications deployed in a GPU environment, where single or even half float precision can be used to deliver reasonable results and the underlying hardware can provide a significant boost to the performance and/or energy efficiency.
- The testbed environments, especially for HPC, where the hardware technologies are extremely advanced, and therefore expensive, often cannot be easily used for experiments. Thus, for existing simulators that consider performance and energy during scheduling—such as MERPSYS [145]—energy and time accuracy vs. simulation time functions shall be developed.
- Several algorithms relatively recently deployed for scheduling optimization such as deep reinforcement learning [146] shall consider energy aspects in the future. The trend of replacing classical programming constructs with ML alternatives is spreading around all computer science areas [147], especially for problems requiring heuristics. Thus, it seems to be reasonable to assume that scheduling, and specifically its energy-aware version will be more and more supported by such an approach.
- Since the main technology used for controlling the power lever is DVFS, we see a need for consideration of power caps in energy-aware scheduling, which apart from frequency scaling, uses other complementary techniques, e.g., thread throttling. Similarly to DVFS, it can be extensively used for heterogeneous cluster systems such as CPU+GPU systems—both single nodes and clusters, in the context of the dynamic application of power caps. This is especially relevant as more and more hardware accelerators are being proposed [2].
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
HPC | High-Performance Computing |
ExecT | Execution Time |
EC | Energy Consumption |
DAG | Directed Acyclic Graph |
PpW | Performance per watt |
RPpW | Reference Performance per watt |
TGI | The Green Index |
REE | Relative Energy Efficiency |
EDP | Energy Delay Product |
EDS | Energy Delay Summation |
EDD | Energy Delay Distance |
GPU | Graphical Processing Unit |
CPU | Central Processing Unit |
DVFS | Dynamic Voltage and Frequency Scaling |
DVS | Dynamic Voltage Scaling |
DPM | Dynamic Power Management |
ML | Machine Learning |
RL | Reinforcement Learning |
CP | Constraint Programming |
GA | Genetic Algorithm |
ILP | Integer Linear Programming |
LP | Linear Programming |
(O) | Optimized |
(C) | Constrained |
References
- Czarnul, P. Parallel Programming for Modern High Performance Computing Systems; CRC Press: Boca Raton, FL, USA, 2018; ISBN 9781138305953. [Google Scholar]
- Dongarra, J. HPC: Where We Are Today and a Look into the Future; Parallel Processing and Applied Mathematics, PPAM: Gdansk, Poland, 2022. [Google Scholar]
- Czarnul, P.; Proficz, J.; Krzywaniak, A. Energy-Aware High-Performance Computing: Survey of State-of-the-Art Tools, Techniques, and Environments. Sci. Program. 2019, 2019, 1–19. [Google Scholar] [CrossRef] [Green Version]
- Subramaniam, B.; Feng, W.C. The Green Index: A Metric for Evaluating System-Wide Energy Efficiency in HPC Systems. In Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops PhD Forum, Shanghai, China, 21–25 May 2012; pp. 1007–1013. [Google Scholar] [CrossRef]
- Laros III, J.H.; Pedretti, K.; Kelly, S.M.; Shu, W.; Ferreira, K.; Vandyke, J.; Vaughan, C. Energy delay product. In Energy-Efficient High Performance Computing; Springer: Berlin/Heidelberg, Germany, 2013; pp. 51–55. [Google Scholar]
- Martin, A.J.; Nyström, M.; Pénzes, P.I. ET 2: A metric for time and energy efficiency of computation. In Power Aware Computing; Springer: Berlin/Heidelberg, Germany, 2002; pp. 293–315. [Google Scholar]
- Chandio, A.A.; Bilal, K.; Tziritas, N.; Yu, Z.; Jiang, Q.; Khan, S.U.; Xu, C.Z. A comparative study on resource allocation and energy efficient job scheduling strategies in large-scale parallel computing systems. Clust. Comput. 2014, 17, 1349–1367. [Google Scholar] [CrossRef]
- Sheikh, H.F.; Tan, H.; Ahmad, I.; Ranka, S.; Bv, P. Energy- and Performance-Aware Scheduling of Tasks on Parallel and Distributed Systems. J. Emerg. Technol. Comput. Syst. 2012, 8, 1–37. [Google Scholar] [CrossRef]
- Ilsche, T.; Schöne, R.; Schuchart, J.; Hackenberg, D.; Simon, M.; Georgiou, Y.; Nagel, W.E. Power measurement techniques for energy-efficient computing: Reconciling scalability, resolution, and accuracy. SICS Softw.-Intensive Cyber-Phys. Syst. 2019, 34, 45–52. [Google Scholar] [CrossRef]
- Krzywaniak, A.; Czarnul, P.; Proficz, J. DEPO: A dynamic energy-performance optimizer tool for automatic power capping for energy efficient high-performance computing. Softw. Pract. Exp. 2022, 52, 2598–2634. [Google Scholar] [CrossRef]
- Cai, C.; Wang, L.; Khan, S.U.; Tao, J. Energy-Aware High Performance Computing: A Taxonomy Study. In Proceedings of the 2011 IEEE 17th International Conference on Parallel and Distributed Systems, Tainan, Taiwan, 7–9 December 2011; pp. 953–958. [Google Scholar] [CrossRef] [Green Version]
- Benedict, S. Energy-aware performance analysis methodologies for HPC architectures—An exploratory study. J. Netw. Comput. Appl. 2012, 35, 1709–1719. [Google Scholar] [CrossRef]
- Maiterth, M.; Koenig, G.; Pedretti, K.; Jana, S.; Bates, N.; Borghesi, A.; Montoya, D.; Bartolini, A.; Puzovic, M. Energy and Power Aware Job Scheduling and Resource Management: Global Survey—Initial Analysis. In Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Vancouver, BC, Canada, 21–25 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 685–693. [Google Scholar] [CrossRef]
- Chaudhry, M.T.; Ling, T.C.; Manzoor, A.; Hussain, S.A.; Kim, J. Thermal-Aware Scheduling in Green Data Centers. ACM Comput. Surv. 2015, 47, 1–48. [Google Scholar] [CrossRef]
- Juarez, F.; Ejarque, J.; Badia, R.M. Dynamic energy-aware scheduling for parallel task-based application in cloud computing. Future Gener. Comput. Syst. 2018, 78, 257–271. [Google Scholar] [CrossRef] [Green Version]
- Rizvandi, N.B.; Taheri, J.; Zomaya, A.Y.; Lee, Y.C. Linear combinations of dvfs-enabled processor frequencies to modify the energy-aware scheduling algorithms. In Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, Melbourne, Australia, 17–20 May 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 388–397. [Google Scholar]
- Sinnen, O. Task Scheduling for Parallel Systems; John Wiley & Sons: Hoboken, NJ, USA, 2007. [Google Scholar]
- Kafil, M.; Ahmad, I. Optimal task assignment in heterogeneous distributed computing systems. IEEE Concurr. 1998, 6, 42–50. [Google Scholar] [CrossRef] [Green Version]
- Dorronsoro, B.; Pinel, F. Combining Machine Learning and Genetic Algorithms to Solve the Independent Tasks Scheduling Problem. In Proceedings of the 2017 3rd IEEE International Conference on Cybernetics (CYBCONF), Exeter, UK, 21–23 June 2017; pp. 1–8. [Google Scholar] [CrossRef]
- Pietri, I.; Sakellariou, R. Energy-Aware Workflow Scheduling Using Frequency Scaling. In Proceedings of the 2014 43rd International Conference on Parallel Processing Workshops, Minneapolis, MN, USA, 9–12 September 2014; pp. 104–113. [Google Scholar] [CrossRef]
- Topcuoglu, H.; Hariri, S.; Wu, M.Y. Task scheduling algorithms for heterogeneous processors. In Proceedings of the Eighth Heterogeneous Computing Workshop (HCW’99), San Juan, PR, USA, 12 April 1999; pp. 3–14. [Google Scholar] [CrossRef]
- Bhuiyan, A.; Guo, Z.; Saifullah, A.; Guan, N.; Xiong, H. Energy-Efficient Real-Time Scheduling of DAG Tasks. ACM Trans. Embed. Comput. Syst. 2018, 17, 1–25. [Google Scholar] [CrossRef]
- Bambagini, M.; Marinoni, M.; Aydin, H.; Buttazzo, G. Energy-Aware Scheduling for Real-Time Systems: A Survey. ACM Trans. Embed. Comput. Syst. 2016, 15, 1–34. [Google Scholar] [CrossRef]
- Zeng, Q.; Du, Y.; Huang, K.; Leung, K.K. Energy-Efficient Radio Resource Allocation for Federated Edge Learning. In Proceedings of the 2020 IEEE International Conference on Communications Workshops (ICC Workshops), Dublin, Ireland, 7–11 June 2020; pp. 1–6. [Google Scholar] [CrossRef]
- Ravi, V.T.; Becchi, M.; Jiang, W.; Agrawal, G.; Chakradhar, S. Scheduling concurrent applications on a cluster of cpu-gpu nodes. In Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), Ottawa, ON, Canada, 13–16 May 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 140–147. [Google Scholar]
- Kim, J.K.; Siegel, H.J.; Maciejewski, A.A.; Eigenmann, R. Dynamic Resource Management in Energy Constrained Heterogeneous Computing Systems Using Voltage Scaling. IEEE Trans. Parallel Distrib. Syst. 2008, 19, 1445–1457. [Google Scholar] [CrossRef] [Green Version]
- Xiao, Z.; Song, W.; Chen, Q. Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment. IEEE Trans. Parallel Distrib. Syst. 2013, 24, 1107–1117. [Google Scholar] [CrossRef]
- Czarnul, P.; Rościszewski, P. Optimization of Execution Time under Power Consumption Constraints in a Heterogeneous Parallel System with GPUs and CPUs. In Proceedings of the 15th International Conference on Distributed Computing and Networking (ICDCN), Coimbatore, India, 4–7 January 2014. [Google Scholar]
- Boiński, T.; Czarnul, P. Optimization of Data Assignment for Parallel Processing in a Hybrid Heterogeneous Environment Using Integer Linear Programming. Comput. J. 2021, 65, 1412–1433. [Google Scholar] [CrossRef]
- Kar, I.; Parida, R.R.; Das, H. Energy aware scheduling using genetic algorithm in cloud data centers. In Proceedings of the 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT), Chennai, India, 3–5 March 2016; pp. 3545–3550. [Google Scholar] [CrossRef]
- Koomey, J.; Berard, S.; Sanchez, M.; Wong, H. Implications of historical trends in the electrical efficiency of computing. IEEE Ann. Hist. Comput. 2010, 33, 46–54. [Google Scholar] [CrossRef]
- Abdulsalam, S.; Zong, Z.; Gu, Q.; Qiu, M. Using the Greenup, Powerup, and Speedup metrics to evaluate software energy efficiency. In Proceedings of the 2015 Sixth International Green and Sustainable Computing Conference (IGSC), Las Vegas, NV, USA, 14–16 December 2015; pp. 1–8. [Google Scholar] [CrossRef]
- Gonzalez, R.; Horowitz, M. Energy dissipation in general purpose microprocessors. IEEE J. Solid-State Circuits 1996, 31, 1277–1284. [Google Scholar] [CrossRef] [Green Version]
- Roberts, S.I.; Wright, S.A.; Fahmy, S.A.; Jarvis, S.A. Metrics for Energy-Aware Software Optimisation. In Proceedings of the High Performance Computing: 32nd International Conference, ISC High Performance 2017, Frankfurt, Germany, 18–22 June 2017; Springer: Berlin/Heidelberg, Germany, 2017; pp. 413–430. [Google Scholar] [CrossRef] [Green Version]
- Benini, L.; Bogliolo, A.; De Micheli, G. A survey of design techniques for system-level dynamic power management. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 2000, 8, 299–316. [Google Scholar] [CrossRef]
- Darwish, T.; Bayoumi, M. 5—Trends in Low-Power VLSI Design. In The Electrical Engineering Handbook; CHEN, W.K., Ed.; Academic Press: Burlington, MA, USA, 2005; pp. 263–280. [Google Scholar] [CrossRef]
- Safari, M.; Khorsand, R. Energy-aware scheduling algorithm for time-constrained workflow tasks in DVFS-enabled cloud environment. Simul. Model. Pract. Theory 2018, 87, 311–326. [Google Scholar] [CrossRef]
- Petoumenos, P.; Mukhanov, L.; Wang, Z.; Leather, H.; Nikolopoulos, D.S. Power capping: What works, what does not. In Proceedings of the 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS), Melbourne, Australia, 14–17 December 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 525–534. [Google Scholar]
- Tsuzuku, K.; Endo, T. Power capping of CPU-GPU heterogeneous systems using power and performance models. In Proceedings of the 2015 International Conference on Smart Cities and Green ICT Systems (SMARTGREENS), Lisbon, Portugal, 20–22 May 2015; pp. 1–8. [Google Scholar]
- Komoda, T.; Hayashi, S.; Nakada, T.; Miwa, S.; Nakamura, H. Power capping of CPU-GPU heterogeneous systems through coordinating DVFS and task mapping. In Proceedings of the 2013 IEEE 31st International Conference on Computer Design (ICCD), Asheville, NC, USA, 6–9 October 2013; pp. 349–356. [Google Scholar] [CrossRef]
- Borghesi, A.; Collina, F.; Lombardi, M.; Milano, M.; Benini, L. Power Capping in High Performance Computing Systems. In Principles and Practice of Constraint Programming; Pesant, G., Ed.; Springer International Publishing: Cham, Switzerland, 2015; pp. 524–540. [Google Scholar]
- Krzywaniak, A.; Czarnul, P. Performance/Energy Aware Optimization of Parallel Applications on GPUs Under Power Capping. In Parallel Processing and Applied Mathematics; Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 123–133. [Google Scholar]
- Haidar, A.; Jagode, H.; Vaccaro, P.; Yarkhan, A.; Tomov, S.; Dongarra, J. Investigating power capping toward energy-efficient scientific applications. Concurr. Comput. Pract. Exp. 2018, 31, e4485. [Google Scholar] [CrossRef]
- Imes, C.; Zhang, H.; Zhao, K.; Hoffmann, H. CoPPer: Soft Real-Time Application Performance Using Hardware Power Capping. In Proceedings of the 2019 IEEE International Conference on Autonomic Computing (ICAC), Umea, Sweden, 16–20 June 2019; pp. 31–41. [Google Scholar] [CrossRef]
- Ramesh, S.; Perarnau, S.; Bhalachandra, S.; Malony, A.D.; Beckman, P. Understanding the Impact of Dynamic Power Capping on Application Progress. In Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rio de Janeiro, Brazil, 20–24 May 2019; pp. 793–804. [Google Scholar] [CrossRef]
- Berral, J.L.; Goiri, I.n.; Nou, R.; Julià, F.; Guitart, J.; Gavaldà, R.; Torres, J. Towards Energy-Aware Scheduling in Data Centers Using Machine Learning. In Proceedings of the 1st International Conference on Energy-Efficient Computing and Networking, New York, NY, USA, 13–15 April 2010; Association for Computing Machinery: New York, NY, USA, 2010. e-Energy ’10. pp. 215–224. [Google Scholar] [CrossRef] [Green Version]
- Zhao, X.; Jamali, N. Energy-aware resource allocation for multicores with per-core frequency scaling. J. Internet Serv. Appl. 2014, 5, 9. [Google Scholar] [CrossRef] [Green Version]
- Rajagopal, D.; Tafani, D.; Georgiou, Y.; Glesser, D.; Ott, M. A Novel Approach for Job Scheduling Optimizations Under Power Cap for ARM and Intel HPC Systems. In Proceedings of the 2017 IEEE 24th International Conference on High Performance Computing (HiPC), Jaipur, India, 18–21 December 2017; pp. 142–151. [Google Scholar] [CrossRef]
- Borghesi, A.; Bartolini, A.; Lombardi, M.; Milano, M.; Benini, L. Scheduling-based power capping in high performance computing systems. Sustain. Comput. Inform. Syst. 2018, 19, 1–13. [Google Scholar] [CrossRef]
- Zhang, Z.; Lang, M.; Pakin, S.; Fu, S. Trapped capacity: Scheduling under a power cap to maximize machine-room throughput. In Proceedings of the 2014 Energy Efficient Supercomputing Workshop, New Orleans, LA, USA, 16–21 November 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 41–50. [Google Scholar]
- Nair, P.P.; Devaraj, R.; Sarkar, A. FEST: Fault-Tolerant Energy-Aware Scheduling on Two-Core Heterogeneous Platform. In Proceedings of the 2018 8th International Symposium on Embedded Computing and System Design (ISED), Cochin, India, 13–15 December 2018; pp. 63–68. [Google Scholar] [CrossRef]
- Goiri, I.; Julià, F.; Nou, R.; Berral, J.L.; Guitart, J.; Torres, J. Energy-Aware Scheduling in Virtualized Datacenters. In Proceedings of the 2010 IEEE International Conference on Cluster Computing, Heraklion, Greece, 20–24 September 2010; pp. 58–67. [Google Scholar] [CrossRef] [Green Version]
- Zhu, X.; Yang, L.; Chen, H.; Wang, J.; Yin, S.; Liu, X. Real-Time Tasks Oriented Energy-Aware Scheduling in Virtualized Clouds. Cloud Comput. IEEE Trans. 2014, 2, 168–180. [Google Scholar] [CrossRef]
- Hosseinimotlagh, S.; Khunjush, F.; Hosseinimotlagh, S. A Cooperative Two-Tier Energy-Aware Scheduling for Real-Time Tasks in Computing Clouds. In Proceedings of the 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Turin, Italy, 12–14 February 2014; pp. 178–182. [Google Scholar] [CrossRef]
- Ardagna, D.; Panicucci, B.; Trubian, M.; Zhang, L. Energy-Aware Autonomic Resource Allocation in Multitier Virtualized Environments. IEEE Trans. Serv. Comput. 2012, 5, 2–19. [Google Scholar] [CrossRef]
- Kandhalu, A.; Kim, J.; Lakshmanan, K.; Rajkumar, R. Energy-Aware Partitioned Fixed-Priority Scheduling for Chip Multi-processors. In Proceedings of the 2011 IEEE 17th International Conference on Embedded and Real-Time Computing Systems and Applications, Toyama, Japan, 28–31 August 2011; Volume 1, pp. 93–102. [Google Scholar] [CrossRef]
- D’Amico, M.; Gonzalez, J.C. Energy hardware and workload aware job scheduling towards interconnected HPC environments. IEEE Trans. Parallel Distrib. Syst. 2021, 1. [Google Scholar] [CrossRef]
- Li, D.; Byna, S.; Chakradhar, S. Energy-Aware Workload Consolidation on GPU. In Proceedings of the 2011 40th International Conference on Parallel Processing Workshops, Taipei, Taiwan, 13–16 September 2011; pp. 389–398. [Google Scholar] [CrossRef]
- Guerreiro, J.; Ilic, A.; Roma, N.; Tomás, P. Multi-kernel Auto-Tuning on GPUs: Performance and Energy-Aware Optimization. In Proceedings of the 2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Turku, Finland, 4–6 March 2015; pp. 438–445. [Google Scholar] [CrossRef]
- Yao, C.; Liu, W.; Tang, W.; Hu, S. EAIS: Energy-aware adaptive scheduling for CNN inference on high-performance GPUs. Future Gener. Comput. Syst. 2022, 130, 253–268. [Google Scholar] [CrossRef]
- Pirahandeh, M.; Kim, D.H. Energy-Aware GPU-RAID Scheduling for Reducing Energy Consumption in Cloud Storage Systems. In Computer Science and Its Applications; Park, J.J.J.H., Stojmenovic, I., Jeong, H.Y., Yi, G., Eds.; Springer: Berlin/Heidelberg, Germany, 2015; pp. 705–711. [Google Scholar]
- Pirahandeh, M.; Kim, D.H. EGE: A New Energy-Aware GPU Based Erasure Coding Scheduler for Cloud Storage Systems. In Proceedings of the 2018 Tenth International Conference on Ubiquitous and Future Networks (ICUFN), Prague, Czech Republic, 3–6 July 2018; pp. 619–621. [Google Scholar] [CrossRef]
- Sun, Y.; Gong, X.; Ziabari, A.K.; Yu, L.; Li, X.; Mukherjee, S.; McCardwell, C.; Villegas, A.; Kaeli, D. Hetero-mark, a benchmark suite for CPU-GPU collaborative computing. In Proceedings of the 2016 IEEE International Symposium on Workload Characterization (IISWC), Providence, RI, USA, 25–27 September 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–10. [Google Scholar]
- Rościszewski, P.; Czarnul, P.; Lewandowski, R.; Schally-Kacprzak, M. KernelHive: A New Workflow-Based Framework for Multilevel High Performance Computing Using Clusters and Workstations with CPUs and GPUs. Concurr. Comput. Pract. Exp. 2016, 28, 2586–2607. Available online: https://onlinelibrary.wiley.com/doi/pdf/10.1002/cpe.3719 (accessed on 15 December 2022). [CrossRef]
- Gregg, C.; Boyer, M.; Hazelwood, K.; Skadron, K. Dynamic heterogeneous scheduling decisions using historical runtime data. In Proceedings of the Workshop on Applications for Multi-and Many-Core Processors (A4MMC), San Jose, CA, USA, 4–8 June 2011; pp. 1–12. [Google Scholar]
- Czarnul, P. Investigation of Parallel Data Processing Using Hybrid High Performance CPU + GPU Systems and CUDA Streams. Comput. Inform. 2020, 39, 510–536. [Google Scholar] [CrossRef]
- Arafa, Y.; ElWazir, A.; ElKanishy, A.; Aly, Y.; Elsayed, A.; Badawy, A.H.; Chennupati, G.; Eidenbenz, S.; Santhi, N. Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs. In Proceedings of the 17th ACM International Conference on Computing Frontiers, Bertinoro, Italy, 17–22 May 2020; Association for Computing Machinery: New York, NY, USA, 2020. CF ’20. pp. 60–70. [Google Scholar] [CrossRef]
- Fahad, M.; Shahid, A.; Manumachu, R.R.; Lastovetsky, A. A Comparative Study of Methods for Measurement of Energy of Computing. Energies 2019, 12, 2204. [Google Scholar] [CrossRef] [Green Version]
- Tang, X.; Fu, Z. CPU–GPU Utilization Aware Energy-Efficient Scheduling Algorithm on Heterogeneous Computing Systems. IEEE Access 2020, 8, 58948–58958. [Google Scholar] [CrossRef]
- Mejri, N.; Dupont, B.; Da Costa, G. Energy-aware scheduling of malleable HPC applications using a Particle Swarm optimised greedy algorithm. Sustain. Comput. Inform. Syst. 2020, 28, 100447. [Google Scholar]
- Angel, E.; Bampis, E.; Kacem, F. Energy Aware Scheduling for Unrelated Parallel Machines. In Proceedings of the 2012 IEEE International Conference on Green Computing and Communications, Besancon, France, 20–23 November 2012; pp. 533–540. [Google Scholar] [CrossRef]
- Lin, X.; Wang, Y.; Pedram, M. A Reinforcement Learning-Based Power Management Framework for Green Computing Data Centers. In Proceedings of the 2016 IEEE International Conference on Cloud Engineering (IC2E), Berlin, Germany, 4–8 April 2016; pp. 135–138. [Google Scholar] [CrossRef]
- Li, J.; Zhang, X.; Wei, Z.; Wei, J.; Ji, Z. Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systems. CCF Trans. High Perform. Comput. 2021, 3, 383–392. [Google Scholar] [CrossRef]
- Imes, C.; Hofmeyr, S.; Hoffmann, H. Energy-efficient application resource scheduling using machine learning classifiers. In Proceedings of the 47th International Conference on Parallel Processing, Eugene, OR, USA, 13–16 August 2018; pp. 1–11. [Google Scholar]
- Bellman, R. The theory of dynamic programming. Bull. Am. Math. Soc. 1954, 60, 503–515. [Google Scholar] [CrossRef] [Green Version]
- Bellman, R.E.; Dreyfus, S.E. Applied Dynamic Programming; Princetown University Press: Princeton, NJ, USA, 1962. [Google Scholar]
- Li, Y.; Niu, J.; Atiquzzaman, M.; Long, X. Energy-aware scheduling on heterogeneous multi-core systems with guaranteed probability. Special Issue on Scalable Cyber-Physical Systems. J. Parallel Distrib. Comput. 2017, 103, 64–76. [Google Scholar] [CrossRef]
- Chen, J.; John, L.K. Energy-aware application scheduling on a heterogeneous multi-core system. In Proceedings of the 2008 IEEE International Symposium on Workload Characterization, Seattle, WA, USA, 14–16 September 2008; pp. 5–13. [Google Scholar] [CrossRef] [Green Version]
- Zadeh, L. Fuzzy logic. Computer 1988, 21, 83–93. [Google Scholar] [CrossRef]
- Méndez-Díaz, I.; Orozco, J.; Santos, R.; Zabala, P. Energy-aware scheduling mandatory/optional tasks in multicore real-time systems. Int. Trans. Oper. Res. 2017, 24, 173–198. [Google Scholar] [CrossRef]
- Keller, J.; Litzinger, S. Systematic search space design for energy-efficient static scheduling of moldable tasks. J. Parallel Distrib. Comput. 2022, 162, 44–58. [Google Scholar] [CrossRef]
- Etinski, M.; Corbalan, J.; Labarta, J.; Valero, M. Parallel job scheduling for power constrained HPC systems. Parallel Comput. 2012, 38, 615–630. [Google Scholar] [CrossRef]
- Tang, Q.; Gupta, S.K.S.; Varsamopoulos, G. Energy-Efficient Thermal-Aware Task Scheduling for Homogeneous High-Performance Computing Data Centers: A Cyber-Physical Approach. IEEE Trans. Parallel Distrib. Syst. 2008, 19, 1458–1472. [Google Scholar] [CrossRef]
- Agrawal, P.; Rao, S. Energy-aware scheduling of distributed systems. IEEE Trans. Autom. Sci. Eng. 2014, 11, 1163–1175. [Google Scholar] [CrossRef]
- Mezmaz, M.; Melab, N.; Kessaci, Y.; Lee, Y.C.; Talbi, E.G.; Zomaya, A.Y.; Tuyttens, D. A parallel bi-objective hybrid metaheuristic for energy-aware scheduling for cloud computing systems. J. Parallel Distrib. Comput. 2011, 71, 1497–1508. [Google Scholar] [CrossRef] [Green Version]
- Lee, Y.C.; Zomaya, A.Y. Minimizing Energy Consumption for Precedence-Constrained Applications Using Dynamic Voltage Scaling. In Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, Shanghai, China, 18–21 May 2009; IEEE: Piscataway, NJ, USA, 2009; pp. 92–99. [Google Scholar] [CrossRef]
- Guzek, M.; Pecero, J.E.; Dorronsoro, B.; Bouvry, P. Multi-objective evolutionary algorithms for energy-aware scheduling on distributed computing systems. Appl. Soft Comput. 2014, 24, 432–446. [Google Scholar] [CrossRef]
- Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 2002, 6, 182–197. [Google Scholar] [CrossRef] [Green Version]
- Nebro, A.J.; Durillo, J.J.; Luna, F.; Dorronsoro, B.; Alba, E. Design Issues in a Multiobjective Cellular Genetic Algorithm. In Evolutionary Multi-Criterion Optimization; Springer: Berlin/Heidelberg, Germany, 2007; Volume 4403 LNCS, pp. 126–140. [Google Scholar] [CrossRef]
- Zitzler, E.; Künzli, S. Indicator-Based Selection in Multiobjective Search. In Parallel Problem Solving from Nature—PPSN VIII; Yao, X., Burke, E.K., Lozano, J.A., Smith, J., Merelo-Guervós, J.J., Bullinaria, J.A., Rowe, J.E., Tiňo, P., Kabán, A., Schwefel, H.P., Eds.; Springer: Berlin/Heidelberg, Germany, 2004; pp. 832–842. [Google Scholar]
- Kolodziej, J.; Khan, S.U.; Xhafa, F. Genetic algorithms for energy-aware scheduling in computational grids. In Proceedings of the 2011 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, Barcelona, Spain, 8–10 November 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 17–24. [Google Scholar]
- Kassab, A.; Nicod, J.M.; Philippe, L.; Rehn-Sonigo, V. Assessing the use of genetic algorithms to schedule independent tasks under power constraints. In Proceedings of the 2018 International Conference on High Performance Computing & Simulation (HPCS), Orleans, France, 16–20 July 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 252–259. [Google Scholar]
- Kassab, A.; Nicod, J.m.; Philippe, L.; Rehn-Sonigo, V. Scheduling Independent Tasks in Parallel under Power Constraints. In Proceedings of the 2017 46th International Conference on Parallel Processing (ICPP), Bristol, UK, 14–17 August 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 543–552. [Google Scholar] [CrossRef] [Green Version]
- Mishra, R.; Rastogi, N.; Zhu, D.; Mossé, D.; Melhem, R. Energy aware scheduling for distributed real-time systems. In Proceedings of the International Parallel and Distributed Processing Symposium, Cambridge, MA, USA, 20–24 May 2003; IEEE: Piscataway, NJ, USA, 2003; p. 9. [Google Scholar]
- Chiesi, M.; Vanzolini, L.; Mucci, C.; Franchi Scarselli, E.; Guerrieri, R. Power-Aware Job Scheduling on Heterogeneous Multicore Architectures. IEEE Trans. Parallel Distrib. Syst. 2015, 26, 868–877. [Google Scholar] [CrossRef]
- Wang, L.; Von Laszewski, G.; Dayal, J.; Wang, F. Towards energy aware scheduling for precedence constrained parallel tasks in a cluster with DVFS. In Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, Melbourne, Australia, 17–20 May 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 368–377. [Google Scholar]
- Nesmachnow, S.; Dorronsoro, B.; Pecero, J.E.; Bouvry, P. Energy-aware scheduling on multicore heterogeneous grid computing systems. J. Grid Comput. 2013, 11, 653–680. [Google Scholar] [CrossRef]
- Aupy, G.; Benoit, A.; Robert, Y. Energy-aware scheduling under reliability and makespan constraints. In Proceedings of the 2012 19th International Conference on High Performance Computing, Pune, India, 18–22 December 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 1–10. [Google Scholar]
- Yang, X.; Zhou, Z.; Wallace, S.; Lan, Z.; Tang, W.; Coghlan, S.; Papka, M.E. Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems. In Proceedings of the SC’13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Denver, CO, USA, 17–21 November 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 1–11. [Google Scholar]
- Barik, R.; Farooqui, N.; Lewis, B.T.; Hu, C.; Shpeisman, T. A Black-Box Approach to Energy-Aware Scheduling on Integrated CPU-GPU Systems. In Proceedings of the 2016 International Symposium on Code Generation and Optimization, Barcelona, Spain, 12–18 March 2016; Association for Computing Machinery: New York, NY, USA, 2016. CGO ’16. pp. 70–81. [Google Scholar] [CrossRef]
- Maheswaran, M.; Ali, S.; Siegel, H.J.; Hensgen, D.; Freund, R.F. Dynamic mapping of a class of independent tasks onto heterogeneous computing systems. J. Parallel Distrib. Comput. 1999, 59, 107–131. [Google Scholar] [CrossRef] [Green Version]
- Li, Y.; Liu, Y.; Qian, D. A Heuristic Energy-aware Scheduling Algorithm for Heterogeneous Clusters. In Proceedings of the 2009 15th International Conference on Parallel and Distributed Systems, Shenzhen, China, 9–11 December 2009; pp. 407–413. [Google Scholar] [CrossRef]
- Biswas, T.; Kuila, P.; Ray, A.K. A novel energy efficient scheduling for high performance computing systems. In Proceedings of the 2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Bengaluru, India, 10–12 July 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–6. [Google Scholar]
- Amalarethinam, D.G.; Kavitha, S. Priority based performance improved algorithm for meta-task scheduling in cloud environment. In Proceedings of the 2017 2nd International Conference on Computing and Communications Technologies (ICCCT), Chennai, India, 23–24 February 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 69–73. [Google Scholar]
- Mei, J.; Li, K. Energy-Aware Scheduling Algorithm with Duplication on Heterogeneous Computing Systems. In Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing, Beijing, China, 20–23 September 2012; pp. 122–129. [Google Scholar] [CrossRef]
- Bansal, S.; Kumar, P.; Singh, K. Dealing with heterogeneity through limited duplication for scheduling precedence constrained task graphs. J. Parallel Distrib. Comput. 2005, 65, 479–491. [Google Scholar] [CrossRef]
- Hagras, T.; Janecek, J. A high performance, low complexity algorithm for compile-time task scheduling in heterogeneous systems. In Proceedings of the 18th International Parallel and Distributed Processing Symposium, Santa Fe, NM, USA, 26–30 April 2004; p. 107. [Google Scholar]
- Zong, Z.; Manzanares, A.; Ruan, X.; Qin, X. EAD and PEBD: Two energy-aware duplication scheduling algorithms for parallel tasks on homogeneous clusters. IEEE Trans. Comput. 2010, 60, 360–374. [Google Scholar] [CrossRef] [Green Version]
- Ebaid, A.; Rajasekaran, S.; Ammar, R.; Ebaid, R. Energy-aware heuristics for scheduling parallel applications on high performance computing platforms. In Proceedings of the 2014 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), Noida, India, 15–17 December 2014; pp. 282–289. [Google Scholar] [CrossRef]
- Ranaweera, S.; Agrawal, D.P. A task duplication based scheduling algorithm for heterogeneous systems. In Proceedings of the 14th International Parallel and Distributed Processing Symposium, IPDPS 2000, Cancun, Mexico, 1–5 May 2000; IEEE: Piscataway, NJ, USA, 2000; pp. 445–450. [Google Scholar]
- Mämmelä, O.; Majanen, M.; Basmadjian, R.; Meer, H.; Giesler, A.; Homberg, W. Energy-aware job scheduler for high-performance computing. Comput. Sci.-Res. Dev. 2012, 27, 265–275. [Google Scholar] [CrossRef]
- Mashayekhy, L.; Nejad, M.M.; Grosu, D.; Lu, D.; Shi, W. Energy-aware scheduling of mapreduce jobs. In Proceedings of the 2014 IEEE International Congress on Big Data, Washington, DC, USA, 27–30 October 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 32–39. [Google Scholar]
- Mashayekhy, L.; Nejad, M.M.; Grosu, D.; Zhang, Q.; Shi, W. Energy-Aware Scheduling of MapReduce Jobs for Big Data Applications. IEEE Trans. Parallel Distrib. Syst. 2015, 26, 2720–2733. [Google Scholar] [CrossRef]
- Mei, X.; Wang, Q.; Chu, X.; Liu, H.; Leung, Y.W.; Li, Z. Energy-aware Task Scheduling with Deadline Constraint in DVFS-enabled Heterogeneous Clusters. arXiv 2021, arXiv:2104.00486. [Google Scholar] [CrossRef]
- Kiselev, E.; Telegin, P.N.; Shabanov, B.M. An energy-efficient scheduling algorithm for shared facility supercomputer centers. Lobachevskii J. Math. 2021, 42, 2554–2561. [Google Scholar] [CrossRef]
- Wong, P.; Der Wijngaart, R. NAS parallel benchmarks I/O version 2.4. In Technical Report NAS-03-002; NASA Ames Research Center: Moffet Field, CA, USA, 2003; p. 91. [Google Scholar]
- Hu, Y.; Liu, C.; Li, K.; Chen, X.; Li, K. Slack allocation algorithm for energy minimization in cluster systems. Future Gener. Comput. Syst. 2017, 74, 119–131. [Google Scholar] [CrossRef]
- Maurya, A.K.; Modi, K.; Kumar, V.; Naik, N.S.; Tripathi, A.K. Energy-aware scheduling using slack reclamation for cluster systems. Clust. Comput. 2020, 23, 911–923. [Google Scholar] [CrossRef]
- Baskiyar, S.; Abdel-Kader, R. Energy aware DAG scheduling on heterogeneous systems. Clust. Comput. 2010, 13, 373–383. [Google Scholar] [CrossRef]
- Park, G.L.; Shirazi, B.; Marquis, J.; Choo, H. Decisive path scheduling: A new list scheduling method. In Proceedings of the Proceedings of the 1997 International Conference on Parallel Processing (Cat. No. 97TB100162), Bloomington, IL, USA, 11–15 August 1997; IEEE: Piscataway, NJ, USA, 1997; pp. 472–480. [Google Scholar]
- Roeder, J.; Rouxel, B.; Altmeyer, S.; Grelck, C. Energy-aware scheduling of multi-version tasks on heterogeneous real-time systems. In Proceedings of the 36th Annual ACM Symposium on Applied Computing, Gwangju, Republic of Korea, 22–26 March 2021; pp. 501–510. [Google Scholar]
- Ebrahimirad, V.; Goudarzi, M.; Rajabi, A. Energy-aware scheduling for precedence-constrained parallel virtual machines in virtualized data centers. J. Grid Comput. 2015, 13, 233–253. [Google Scholar] [CrossRef]
- Topcuoglu, H.; Hariri, S.; Wu, M.Y. Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Trans. Parallel Distrib. Syst. 2002, 13, 260–274. [Google Scholar] [CrossRef] [Green Version]
- Li, K. Energy efficient scheduling of parallel tasks on multiprocessor computers. J. Supercomput. 2012, 60, 223–247. [Google Scholar] [CrossRef]
- Chen, J.; He, Y.; Zhang, Y.; Han, P.; Du, C. Energy-aware scheduling for dependent tasks in heterogeneous multiprocessor systems. J. Syst. Archit. 2022, 129, 102598. [Google Scholar] [CrossRef]
- Shekar, V.; Izadi, B. Energy aware scheduling for DAG structured applications on heterogeneous and DVS enabled processors. In Proceedings of the International Conference on Green Computing, Chicago, IL, USA, 15–18 August 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 495–502. [Google Scholar]
- Sih, G.C.; Lee, E.A. A compile-time scheduling heuristic for interconnection-constrained heterogeneous processor architectures. IEEE Trans. Parallel Distrib. Syst. 1993, 4, 175–187. [Google Scholar] [CrossRef] [Green Version]
- Sun, H.; Stolf, P.; Pierson, J.M. Spatio-temporal thermal-aware scheduling for homogeneous high-performance computing datacenters. Future Gener. Comput. Syst. 2017, 71, 157–170. [Google Scholar] [CrossRef] [Green Version]
- Raghu, H.; Saurav, S.K.; Bapu, B.S. PAAS: Power Aware Algorithm for Scheduling in High Performance Computing. In Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing, Washington, DC, USA, 9–12 December 2013; pp. 327–332. [Google Scholar] [CrossRef]
- Hu, Y.; Li, J.; He, L. A reformed task scheduling algorithm for heterogeneous distributed systems with energy consumption constraints. Neural Comput. Appl. 2020, 32, 5681–5693. [Google Scholar] [CrossRef]
- Lee, Y.C.; Zomaya, A.Y. Energy Conscious Scheduling for Distributed Computing Systems under Different Operating Conditions. IEEE Trans. Parallel Distrib. Syst. 2011, 22, 1374–1381. [Google Scholar] [CrossRef]
- Bozdag, D.; Catalyurek, U.; Ozguner, F. A task duplication based bottom-up scheduling algorithm for heterogeneous environments. In Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium, Rhodes, Greece, 25–29 April 2006; IEEE: Piscataway, NJ, USA, 2006; p. 12. [Google Scholar]
- MA, Y.; GONG, B.; GUO, Z.; CHEN, Y.; ZOU, L. Energy-aware scheduling of parallel application in hybrid computing system. Chin. J. Electron. 2014, 23, 688–694. [Google Scholar]
- Chen, S.; Li, Z.; Yang, B.; Rudolph, G. Quantum-Inspired Hyper-Heuristics for Energy-Aware Scheduling on Heterogeneous Computing Systems. IEEE Trans. Parallel Distrib. Syst. 2016, 27, 1796–1810. [Google Scholar] [CrossRef]
- Deng, Z.; Yan, Z.; Huang, H.; Shen, H. Energy-Aware Task Scheduling on Heterogeneous Computing Systems With Time Constraint. IEEE Access 2020, 8, 23936–23950. [Google Scholar] [CrossRef]
- Li, K.; Tang, X.; Li, K. Energy-Efficient Stochastic Task Scheduling on Heterogeneous Computing Systems. IEEE Trans. Parallel Distrib. Syst. 2014, 25, 2867–2876. [Google Scholar] [CrossRef]
- Kim, K.H.; Buyya, R.; Kim, J. Power Aware Scheduling of Bag-of-Tasks Applications with Deadline Constraints on DVS-enabled Clusters. In Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid ’07), Rio de Janeiro, Brazil, 14–17 May 2007; pp. 541–548. [Google Scholar] [CrossRef]
- Cong, J.; Yuan, B. Energy-Efficient Scheduling on Heterogeneous Multi-Core Architectures. In Proceedings of the 2012 ACM/IEEE International Symposium on Low Power Electronics and Design, Redondo Beach, CA, USA, 30 July 2012–1 August 2012; Association for Computing Machinery: New York, NY, USA, 2012. ISLPED ’12. pp. 345–350. [Google Scholar] [CrossRef] [Green Version]
- Sawalha, L.; Barnes, R.D. Energy-Efficient Phase-Aware Scheduling for Heterogeneous Multicore Processors. In Proceedings of the 2012 IEEE Green Technologies Conference, Besancon, France, 20–23 November 2012; pp. 1–6. [Google Scholar] [CrossRef]
- Manumachu, R.R.; Lastovetsky, A. Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy. IEEE Trans. Comput. 2018, 67, 160–177. [Google Scholar] [CrossRef]
- Khaleghzadeh, H.; Fahad, M.; Shahid, A.; Manumachu, R.R.; Lastovetsky, A. Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy through Workload Distribution. IEEE Trans. Parallel Distrib. Syst. 2021, 32, 543–560. [Google Scholar] [CrossRef]
- Khaleghzadeh, H.; Reddy Manumachu, R.; Lastovetsky, A. Efficient Exact Algorithms for Continuous Bi-Objective Performance-Energy Optimization of Applications with Linear Energy and Monotonically Increasing Performance Profiles on Heterogeneous High Performance Computing Platforms. Concurr. Comput. Pract. Exp. 2022, e7285. Available online: https://onlinelibrary.wiley.com/doi/pdf/10.1002/cpe.7285 (accessed on 15 December 2022). [CrossRef]
- Li, D.; Wu, J. Energy-aware scheduling for frame-based tasks on heterogeneous multiprocessor platforms. In Proceedings of the 2012 41st International Conference on Parallel Processing, Pittsburgh, PA, USA, 10–13 September 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 430–439. [Google Scholar]
- Czarnul, P.; Rościszewski, P. Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge. In Proceedings of the 2019 International Conference on High Performance Computing Simulation (HPCS), Dublin, Ireland, 15–19 July 2019; pp. 551–558. [Google Scholar] [CrossRef]
- Czarnul, P.; Kuchta, J.; Matuszek, M.R.; Proficz, J.; Rosciszewski, P.; Wójcik, M.; Szymanski, J. MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems. Simul. Model. Pract. Theory 2017, 77, 124–140. [Google Scholar] [CrossRef]
- Fomperosa, J.; Mario Ibañez, E.S.; Bosque, J.L. Task Scheduler for Heterogeneous Data Centres based on Deep Reinforcement Learning. In Parallel Processing and Applied Mathematics; PPAM: Gdansk, Poland, 2022. [Google Scholar]
- Welsh, M. The End of Programming. Commun. ACM 2023, 66, 34–35. [Google Scholar] [CrossRef]
- Krzywaniak, A.; Czarnul, P.; Proficz, J. GPU Power Capping for Energy-Performance Trade-Offs in Training of Deep Convolutional Neural Networks for Image Recognition. In Proceedings of the Computational Science—ICCS 2022: 22nd International Conference, Part I, London, UK, 21–23 June 2022; Springer: Berlin/Heidelberg, Germany, 2022; pp. 667–681. [Google Scholar] [CrossRef]
Symbol | Description |
---|---|
J | set of tasks/jobs |
R | set of resources |
j | task/job |
r | resource |
execution time of task j on resource r | |
energy needed by task j on resource r | |
start time of task j | |
resource assigned to task j | |
communication time between resources , performing tasks x and y | |
D | set of the precedence pairs |
set of the resources assigned to task j | |
execution time | |
energy consumption | |
factors | |
W | weights |
performance per watt | |
reference performance per watt | |
The Green Index | |
relative energy efficiency | |
energy delay product | |
energy delay summation | |
energy delay distance |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kocot, B.; Czarnul, P.; Proficz, J. Energy-Aware Scheduling for High-Performance Computing Systems: A Survey. Energies 2023, 16, 890. https://doi.org/10.3390/en16020890
Kocot B, Czarnul P, Proficz J. Energy-Aware Scheduling for High-Performance Computing Systems: A Survey. Energies. 2023; 16(2):890. https://doi.org/10.3390/en16020890
Chicago/Turabian StyleKocot, Bartłomiej, Paweł Czarnul, and Jerzy Proficz. 2023. "Energy-Aware Scheduling for High-Performance Computing Systems: A Survey" Energies 16, no. 2: 890. https://doi.org/10.3390/en16020890
APA StyleKocot, B., Czarnul, P., & Proficz, J. (2023). Energy-Aware Scheduling for High-Performance Computing Systems: A Survey. Energies, 16(2), 890. https://doi.org/10.3390/en16020890