Abstract
Embedded workloads are increasing day by day and becoming more complex. The number of workloads running in the embedded processors has been growing exponentially for the past few decades owing to the increased penetration of Internet of Things (IoT) among the users. Realizing the growing demand and complexity of the embedded workloads, embedded CPU designs are migrated from the single core to multiple cores to cater the needs of user’s application. But still, energy consumption, intelligent handling of the workloads still remains to be the real challenge among the researchers. To cater these challenges in realizing the full potential of the underlying platform, this paper proposes the deep learning-based workload characterization technique in which the microarchitecture independent workloads are considered as the major inputs. These inputs are then used for training the novel deep learning network called Bi-Attention–LSTM (Long Short Term Memory) which categorize the workloads in accordance to the present characteristics of embedded processors. The microarchitecture-independent workloads are collected from the three different benchmarks namely MiBENCH, IoMT (Internet of Medical Things) and EEMBC workloads thereby conducting the comprehensive experimentation to validate the proposed framework. The performance of characterization is then contrasted with that of other cutting-edge deep learning frameworks to demonstrate the superior performance of the proposed framework. Findings demonstrate that the suggested framework has outperformed the other frameworks with the finds its strong place in the workload characterization for multi core embedded processors.
Similar content being viewed by others
Data availabilty
Not applicable.
References
Butko A, Bruguier F, Gamatie A, Sassatelli G, Novo D Full system simulation of big., LITTLE Multicore Architecture for Performance and Energy Exploration. MCSoC, Embedded Multicore/Many-core Systems-on-Chip. 2016
Ganesh R, Aalsaud A. Experiments with Odroid-XU3 board".: Technical report series, No. CS-TR-1471. 2015
Krishna JV, Nasre R. Optimizing graph algorithms in asymmetric multicore processors. IEEE Trans Comput Aided Des Integrat Circuits Syst. 2018;37(11):2673–84.
Izadkhah H. Learning-based genetic algorithm for task graph scheduling. Appl Comput Intell Soft Comput. 2019. https://doi.org/10.1155/2019/6543957.
Li G, Wu Z. Ant colony optimization task scheduling algorithm for swim based on load balancing. Future Int. 2019;11(4):90.
Wachowiak MP, Timson MC. Adaptive particle swarm optimization with heterogeneous multicore parallelism and GPU acceleration. IEEE Trans Parall Distribut Syst. 2017;28(10):2784–93.
Tarplee KM, Friese R. Energy and makespan tradeoffs in heterogeneous computing systems using efficient linear programming techniques. IEEE Trans Parallel Distribut Syst. 2016;27(6):1633–46.
Venugopalan S, Sinnen O. ILP formulations for optimal task scheduling with communication delays on parallel systems. IEEE Trans Parallel Distribut Syst. 2015;26(1):142–51.
Tang Z, Qi L, Cheng Z, Li K, Khan SU, Li K. An energy efficient task scheduling algorithm in DVFS-enabled cloud environment. J Grid Comput. 2016;14(1):55–74.
Xie G, Zeng G, Xiao X. Energy-Efficient scheduling algorithms for real-time parallel applications on heterogeneous distributed embedded systems. IEEE Trans Parallel Distribut Syst. 2017;28(12):141–52.
Ruan W. Heterogeneous Computing Made Easy: Qualcomm® Symphony System Manager SDK. 2017
Van Craeynest K, aleel A, Eeckhout L, Narvaez P, Emer J. Scheduling heterogeneous multi-cores through performance impact estimation (PIE), in: Proceedings of the 39th ISCA. 2012
Limaye A, Adegbija T. HERMIT: a benchmark suite for the internet of medical things. IEEE Int Things. 2018;5(5):4212–22.
Gillhuber A. Core-Mark—Open-Source-Benchmark on EEMBC, Elektronik net.de (2009)
Guthaus MR, Ringenberg JS, Ernst D, Austin TM, Mudge T, Brown RB, MiBench A. free, commercially representative embedded benchmark suite, in: Proceedings of the 4th Annual IEEE International Workshop Workload Characterization, 2001; p3–14.
Li CV, Vinicius P, Mossae D. Exploring machine learning for thread characterization on heterogeneous multiprocessors. ACM SIGOPS Operat Syst Rev. 2017. https://doi.org/10.1145/3139645.3139664.
Bhattacharjee A, Martonosi M. Thread criticality predictors for dynamic performance, power, and resource management in chip multiprocessors, in: Proceedings of the 36th International Symposium on Computer Architecture (ISCA). 2009; p. 290–301.
Saez JC, Pousa A, Castro F, Chaver D, Prieto-Matias M. Towards completely fair scheduling on asymmetric single-ISA multicore processors. J Parallel Distribut Comput. 2017;102:115–31.
Van Craeynest K, Akram S, Heirman W, Jaleel A, Eeckhout L. Fairness-aware scheduling on single-is a heterogeneous multi-cores, in: Proceedings of the 22ndInternational Conference on Parallel Architectures and Compilation Techniques. 2013; p. 177–187.
Nemirovsky D, Arkose T, Markovic N, Nemirovsky M, Unsal O, Cristal A. A machine learning approach for performance prediction and scheduling on heterogeneous CPUs, in: Proceedings of the 29th International Symposium on Computer Architecture and High-Performance Computing. 2017; p. 121–128.
Sayadi H, Patel N, Sasan A. HoumanHomayoun energy-efficiency prediction and scheduling in composite cores architectures, in: Proceedings of the IEEE International Conference on Computer Design (ICCD). 2017; p. 129–136.
Gamatie A, An X, Zhang Y, An Kang G. Sassatelli, Empirical model-based performance prediction for application mapping on multicore architectures. J Syst Architect. 2019;98(1):1–16.
Ababei C, Moghaddam MG. A survey of prediction and classification techniques in multicore processor systems. IEEE Transact Parall Distribut Syst. 2019;30(5):1184–200.
Funding
No funding received for this research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
No conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Advances in Computational Approaches for Image Processing, Wireless Networks, Cloud Applications and Network Security” guest edited by P. Raviraj, Maode Ma and Roopashree H R.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sivaramakrishnan, R., Senthilkumar, G. A Deep Learning Framework for Microarchitecture Independent Workload Characterization Technique for Multi-core Asymmetric Embedded Systems. SN COMPUT. SCI. 4, 511 (2023). https://doi.org/10.1007/s42979-023-01909-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-023-01909-8