Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

A Deep Learning Framework for Microarchitecture Independent Workload Characterization Technique for Multi-core Asymmetric Embedded Systems

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Embedded workloads are increasing day by day and becoming more complex. The number of workloads running in the embedded processors has been growing exponentially for the past few decades owing to the increased penetration of Internet of Things (IoT) among the users. Realizing the growing demand and complexity of the embedded workloads, embedded CPU designs are migrated from the single core to multiple cores to cater the needs of user’s application. But still, energy consumption, intelligent handling of the workloads still remains to be the real challenge among the researchers. To cater these challenges in realizing the full potential of the underlying platform, this paper proposes the deep learning-based workload characterization technique in which the microarchitecture independent workloads are considered as the major inputs. These inputs are then used for training the novel deep learning network called Bi-Attention–LSTM (Long Short Term Memory) which categorize the workloads in accordance to the present characteristics of embedded processors. The microarchitecture-independent workloads are collected from the three different benchmarks namely MiBENCH, IoMT (Internet of Medical Things) and EEMBC workloads thereby conducting the comprehensive experimentation to validate the proposed framework. The performance of characterization is then contrasted with that of other cutting-edge deep learning frameworks to demonstrate the superior performance of the proposed framework. Findings demonstrate that the suggested framework has outperformed the other frameworks with the finds its strong place in the workload characterization for multi core embedded processors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data availabilty

Not applicable.

References

  1. Butko A, Bruguier F, Gamatie A, Sassatelli G, Novo D Full system simulation of big., LITTLE Multicore Architecture for Performance and Energy Exploration. MCSoC, Embedded Multicore/Many-core Systems-on-Chip. 2016

  2. Ganesh R, Aalsaud A. Experiments with Odroid-XU3 board".: Technical report series, No. CS-TR-1471. 2015

  3. Krishna JV, Nasre R. Optimizing graph algorithms in asymmetric multicore processors. IEEE Trans Comput Aided Des Integrat Circuits Syst. 2018;37(11):2673–84.

    Article  Google Scholar 

  4. Izadkhah H. Learning-based genetic algorithm for task graph scheduling. Appl Comput Intell Soft Comput. 2019. https://doi.org/10.1155/2019/6543957.

    Article  Google Scholar 

  5. Li G, Wu Z. Ant colony optimization task scheduling algorithm for swim based on load balancing. Future Int. 2019;11(4):90.

    Article  Google Scholar 

  6. Wachowiak MP, Timson MC. Adaptive particle swarm optimization with heterogeneous multicore parallelism and GPU acceleration. IEEE Trans Parall Distribut Syst. 2017;28(10):2784–93.

    Article  Google Scholar 

  7. Tarplee KM, Friese R. Energy and makespan tradeoffs in heterogeneous computing systems using efficient linear programming techniques. IEEE Trans Parallel Distribut Syst. 2016;27(6):1633–46.

    Article  Google Scholar 

  8. Venugopalan S, Sinnen O. ILP formulations for optimal task scheduling with communication delays on parallel systems. IEEE Trans Parallel Distribut Syst. 2015;26(1):142–51.

    Article  Google Scholar 

  9. Tang Z, Qi L, Cheng Z, Li K, Khan SU, Li K. An energy efficient task scheduling algorithm in DVFS-enabled cloud environment. J Grid Comput. 2016;14(1):55–74.

    Article  Google Scholar 

  10. Xie G, Zeng G, Xiao X. Energy-Efficient scheduling algorithms for real-time parallel applications on heterogeneous distributed embedded systems. IEEE Trans Parallel Distribut Syst. 2017;28(12):141–52.

    Article  Google Scholar 

  11. Ruan W. Heterogeneous Computing Made Easy: Qualcomm® Symphony System Manager SDK. 2017

  12. Van Craeynest K, aleel A, Eeckhout L, Narvaez P, Emer J. Scheduling heterogeneous multi-cores through performance impact estimation (PIE), in: Proceedings of the 39th ISCA. 2012

  13. Limaye A, Adegbija T. HERMIT: a benchmark suite for the internet of medical things. IEEE Int Things. 2018;5(5):4212–22.

    Article  Google Scholar 

  14. Gillhuber A. Core-Mark—Open-Source-Benchmark on EEMBC, Elektronik net.de (2009)

  15. Guthaus MR, Ringenberg JS, Ernst D, Austin TM, Mudge T, Brown RB, MiBench A. free, commercially representative embedded benchmark suite, in: Proceedings of the 4th Annual IEEE International Workshop Workload Characterization, 2001; p3–14.

  16. Li CV, Vinicius P, Mossae D. Exploring machine learning for thread characterization on heterogeneous multiprocessors. ACM SIGOPS Operat Syst Rev. 2017. https://doi.org/10.1145/3139645.3139664.

    Article  Google Scholar 

  17. Bhattacharjee A, Martonosi M. Thread criticality predictors for dynamic performance, power, and resource management in chip multiprocessors, in: Proceedings of the 36th International Symposium on Computer Architecture (ISCA). 2009; p. 290–301.

  18. Saez JC, Pousa A, Castro F, Chaver D, Prieto-Matias M. Towards completely fair scheduling on asymmetric single-ISA multicore processors. J Parallel Distribut Comput. 2017;102:115–31.

    Article  Google Scholar 

  19. Van Craeynest K, Akram S, Heirman W, Jaleel A, Eeckhout L. Fairness-aware scheduling on single-is a heterogeneous multi-cores, in: Proceedings of the 22ndInternational Conference on Parallel Architectures and Compilation Techniques. 2013; p. 177–187.

  20. Nemirovsky D, Arkose T, Markovic N, Nemirovsky M, Unsal O, Cristal A. A machine learning approach for performance prediction and scheduling on heterogeneous CPUs, in: Proceedings of the 29th International Symposium on Computer Architecture and High-Performance Computing. 2017; p. 121–128.

  21. Sayadi H, Patel N, Sasan A. HoumanHomayoun energy-efficiency prediction and scheduling in composite cores architectures, in: Proceedings of the IEEE International Conference on Computer Design (ICCD). 2017; p. 129–136.

  22. Gamatie A, An X, Zhang Y, An Kang G. Sassatelli, Empirical model-based performance prediction for application mapping on multicore architectures. J Syst Architect. 2019;98(1):1–16.

    Article  Google Scholar 

  23. Ababei C, Moghaddam MG. A survey of prediction and classification techniques in multicore processor systems. IEEE Transact Parall Distribut Syst. 2019;30(5):1184–200.

    Article  Google Scholar 

Download references

Funding

No funding received for this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Sivaramakrishnan.

Ethics declarations

Conflict of Interest

No conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Advances in Computational Approaches for Image Processing, Wireless Networks, Cloud Applications and Network Security” guest edited by P. Raviraj, Maode Ma and Roopashree H R.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sivaramakrishnan, R., Senthilkumar, G. A Deep Learning Framework for Microarchitecture Independent Workload Characterization Technique for Multi-core Asymmetric Embedded Systems. SN COMPUT. SCI. 4, 511 (2023). https://doi.org/10.1007/s42979-023-01909-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-023-01909-8

Keywords