Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Free access
Just Accepted

Heterogeneous Big Data Parallel Computing Optimization Model using MPI/OpenMP Hybrid and Sensor Networks

Online AM: 09 August 2022 Publication History

Editorial Notes

EXPRESSION OF CONCERN: ACM is issuing a formal Expression of Concern for all papers published in the TOSN Special Issue on Green Communications and Sensor Networks with Machine Intelligence for Smart Cities while a thorough investigation takes place with regards to the integrity of the peer review process. ACM strongly suggests that papers in this special issue not be cited in the literature until ACM's investigation has concluded and final decisions have been made regarding the integrity of the peer review process.

Abstract

For the heterogeneous big data parallel computing model, two levels of parallelism between nodes are not considered, resulting in low efficiency of heterogeneous big data parallel computing and bandwidth to send and receive information, high communication overhead, long model running time and small computational volume. In the paper, we propose an optimization model of heterogeneous big data parallel computing based on a hybrid Multi Point Interface (MPI)/Open Multi-Processing (OpenMP) and Sensor Networks. First, the processor characteristics of heterogeneous big data architecture is analyzed, the parallel tasks among processors are divided, collect the heterogeneous big data to be computed and cluster them, and use the processing results as the input items of the model. Then, a parallel load balancing mechanism is established to optimally divide the parallel computing load of heterogeneous big data, and a parallel computing optimization program is written by combining the hybrid programming mode of MPI and OpenMP and using the hybrid MPI/OpenMP, and finally, the parallel computing optimization of heterogeneous big data is realized by optimizing the parallel communication and determining the model parameters. The results show that the proposed model has a communication bandwidth of 510Mbps, a computational volume of 1.16GB, a model runtime of 24s, and an improved network bandwidth utilization of 93%, which can effectively reduce the communication overhead, and improve the efficiency of parallel computing and bandwidth sending and receiving information in sensor networks, and shorten the model running time.

References

[1]
Wu Fa Hui, Zhang Ling. Design of hybrid parallel algorithm for cluster computer based on pram model. Journal of University of information engineering, 2019, 20 (04): 417-420
[2]
Hu Xiaoqiang, Wu Zhen, Wen Lijie, et al. Parallel distributed process mining algorithm based on spark. Computer integrated manufacturing system, 2019, 25 (04): 5-11
[3]
Yang Shitong, Cai Yanxia, Lu Guorui, et al. Parallel computing technology of CME parameter identification model based on MapReduce. Journal of space science, 2020, 40 (02): 31-37
[4]
Xu Dexin, Li Lingjuan. Parallelization of association rule mining algorithm based on spark. Computer technology and development, 2019, 29 (03): 36-40
[5]
H Khaleghzadeh, Manumachu R R, Lastovetsky A. A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes. IEEE Access, 2020, 8:7861-7876.
[6]
N Park, Park S, Lee M. High performance parallel KMP algorithm on a heterogeneous architecture. Cluster Computing, 2019(22):1-13.
[7]
Chen L, Liu Z H. Energy- and locality-efficient multi-job scheduling based on MapReduce for heterogeneous datacenter. Service Oriented Computing and Applications, 2019, 13(1):1-12.
[8]
Lin H, Li M F, Jia C F, et al. Degree-of-Node Task Scheduling of Fine-Grained Parallel Programs on Heterogeneous Systems. Journal of Computer Science and Technology, 2019, 34(005):1096-1108.
[9]
D. Jeong, J. Kim, M. -L. Oldja and S. Ha, "Parallel Scheduling of Multiple SDF Graphs Onto Heterogeneous Processors," in IEEE Access, vol. 9, pp. 20493-20507, 2021
[10]
WANG Wei, XU Chuanfu, CHE Yonggang. Heterogeneous parallel algorithm for internal and external subregion partitioning of high precision CFD programs. Journal of the National University of Defense Technology, 2020, 42(02):34-43.
[11]
Li A-M, Ji S-S, Liao X-Y, et al. A structured parallel programming framework for heterogeneous computing. Computer Engineering and Science, 2019, 41(03):424-432.
[12]
Wang F, Zhang W, Guo H, et al. Automatic Translation of Data Parallel Programs for Heterogeneous Parallelism Through OpenMP Offloading. The Journal of Supercomputing, 2020, 77:4957-4987.
[13]
Raca V, Mehofer E. clusterCL: comprehensive support for multi-kernel data-parallel applications in heterogeneous asymmetric clusters. The Journal of Supercomputing, 2020, 76(2):9976-10008.
[14]
Lf A, Ag B, Fxt A. A hierarchical parallel implementation for heterogeneous computing. Application to algebra-based CFD simulations on hybrid supercomputers - ScienceDirect. Computers & Fluids, 2020, 214:104768.
[15]
Bevilaqua A, D Bevilaqua, Yamanaka K. Parallel island based Memetic Algorithm with Lin–Kernighan local search for a real-life Two-Echelon Heterogeneous Vehicle Routing Problem based on Brazilian wholesale companies. Applied Soft Computing, 2019, 76:697-711.
[16]
Saad, Y. SPARSKIT: A basic toolkit for sparse matrix computations (1994) www.cs.umn. edu/Research/arpa/SPARSKIT/sparskit.html
[17]
Z. Shen, L. Han, R. Chen, C. Ma, Z. Jia and Z. Shao, An Efficient Directory Entry Lookup Cache With Prefix-Awareness for Mobile Devices, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 39, no. 12, pp. 4575-4586, 2020
[18]
Cebrian J M, Imbernon B, Soto J, et al. High-throughput fuzzy clustering on heterogeneous architectures. Future generation computer systems, 2020, 106(May):401-411.
[19]
Gahar R M, Arfaoui O, Hidri M S, et al. A Distributed Approach for High-Dimensionality Heterogeneous Data Reduction. IEEE Access, 2019, PP(99):1-1.
[20]
S. Yan, Q. Liu, J. Li and L. Han, Heterogeneous Acceleration of Hybrid PSO-QN Algorithm for Neural Network Training IEEE Access, vol. 7, pp. 161499-161509, 2019.
[21]
H Khaleghzadeh, Fahad M, Shahid A, et al. Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution. IEEE Transactions on Parallel and Distributed Systems, 2021, 32(3):543-560.
[22]
Nozal R, Perez B, Bosque J L, et al. Load balancing in a heterogeneous world: CPU-Xeon Phi co-execution of data-parallel kernels. Journal of supercomputing, 2019, 75(3):1123-1136.
[23]
H Khaleghzadeh, Manumachu R R, Lastovetsky A. A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes. IEEE Access, 2020, 8:7861-7876.
[24]
Kim D, Kang S, J Lim, et al. Resource-Aware Device Allocation of Data-Parallel Applications on Heterogeneous Systems. Electronics, 2020, 9(11):1825
[25]
Manumachu R R, Lastovetsky A L. Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution. Concurrency and Computation Practice and Experience, 2019, 31(4):e4958.1-e4958.24
[26]
Szustak L, Bratek P. Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors. Experimental Mechanics, 2019, 33(3):534-553.
[27]
Hassan B A, Rashid T A. A multidisciplinary ensemble algorithm for clustering heterogeneous datasets. Neural Computing and Applications, 2021(3):1-24.
[28]
Taniguchi I, H Tomiyama, Lin M, et al. A branch-and-bound approach to scheduling of data-parallel tasks on multi-core architectures. International Journal of Embedded Systems, 2020, 12(1):125.
[29]
H Khaleghzadeh, Fahad M, Shahid A, et al. Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution. IEEE Transactions on Parallel and Distributed Systems, 2021, 32(3):543-560.
[30]
Gu R, Zuo Z, Jiang X, et al. Towards Efficient Large-Scale Interprocedural Program Static Analysis on Distributed Data-Parallel Computation. IEEE Transactions on Parallel and Distributed Systems, 2021, 32(4):867-883.

Cited By

View all
  • (2024)Exploring Scalability of Value-Flow Graph ConstructionProceedings of the 2024 4th International Conference on Artificial Intelligence, Automation and High Performance Computing10.1145/3690931.3690951(112-117)Online publication date: 19-Jul-2024
  • (2024)Development of a Library for Image Processing Using Openmpi and Openmp2024 5th International Conference on Electronics and Sustainable Communication Systems (ICESC)10.1109/ICESC60852.2024.10689807(1973-1979)Online publication date: 7-Aug-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Sensor Networks
ACM Transactions on Sensor Networks Just Accepted
EISSN:1550-4867
Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Online AM: 09 August 2022
Accepted: 25 May 2022
Revised: 09 May 2022
Received: 22 March 2022

Check for updates

Author Tags

  1. MPI/OpenMP hybrid
  2. Heterogeneous big data
  3. Parallel computing
  4. Machine Learning
  5. Reservoir computing
  6. Sensor Networks

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)126
  • Downloads (Last 6 weeks)23
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Exploring Scalability of Value-Flow Graph ConstructionProceedings of the 2024 4th International Conference on Artificial Intelligence, Automation and High Performance Computing10.1145/3690931.3690951(112-117)Online publication date: 19-Jul-2024
  • (2024)Development of a Library for Image Processing Using Openmpi and Openmp2024 5th International Conference on Electronics and Sustainable Communication Systems (ICESC)10.1109/ICESC60852.2024.10689807(1973-1979)Online publication date: 7-Aug-2024

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media