Scalable Loop Self-Scheduling Schemes for Large-Scale Clusters and Cloud Systems

Han, Yiming; Chronopoulos, Anthony T.

doi:10.1007/s10766-016-0434-5

Scalable Loop Self-Scheduling Schemes for Large-Scale Clusters and Cloud Systems

Published: 11 May 2016

Volume 45, pages 595–611, (2017)
Cite this article

International Journal of Parallel Programming Aims and scope Submit manuscript

Yiming Han¹ &
Anthony T. Chronopoulos¹

314 Accesses
8 Citations
3 Altmetric
Explore all metrics

Abstract

Cloud systems have demonstrated the powerful computation and storage capability in many scientific applications. In this paper, we propose a class of scalable distributed loop self-scheduling schemes to achieve good load balancing and scalability. We implemented these schemes on a large-scale cluster and on a heterogeneous cloud system. The schemes consider the distribution of the output data, which can help reduce communication overhead and improve scalability. We evaluated the schemes using four scientific computations: Mandelbrot set, adjoint convolution, matrix multiplication and quick sort. The results show that the new schemes achieve better load balancing, better scalability and better overall performance than standard distributed loop self-scheduling schemes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Query Processing Framework for Large-Scale Scientific Data Analysis

Scheduling in Parallel and Distributed Computing Systems

A Query Processing Framework for Array-Based Computations

References

Keahey, K: Cloud computing for science. In: Proceeding of 21st Scientific and Statistical Database Management Conference. vol. 5566, pp. 478–478 (2009)
Wang, L., Tao, J., Kunze, M., Castellanos, A., Kramer, D., Karl, W.: Scientific cloud computing: early definition and experience. In: 10th IEEE International Conference on High Performance Computing and Communications, HPCC’08, pp. 825–830 (2008)
Yeo, S., Lee, H.-H.: Using mathematical modeling in provisioning a heterogeneous cloud computing environment. Computer 44, 55–62 (2011)
Google Scholar
Han, Y., Chronopoulos, A.: Distributed loop scheduling schemes for cloud systems. In: IEEE 27th International Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW), pp. 955–962 (2013)
Sonnek, J., Greensky, J., Reutiman, R., Chandra, A.: Starling: minimizing communication overhead in virtualized computing platforms using decentralized affinity-aware migration. In: 39th International Conference on Parallel Processing, ICPP’10, pp. 228–237 (2010)
Han, Y., Chronopoulos, A.: A hierarchical distributed loop self-scheduling scheme for cloud systems. In: 12th IEEE International Symposium on Network Computing and Applications (NCA), pp. 7–10 (2013)
Han, Y., Chronopoulos, A.: Scalable loop self-scheduling schemes implemented on large-scale clusters. In: IEEE 27th International Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW), pp. 1735–1742 (2013)
Joyent corporation. http://joyent.com/
Kejariwal, A., Nicolau, A., Polychronopoulos, C.: History-aware self-scheduling. In: International Conference on Parallel Processing Parallel Processing, ICPP’06, pp. 185–192 (2006)
Chronopoulos, A.T., Penmatsa, S., Xu, J., Ali, S.: Distributed loop-scheduling schemes for heterogeneous computer systems. Concurr. Comput. Pract. Exp. 18, 771–785 (2006)
Article Google Scholar
Banicescu, I., Velusamy, V., Devaprasad, J.: On the scalability of dynamic scheduling scientific applications with adaptive weighted factoring. Clust. Comput. 6, 215–226 (2003)
Article Google Scholar
Chronopoulos, A.T., Penmatsa, S., Yu, N., Yu, D.: Scalable loop self-scheduling schemes for heterogeneous clusters. Int. J. Comput. Sci. Eng. 1, 110–117 (2005)
Article Google Scholar
Li, P., Ji, Q., Zhang, Y., Zhu, Q.: An adaptive chunk self-scheduling scheme on service grid. In: IEEE Asia-Pacific Services Computing Conference, APSCC’08, pp. 39–44 (2008)
Yang, K.-W.C., Chao-Tung, Li, K.-C.: An efficient parallel loop self-scheduling on grid environments. In: Network and Parallel Computing, pp. 92–100 (2004)
Yang, C.-T., Chang, S.-C.: A parallel loop self-scheduling on extremely heterogeneous pc clusters. In: Proceedings of International Conference on Computational, Science, pp. 1079–1088 (2003)
Cheng, K.-W., Yang, C.-T., Lai, C.-L., Chang, S.-C.: A parallel loop self-scheduling on grid computing environments. In: Proceedings of the 2004 International Symposium on Parallel Architectures, Algorithms and IEEE Networks, pp. 409–414 (2004)
Andronikos, T., Ciorba, F.M., Riakiotakis, I., Papakonstantinou, G., Chronopoulos, A.T.: Studying the impact of synchronization frequency on scheduling tasks with dependencies in heterogeneous systems. Perform. Eval. 67(12), 1324–1339 (2010)
Article Google Scholar
Ciorba, F., Andronikos, T., Riakiotakis, I., Chronopoulos, A., Papakonstantinou, G.: Dynamic multi phase scheduling for heterogeneous clusters. In: Proceedings of the 20th IEEE International Parallel Distributed Processing Symposium, IPDPS06, p. 10 (2006)
Riakiotakis, I., Ciorba, F.M., Andronikos, T., Papakonstantinou, G., Chronopoulos, A.T.: Towards the optimal synchronization granularity for dynamic scheduling of pipelined computations on heterogeneous computing systems. Concurr. Comput. Pract. Exp. 24, 2302–2327 (2012)
Article Google Scholar
Diaz, J., Reyes, S., Nino, A., Munoz-Caro, C.: Derivation of self-scheduling algorithms for heterogeneous distributed computer systems: application to internet-based grids of computers. Future Gener. Comput. Syst. 25, 617–626 (2009)
Article Google Scholar
Wu, C.-C., Yang, C.-T., Lai, K.-C., Chiu, P.-H.: Designing parallel loop self-scheduling schemes using the hybrid MPI and openMP programming model for multi-core grid systems. J. Supercomput. 59, 42–60 (2012)
Article Google Scholar
He, Y., Liu, J., Sun, H.: Scheduling functionally heterogeneous systems with utilization balancing. In: IEEE International Parallel Distributed Processing Symposium, IPDPS’11, pp. 1187–1198 (2011)
Penmatsa, S., Chronopoulos, A., Karonis, N., Toonen, B.: Implementation of distributed loop scheduling schemes on the teragrid. IN: IEEE International Parallel and Distributed Processing Symposium, IPDPS’07, pp. 1–8 (2007)
Kyriakopoulos, K., Chronopoulos, A., Ni, L.: An optimal scheduling scheme for tiling in distributed systems. In: IEEE International Conference on Cluster Computing, pp. 267–274 (2007)
Chronopoulos, A., Ni, L., Penmatsa, S.: Multi-dimensional dynamic loop scheduling algorithms. In: IEEE International Conference on Cluster Computing, pp. 241–248 (2007)
Hwang, K., Dongarra, J., Fox, G.C.: Distributed and cloud computing: from parallel processing to the internet of things. Morgan Kaufmann, (2011)
Wen, G., Hong, J., Xu, C., Balaji, P., Feng, S., Jiang, P.: Energy-aware hierarchical scheduling of applications in large scale data centers. In: International Conference on Cloud and Service Computing, CSC’11, pp. 158–165 (2011)
Iosup, A., Ostermann, S., Yigitbasi, M., Prodan, R., Fahringer, T., Epema, D.: Performance analysis of cloud computing services for many-tasks scientific computing. IEEE Trans. Parallel Distrib. Syst. 22, 931–945 (2011)
Article Google Scholar
Ekanayake, J. Fox, G.: High performance parallel computing with clouds and cloud technologies. In: Proceedings of the First International Conference on Cloud Computing, pp. 20–38 (2010)
Evangelinos, C., Hill, C.N.: Cloud computing for parallel scientific HPC applications: feasibility of running coupled atmosphere–ocean climate models on Amazon’s EC2. In: 5th International Conference on Computability and Complexity in Analysis, pp. 159–168 (2008)
Shih, W.-C. Tseng, S.-S., Yang, C.-T.: Performance study of parallel programming on cloud computing environments using MapReduce. In: International Conference on Information Science and Applications, ICISA’10, pp. 1–8 (2010)
Stress. http://www.hecticgeek.com/2012/11/stress-test-your-ubuntu-computer-with-stress/

Download references

Acknowledgments

The authors acknowledge: (1) support by NSF grant (HRD-0932339) to the University of Texas at San Antonio; and (2) time grants to access the facilities of Institute for Cyber Security(ICS) of University of Texas at San Antonio and FutureGrid at Indiana University, Bloomington; (3) the Texas Advanced Computing Center (TACC) at The University of Texas at Austin for providing HPC, visualization, database, or grid resources that have contributed to the research results reported within this paper. URL:http://www.tacc.utexas.edu.

Author information

Authors and Affiliations

Department of Computer Science, University of Texas at San Antonio, San Antonio, TX, USA
Yiming Han & Anthony T. Chronopoulos

Authors

Yiming Han
View author publications
You can also search for this author in PubMed Google Scholar
Anthony T. Chronopoulos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anthony T. Chronopoulos.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Han, Y., Chronopoulos, A.T. Scalable Loop Self-Scheduling Schemes for Large-Scale Clusters and Cloud Systems. Int J Parallel Prog 45, 595–611 (2017). https://doi.org/10.1007/s10766-016-0434-5

Download citation

Received: 28 June 2015
Accepted: 26 April 2016
Published: 11 May 2016
Issue Date: June 2017
DOI: https://doi.org/10.1007/s10766-016-0434-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Scalable Loop Self-Scheduling Schemes for Large-Scale Clusters and Cloud Systems

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Query Processing Framework for Large-Scale Scientific Data Analysis

Scheduling in Parallel and Distributed Computing Systems

A Query Processing Framework for Array-Based Computations

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Scalable Loop Self-Scheduling Schemes for Large-Scale Clusters and Cloud Systems

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Query Processing Framework for Large-Scale Scientific Data Analysis

Scheduling in Parallel and Distributed Computing Systems

A Query Processing Framework for Array-Based Computations

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation