A Method for Estimating the Execution Time of a Parallel Task on a Grid Node

Phinjaroenphan, Panu; Bevinakoppa, Savitri; Zeephongsekul, Panlop

doi:10.1007/11508380_24

Panu Phinjaroenphan²¹,
Savitri Bevinakoppa²¹ &
Panlop Zeephongsekul²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3470))

Included in the following conference series:

European Grid Conference

548 Accesses

Abstract

The mapping problem has been studied extensively and many algorithms have been proposed. However, unrealistic assumptions have made the practicality of those algorithms doubtful. One of these assumptions is the ability to precisely calculate the execution time of a task to be mapped on a node before the actual execution. Since the theoretical calculation of task execution time is impossible in real environments, an estimation methodology is needed. In this paper, a practical method to estimate the execution time of a parallel task to be mapped on a grid node is proposed. It is not necessary to know the internal design and algorithm of the application in order to apply this method. The estimation is based upon past observations of the task executions. The estimating technique is a k-nearest-neighbours algorithm (knn). A backward predictor elimination, leave-one-out cross validation, and a statistical technique are used to derive the relevant parameters to be used by knn. Experimental results show that on average the proposed method can produce 2.3 times the number of accurate estimated execution times (with errors less than 25%) greater than the existing method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Performance and energy task migration model for heterogeneous clusters

Article 23 February 2021

OKCM: improving parallel task scheduling in high-performance computing systems using online learning

Article 13 November 2020

Designing a MapReduce performance model in distributed heterogeneous platforms based on benchmarking approach

Article 16 January 2020

References

Foster, I., Kesselman, C.: The Grid: Blueprint for Future Computing Infrastructure. Morgan Kaufmann, San Francisco (1998)
Google Scholar
Bokhari, S.: On the mapping problem. IEEE Transaction on Computers C-30, 207–214 (1981)
Article MathSciNet Google Scholar
Dail, H., Berman, F., Casanova, H.: A decoupled scheduling approach for grid application development environments. Journal of Parallel and Distributed Computing 63, 505–524 (2003)
Article MATH Google Scholar
Zhang, W., Fang, B., He, H., Zhang, H., Hu, M.: Multisite resource selection and scheduling algorithm on computational grid. In: 18^th International Parallel and Distributed Processing Symposium (IPDPS), pp. 105–115 (2004)
Google Scholar
Atkeson, C., Schaal, S., Moore, A.: Locally weighted learning. AI Reviews 11, 11–73 (1997)
Google Scholar
Iverson, M., Ozguner, F., Potter, L.: Statistical prediction of task execution times through analytic benchmarking for scheduling in a heterogeneous environment. IEEE Transaction on Computers 48, 35–44 (1999)
Article Google Scholar
Walpole, R.: Introduction to Statistics, 3rd edn. Collier Macmillan, Basingstoke (1982)
MATH Google Scholar
Grama, A., Gupta, A., Karypis, G., Kumar, V.: Introduction to Parallel Computing, 2nd edn. Pearson Education Limited, Essex (2003)
Google Scholar
Varavithya, V., Uthayopas, P.: ThaiGrid: Architecture and overview. NECTEC Techniqcal Journal 2 (2000)
Google Scholar
The PingER Project (2004), http://www-iepm.slac.stanford.edu/pinger/

Download references

Author information

Authors and Affiliations

School of Computer Science and Information Technology, RMIT University, GPO Box 2476V, Melbourne, Australia
Panu Phinjaroenphan & Savitri Bevinakoppa
School of Mathematical and Geospatial Sciences, RMIT University, GPO Box 2476V, Melbourne, Australia
Panlop Zeephongsekul

Authors

Panu Phinjaroenphan
View author publications
You can also search for this author in PubMed Google Scholar
Savitri Bevinakoppa
View author publications
You can also search for this author in PubMed Google Scholar
Panlop Zeephongsekul
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Sciences, Section of Computational Science, University of Amsterdam, Kruislaan 403, 1098, Amsterdam, SJ, The Netherlands
Peter M. A. Sloot
Section Computational Science, University of Amsterdam, The Netherlands
Alfons G. Hoekstra
INRIA Rennes - Bretagne Atlantique, Campus de Beaulieu, 35042, Rennes Cedex, France
Thierry Priol
Zuse Institute Berlin,
Alexander Reinefeld
Institute of Computer Science, AGH, Poland
Marian Bubak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Phinjaroenphan, P., Bevinakoppa, S., Zeephongsekul, P. (2005). A Method for Estimating the Execution Time of a Parallel Task on a Grid Node. In: Sloot, P.M.A., Hoekstra, A.G., Priol, T., Reinefeld, A., Bubak, M. (eds) Advances in Grid Computing - EGC 2005. EGC 2005. Lecture Notes in Computer Science, vol 3470. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508380_24

Download citation

DOI: https://doi.org/10.1007/11508380_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26918-2
Online ISBN: 978-3-540-32036-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics