Abstract
The mapping problem has been studied extensively and many algorithms have been proposed. However, unrealistic assumptions have made the practicality of those algorithms doubtful. One of these assumptions is the ability to precisely calculate the execution time of a task to be mapped on a node before the actual execution. Since the theoretical calculation of task execution time is impossible in real environments, an estimation methodology is needed. In this paper, a practical method to estimate the execution time of a parallel task to be mapped on a grid node is proposed. It is not necessary to know the internal design and algorithm of the application in order to apply this method. The estimation is based upon past observations of the task executions. The estimating technique is a k-nearest-neighbours algorithm (knn). A backward predictor elimination, leave-one-out cross validation, and a statistical technique are used to derive the relevant parameters to be used by knn. Experimental results show that on average the proposed method can produce 2.3 times the number of accurate estimated execution times (with errors less than 25%) greater than the existing method.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Foster, I., Kesselman, C.: The Grid: Blueprint for Future Computing Infrastructure. Morgan Kaufmann, San Francisco (1998)
Bokhari, S.: On the mapping problem. IEEE Transaction on Computers C-30, 207–214 (1981)
Dail, H., Berman, F., Casanova, H.: A decoupled scheduling approach for grid application development environments. Journal of Parallel and Distributed Computing 63, 505–524 (2003)
Zhang, W., Fang, B., He, H., Zhang, H., Hu, M.: Multisite resource selection and scheduling algorithm on computational grid. In: 18th International Parallel and Distributed Processing Symposium (IPDPS), pp. 105–115 (2004)
Atkeson, C., Schaal, S., Moore, A.: Locally weighted learning. AI Reviews 11, 11–73 (1997)
Iverson, M., Ozguner, F., Potter, L.: Statistical prediction of task execution times through analytic benchmarking for scheduling in a heterogeneous environment. IEEE Transaction on Computers 48, 35–44 (1999)
Walpole, R.: Introduction to Statistics, 3rd edn. Collier Macmillan, Basingstoke (1982)
Grama, A., Gupta, A., Karypis, G., Kumar, V.: Introduction to Parallel Computing, 2nd edn. Pearson Education Limited, Essex (2003)
Varavithya, V., Uthayopas, P.: ThaiGrid: Architecture and overview. NECTEC Techniqcal Journal 2 (2000)
The PingER Project (2004), http://www-iepm.slac.stanford.edu/pinger/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Phinjaroenphan, P., Bevinakoppa, S., Zeephongsekul, P. (2005). A Method for Estimating the Execution Time of a Parallel Task on a Grid Node. In: Sloot, P.M.A., Hoekstra, A.G., Priol, T., Reinefeld, A., Bubak, M. (eds) Advances in Grid Computing - EGC 2005. EGC 2005. Lecture Notes in Computer Science, vol 3470. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508380_24
Download citation
DOI: https://doi.org/10.1007/11508380_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26918-2
Online ISBN: 978-3-540-32036-4
eBook Packages: Computer ScienceComputer Science (R0)