Abstract
This paper focuses on the simultaneous scheduling of computation and data replication for life science applications on the grid. We present an adaptive algorithm, based on the SRA algorithm (Static Joint Replication and Scheduling) [4], that handles changes in job frequencies more dynamically. A linear program yields both a mapping of the databases onto the nodes and a job distribution scheme, which ensures that our data placement and job distribution stay close to the optimal solution as long as the information about job frequencies is accurate. We validate our results by simulating large job submissions on a realistic platform.
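To make the role of the linear program more concrete, the block below sketches one possible shape for a joint placement and distribution problem of this kind. It is an illustrative assumption, not the formulation used in the paper or in [4]: every symbol is hypothetical, namely δ_{ij} (1 if database j is stored on node i), n_{ij} (jobs using database j run on node i per time unit), s_j (database size), M_i (node storage), c_j (compute cost of one job), W_i (node compute capacity), f_j (frequency of jobs using database j), and TP (overall throughput).

```latex
\begin{align*}
\text{maximize}\quad   & TP \\
\text{subject to}\quad & \sum_{j} \delta_{ij}\, s_j \le M_i
                         && \forall i \quad \text{(storage on node } i\text{)} \\
                       & \sum_{j} n_{ij}\, c_j \le W_i
                         && \forall i \quad \text{(compute on node } i\text{)} \\
                       & n_{ij} \le \delta_{ij}\, \frac{W_i}{c_j}
                         && \forall i, j \quad \text{(jobs run only where the database is stored)} \\
                       & \sum_{i} n_{ij} = f_j \, TP
                         && \forall j \quad \text{(job mix follows the frequencies)} \\
                       & \delta_{ij} \in \{0, 1\}, \quad n_{ij} \ge 0
                         && \forall i, j
\end{align*}
```

With binary δ_{ij} this is a mixed-integer program; relaxing δ_{ij} to the interval [0, 1] gives a plain linear program. In this sketch the frequencies f_j appear directly in the constraints, which suggests why a solution of this type can only be near-optimal when the frequency estimates are accurate.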
References
1. Bolze, R., Cappello, F., Caron, E., Daydé, M., Desprez, F., Jeannot, E., Jégou, Y., Lanteri, S., Leduc, J., Melab, N., Mornet, G., Namyst, R., Primet, P., Quetier, B., Richard, O., Talbi, E.-G., Touché, I.: Grid 5000: A Large Scale and Highly Reconfigurable Experimental Grid Testbed. International Journal of High Performance Computing Applications 20(4), 481–494 (2006)
2. Cameron, D.G., Carvajal-Schiaffino, R., Millar, A.P., Nicholson, C., Stockinger, K., Zini, F.: Evaluating scheduling and replica optimisation strategies in OptorSim. In: Proc. Fourth International Workshop on Grid Computing, pp. 52–59 (2003)
3. Caron, E., Desprez, F.: Diet: A Scalable Toolbox to Build Network Enabled Servers on the Grid. International Journal of High Performance Computing Applications 20(3), 335 (2006)
4. Desprez, F., Vernois, A.: Simultaneous Scheduling of Replication and Computation for Data-Intensive Applications on the Grid. J. of Grid Computing 4(1), 19–31 (2006)
5. Donno, F., Gaido, L., Ghiselli, A., Prelz, F., Sgaravatto, M.: Datagrid prototype 1. In: TERENA Networking conference (June 2002)
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Caron, E., Desprez, F., Le Mahec, G. (2008). Parallelization and Distribution Strategies of Large Bioinformatics Requests over the Grid. In: Bourgeois, A.G., Zheng, S.Q. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2008. Lecture Notes in Computer Science, vol 5022. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69501-1_26
DOI: https://doi.org/10.1007/978-3-540-69501-1_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69500-4
Online ISBN: 978-3-540-69501-1
eBook Packages: Computer Science, Computer Science (R0)