Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Managing very large distributed data sets on a data grid

Published: 01 August 2010 Publication History
  • Get Citation Alerts
  • Abstract

    In this work we address the management of very large data sets, which need to be stored and processed across many computing sites. The motivation for our work is the ATLAS experiment for the Large Hadron Collider (LHC), where the authors have been involved in the development of the data management middleware. This middleware, called DQ2, has been used for the last several years by the ATLAS experiment for shipping petabytes of data to research centres and universities worldwide. We describe our experience in developing and deploying DQ2 on the Worldwide LHC computing Grid, a production Grid infrastructure formed of hundreds of computing sites. From this operational experience, we have identified an important degree of uncertainty that underlies the behaviour of large Grid infrastructures. This uncertainty is subjected to a detailed analysis, leading us to present novel modelling and simulation techniques for Data Grids. In addition, we discuss what we perceive as practical limits to the development of data distribution algorithms for Data Grids given the underlying infrastructure uncertainty, and propose future research directions. Copyright © 2009 John Wiley & Sons, Ltd.

    Cited By

    View all
    • (2015)Adding storage simulation capacities to the SimGrid toolkitProceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2015.134(251-260)Online publication date: 4-May-2015
    • (2011)A similarity measure for time, frequency, and dependencies in large-scale workloadsProceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/2063384.2063441(1-11)Online publication date: 12-Nov-2011
    • (2010)Identification, Modelling and Prediction of Non-periodic Bursts in WorkloadsProceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing10.1109/CCGRID.2010.118(485-494)Online publication date: 17-May-2010

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Concurrency and Computation: Practice & Experience
    Concurrency and Computation: Practice & Experience  Volume 22, Issue 11
    Grid Computing, High Performance and Distributed Application
    August 2010
    181 pages

    Publisher

    John Wiley and Sons Ltd.

    United Kingdom

    Publication History

    Published: 01 August 2010

    Author Tags

    1. data management
    2. distributed systems
    3. grid computing
    4. modelling
    5. simulation

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)Adding storage simulation capacities to the SimGrid toolkitProceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2015.134(251-260)Online publication date: 4-May-2015
    • (2011)A similarity measure for time, frequency, and dependencies in large-scale workloadsProceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/2063384.2063441(1-11)Online publication date: 12-Nov-2011
    • (2010)Identification, Modelling and Prediction of Non-periodic Bursts in WorkloadsProceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing10.1109/CCGRID.2010.118(485-494)Online publication date: 17-May-2010

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media