Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
extended-abstract

Effective Straggler Mitigation: Which Clones Should Attack and When?

Published: 11 October 2017 Publication History
  • Get Citation Alerts
  • First page of PDF

    References

    [1]
    Jeffrey Dean and Luiz André Barroso. The tail at scale. Communications of the ACM, 56(2):74--80, 2013.
    [2]
    Ganesh Ananthanarayanan, Ali Ghodsi, Scott Shenker, and Ion Stoica. Effective straggler mitigation: Attack of the clones. In NSDI, volume 13, pages 185--198, 2013.
    [3]
    Jeffrey Dean and Sanjay Ghemawat. Mapreduce: simplified data processing on large clusters. Communications of the ACM, 51(1):107--113, 2008.
    [4]
    Da Wang, Gauri Joshi, and Gregory Wornell. Using straggler replication to reduce latency in large-scale parallel computing. ACM SIGMETRICS Performance Evaluation Review, 43(3):7--11, 2015.
    [5]
    Gauri Joshi, Emina Soljanin, and Gregory Wornell. Queues with redundancy: Latency-cost analysis. ACM SIGMETRICS Performance Evaluation Review, 43(2):54--56, 2015.
    [6]
    Sanghamitra Dutta, Viveck Cadambe, and Pulkit Grover. Short-dot: Computing large linear transforms distributedly using coded short dot products. In Advances In Neural Information Processing Systems, pages 2092--2100, 2016.
    [7]
    Charles Reiss, Alexey Tumanov, Gregory R Ganger, Randy H Katz, and Michael A Kozuch. Towards understanding heterogeneous clouds at scale: Google trace analysis. Intel Science and Technology Center for Cloud Computing, Tech. Rep, page 84, 2012.

    Cited By

    View all
    • (2023) Folded Polynomial Codes for Coded Distributed AA ⊤ -Type Matrix Multiplication IEEE Transactions on Communications10.1109/TCOMM.2023.328642071:9(5051-5064)Online publication date: Sep-2023
    • (2023)Efficient straggler task management in cloud environment using stochastic gradient descent with momentum learning-driven neural networksCluster Computing10.1007/s10586-023-04191-8Online publication date: 6-Dec-2023
    • (2022)Scalable Load Balancing in Networked Systems: A Survey of Recent AdvancesSIAM Review10.1137/20M132374664:3(554-622)Online publication date: 4-Aug-2022
    • Show More Cited By

    Index Terms

    1. Effective Straggler Mitigation: Which Clones Should Attack and When?
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM SIGMETRICS Performance Evaluation Review
        ACM SIGMETRICS Performance Evaluation Review  Volume 45, Issue 2
        Setember 2017
        131 pages
        ISSN:0163-5999
        DOI:10.1145/3152042
        Issue’s Table of Contents

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 11 October 2017
        Published in SIGMETRICS Volume 45, Issue 2

        Check for updates

        Qualifiers

        • Extended-abstract

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)16
        • Downloads (Last 6 weeks)0

        Other Metrics

        Citations

        Cited By

        View all
        • (2023) Folded Polynomial Codes for Coded Distributed AA ⊤ -Type Matrix Multiplication IEEE Transactions on Communications10.1109/TCOMM.2023.328642071:9(5051-5064)Online publication date: Sep-2023
        • (2023)Efficient straggler task management in cloud environment using stochastic gradient descent with momentum learning-driven neural networksCluster Computing10.1007/s10586-023-04191-8Online publication date: 6-Dec-2023
        • (2022)Scalable Load Balancing in Networked Systems: A Survey of Recent AdvancesSIAM Review10.1137/20M132374664:3(554-622)Online publication date: 4-Aug-2022
        • (2022)Diversity/Parallelism Trade-Off in Distributed Systems With RedundancyIEEE Transactions on Information Theory10.1109/TIT.2021.312792068:2(1279-1295)Online publication date: 1-Feb-2022
        • (2022)Soft BIBD and Product Gradient CodesIEEE Journal on Selected Areas in Information Theory10.1109/JSAIT.2022.31829433:2(229-240)Online publication date: Jun-2022
        • (2022)RCS: A Redirection Computational Scheduler to Accelerate Straggler Recovery for Erasure Coded Cloud Storage System2022 IEEE 40th International Conference on Computer Design (ICCD)10.1109/ICCD56317.2022.00104(681-684)Online publication date: Oct-2022
        • (2022)EP4DDL: addressing straggler problem in heterogeneous distributed deep learningThe Journal of Supercomputing10.1007/s11227-022-04466-878:13(15663-15680)Online publication date: 21-Apr-2022
        • (2021)Private and rateless adaptive coded matrix-vector multiplicationEURASIP Journal on Wireless Communications and Networking10.1186/s13638-020-01887-y2021:1Online publication date: 22-Jan-2021
        • (2021)START: Straggler Prediction and Mitigation for Cloud Computing Environments using Encoder LSTM NetworksIEEE Transactions on Services Computing10.1109/TSC.2021.3129897(1-1)Online publication date: 2021
        • (2021)Optimal Incentive and Load Design for Distributed Coded Machine LearningIEEE Journal on Selected Areas in Communications10.1109/JSAC.2021.307849439:7(2090-2104)Online publication date: Jul-2021
        • Show More Cited By

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media