Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3267809.3267820acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article
Open access

Fast and Accurate Load Balancing for Geo-Distributed Storage Systems

Published: 11 October 2018 Publication History
  • Get Citation Alerts
  • Abstract

    The increasing density of globally distributed datacenters reduces the network latency between neighboring datacenters and allows replicated services deployed across neighboring locations to share workload when necessary, without violating strict Service Level Objectives (SLOs).
    We present Kurma, a practical implementation of a fast and accurate load balancer for geo-distributed storage systems. At run-time, Kurma integrates network latency and service time distributions to accurately estimate the rate of SLO violations for requests redirected across geo-distributed datacenters. Using these estimates, Kurma solves a decentralized rate-based performance model enabling fast load balancing (in the order of seconds) while taming global SLO violations. We integrate Kurma with Cassandra, a popular storage system. Using real-world traces along with a geo-distributed deployment across Amazon EC2, we demonstrate Kurma's ability to effectively share load among datacenters while reducing SLO violations by up to a factor of 3 in high load settings or reducing the cost of running the service by up to 17%.

    References

    [1]
    Amazon EC2. http://aws.amazon.com/ec2/Accessed 28-Aug-2018.
    [2]
    The Internet Traffic Archive. http://ita.ee.lbl.gov/html/contrib/WorldCup.html Accessed 11-Feb-2018.
    [3]
    Adan, I., and Resing, J. Queueing theory. Eindhoven University of Technology Eindhoven, 2002.
    [4]
    Agarwal, S., Dunagan, J., Jain, N., Saroiu, S., Wolman, A., and Bhogan, H. Volley: Automated Data Placement for Geo-Distributed Cloud Services. In Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation (Berkeley, CA, USA, 2010), NSDI'10, USENIX Association, pp. 2--2.
    [5]
    Amazon. Amazon EC2 Dedicated Instances. https://aws.amazon.com/ec2/purchasing-options/dedicated-instances/ Accessed 28-Aug-2018.
    [6]
    Amazon. Amazon Elastic Block Store. https://aws.amazon.com/ebs/details/Accessed 28-Aug-2018.
    [7]
    Amazon. EC2 Auto Scaling. https://docs.aws.amazon.com/autoscaling/ec2/userguide/what-is-amazon-ec2-auto-scaling.html Accessed 01-Jun-2017.
    [8]
    Amazon. Target tracking scaling policies. http://goo.gl/fzjZ92 Accessed 01-Jun-2017.
    [9]
    Angel, S., Ballani, H., Karagiannis, T., O'Shea, G., and Thereska, E. End-to-end Performance Isolation Through Virtual Datacenters. In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14) (2014), pp. 233--248.
    [10]
    Apache. Cassandra. http://cassandra.apache.org/Accessed28-Aug-2018.
    [11]
    Apache. Cassandra, Dynamic Snitching. https://docs.datastax.com/en/cassandra/3.0/cassandra/architecture/archSnitchDynamic.html?hl=dynamic%2Csnitch Accessed 20-Jan-2018.
    [12]
    Ardagna, D., Casolari, S., Colajanni, M., and Panicucci, B. Dual time-scale distributed capacity allocation and load redirect algorithms for cloud systems. Journal of Parallel and Distributed Computing 72, 6 (2012), 796--808.
    [13]
    Ardekani, M. S., and Terry, D. B. A Self-Configurable Geo-Replicated Cloud Storage System. In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14) (2014), pp. 367--381.
    [14]
    Armbrust, M., Fox, A., Griffith, R., Joseph, A. D., Katz, R. H., Konwinski, A., Lee, G., Patterson, D. A., Rabkin, A., Stoica, I., et al. Above the Clouds: A Berkeley View of Cloud Computing. Tech. rep., Technical Report UCB/EECS-2009-28, EECS Department, University of California, Berkeley, 2009.
    [15]
    Atikoglu, B., Xu, Y., Frachtenberg, E., Jiang, S., and Paleczny, M. Workload Analysis of a Large-Scale Key-Value Store. In ACM SIGMETRICS Performance Evaluation Review (2012), vol. 40, ACM, pp. 53--64.
    [16]
    Barb Darrow. Amazon and Google Continue Cloud Arms Race With New Data Centers. 30-Sep-2016. Fortune.com. http://fortune.com/2016/09/30/amazon-google-add-data-centers/Accessed 01-May-2018.
    [17]
    Bessani, A., Correia, M., Quaresma, B., André, F., and Sousa, P. DepSky: Dependable and Secure Storage in a Cloud-of-Clouds. ACM Transactions on Storage (TOS) 9, 4 (2013), 12.
    [18]
    Bogdanov, K. Reducing Long Tail Latencies in Geo-Distributed Systems. Licentiate Thesis, KTH Royal Institute of Technology, 2016. ISBN: 978-91-7729-160-2 URN: urn:nbn:se:kth:diva-194729.
    [19]
    Bogdanov, K. Enabling Fast and Accurate Run-Time Decisions in Geo-Distributed Systems: Better Achieving Service Level Objectives. Doctoral Dissertation, KTH Royal Institute of Technology, 2018. Planned for the Fall 2018.
    [20]
    Bogdanov, K., Reda, W., Maguire Jr, G. Q., Kostić, D., and Canini, M. Kurma: Fast and Efficient Load Balancing for Geo-Distributed Storage Systems (Technical Report). Technical report, KTH Royal Institute of Technology, 2018. http://urn.kb.se/resolve?um=urn%3Anbn%3Ase%3Akth%3Adiva-222289.
    [21]
    Brutlag, J. Speed matters for Google web search. Google. June (2009). https://services.google.com/fh/files/blogs/googledelayexp.pdf Accessed 28-Aug-2018.
    [22]
    Bryant, R., Tumanov, A., Irzak, O., Scannell, A., Joshi, K., Hiltunen, M., Lagar-Cavilla, A., and De Lara, E. Kaleidoscope: Cloud Micro-Elasticity via VM state Coloring. In Proceedings of the sixth conference on Computer systems (2011), ACM, pp. 273--286.
    [23]
    Buyya, R., Ranjan, R., and Calheiros, R. N. Intercloud: Utility-Oriented Federation of Cloud Computing Environments for Scaling of Application Services. In International Conference on Algorithms and Architectures for Parallel Processing (2010), Springer, pp. 13--31.
    [24]
    Callahan, T., Allman, M., and Rabinovich, M. On modern DNS behavior and properties. ACM SIGCOMM Computer Communication Review 43, 3 (2013), 7--15.
    [25]
    Cardellini, V., Colajanni, M., and Yu, P. S. Geographic Load Balancing for Scalable Distributed Web Systems. In Modeling, Analysis and Simulation of Computer and Telecommunication Systems, 2000. Proceedings. 8th International Symposium on (2000), IEEE, pp. 20--27.
    [26]
    Cardellini, V., Colajanni, M., and Yu, P. S. Request Redirection Algorithms for Distributed Web Systems. IEEE Transactions on Parallel and Distributed Systems 14, 4 (2003), 355--368.
    [27]
    Chandra, A., Gong, W., and Shenoy, P. Dynamic Resource Allocation for Shared Data Centers Using Online Measurements. ACM SIGMETRICS Performance Evaluation Review 31, 1 (2003), 300--301.
    [28]
    Chandrasekaran, B., Smaragdakis, G., Berger, A., Luckie, M., and Ng, K.-C. A Server-to-Server View of the Internet. In Proceedings of the 11th International Conference on emerging Networking Experiments and Technologies (Heidelberg, Germany, December 2015), CoNEXT '15, ACM, p. 40.
    [29]
    Cockcroft, A., and Sheahan, D. Benchmarking Cassandra Scalability on AWS over a million writes per second. 1-Nov-2011. Netflix Technology Blog. https://goo.gl/Gtn2XH Accessed 28-Aug-2018.
    [30]
    Colajanni, M., Yu, P. S., and Cardellini, V. Dynamic Load Balancing in Geographically Distributed Heterogeneous Web Servers. In Distributed Computing Systems, 1998. Proceedings. 18th International Conference on (1998), IEEE, pp. 295--302.
    [31]
    Cooper, B. F., Silberstein, A., Tam, E., Ramakrishnan, R., and Sears, R. Benchmarking cloud serving systems with YCSB. In Proceedings of the 1st ACM symposium on Cloud computing (2010), ACM, pp. 143--154.
    [32]
    Cormode, G., Shkapenyuk, V., Srivastava, D., and Xu, B. Forward Decay: A Practical Time Decay Model for Streaming Systems. In Data Engineering, 2009. ICDE'09. IEEE 25th International Conference on (2009), IEEE, pp. 138--149.
    [33]
    Datastax. Apache Cassandra Drivers, https://academy.datastax.com/download-drivers Accessed 1-Oct-2017.
    [34]
    Dean, J., and Barroso, L. A. The Tail at Scale. Communications of the ACM 56, 2 (2013), 74--80.
    [35]
    Dilley, J., Maggs, B., Parikh, J., Prokop, H., Sitaraman, R., and Weihl, B. Globally Distributed Content Delivery. IEEE Internet Computing 6, 5 (2002), 50--58.
    [36]
    Eisenbud, D. E., Yi, C., Contavalli, C., Smith, C., Kononov, R., Mann-Hielscher, E., Cilingiroglu, A., Cheyney, B., Shang, W., and Hosein, J. D. Maglev: A Fast and Reliable Software Network Load Balancer. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16) (2016), USENIX Association, pp. 523--535.
    [37]
    Eugster, P. T., Guerraoui, R., Kermarrec, A.-M., and Massoulié, L. Epidemic Information Dissemination in Distributed Systems. Computer 37, 5 (2004), 60--67.
    [38]
    Gandhi, A., Harchol-Balter, M., Raghunathan, R., and Kozuch, M. A. Autoscale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers. ACM Transactions on Computer Systems (TOCS) 30, 4 (2012), 14.
    [39]
    Gandhi, R., Liu, H. H., Hu, Y. C., Lu, G., Padhye, J., Yuan, L., and Zhang, M. Duet: Cloud Scale Load Balancing with Hardware and Software. ACM SIGCOMM Computer Communication Review 44, 4 (2015), 27--38.
    [40]
    Gmach, D., Rolia, J., Cherkasova, L., and Kemper, A. Workload Analysis and Demand Prediction of Enterprise Data Center Applications. In Workload Characterization, 2007. IISWC 2007. IEEE 10th International Symposium on (2007), IEEE, pp. 171--180.
    [41]
    Gong, Z., Gu, X., and Wilkes, J. PRESS: PRedictive Elastic Resource Scaling for cloud systems. In 2010 International Conference on Network and Service Management (2010), IEEE, pp. 9--16.
    [42]
    Google. Google compute engine pricing. https://cloud.google.com/compute/pricing Accessed 28-Aug-2018.
    [43]
    Gray, W. D., and Boehm-Davis, D. A. Milliseconds matter: An introduction to microstrategies and to their use in describing and predicting interactive behavior. Journal of Experimental Psychology: Applied 6, 4 (2000), 322.
    [44]
    Guo, T., Shenoy, P., and Hacigümüs, H. H. Geoscale: Providing Geo-Elasticity in Distributed Clouds. In Cloud Engineering (IC2E), 2016 IEEE International Conference on (2016), IEEE, pp. 123--126.
    [45]
    Hajjat, M., Shankaranarayanan, P., Maltz, D., Rao, S., and Sripanidkulchai, K. Dynamic Request Splitting for Interactive Cloud Applications. IEEE Journal on Selected Areas in Communications 31, 12 (2013), 2722--2737.
    [46]
    Herodotou, H., Dong, F., and Babu, S. No One (Cluster) Size Fits All: Automatic Cluster Sizing for Data-intensive Analytics. In Proceedings of the 2nd ACM Symposium on Cloud Computing (2011), ACM, p. 18.
    [47]
    Høiland-Jørgensen, T., Ahlgren, B., Hurtig, P., and Brunstrom, A. Measuring Latency Variation in the Internet. In Proceedings of the 12th International on Conference on emerging Networking Experiments and Technologies (2016), ACM, pp. 473--480.
    [48]
    Ian Paul, P. Jackson's Death a Blow to the Internet. http://www.pcworld.com/article/167435/jacksodeathblowtointernet.html Accessed 28-Aug-2018.
    [49]
    Jung, J., Krishnamurthy, B., and Rabinovich, M. Flash crowds and denial of service attacks: Characterization and implications for CDNs and web sites. In Proceedings of the 11th international conference on World Wide Web (2002), ACM, pp. 293--304.
    [50]
    Kalantzis, C. Eventual Consistency != Hopeful Consistency. Talk at Cassandra Summit, 2013. https://www.youtube.com/watch?v=A6qzxHE3EU.
    [51]
    Kalyvianaki, E., Charalambous, T., and Hand, S. Self-adaptive and self-configured CPU resource provisioning for virtualized servers using Kalman filters. In Proceedings of the 6th international conference on Autonomic computing (2009), ACM, pp. 117--126.
    [52]
    Kanizo, Y., Raz, D., and Zlotnik, A. Efficient Use of Geographically Spread Cloud Resources. In Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on (2013), IEEE, pp. 450--457.
    [53]
    Kempe, D., Dobra, A., and Gehrke, J. Gossip-based computation of aggregate information. In Foundations of Computer Science, 2003. Proceedings. 44th Annual IEEE Symposium on (2003), IEEE, pp. 482--491.
    [54]
    Krioukov, A., Mohan, P., Alspaugh, S., Keys, L., Culler, D., and Katz, R. NapSAC: Design and Implementation of a Power-Proportional Web Cluster. ACM SIGCOMM computer communication review 41, 1 (2011), 102--108.
    [55]
    Lang, W., Shankar, S., Patel, J. M., and Kalhan, A. Towards Multi-Tenant Performance SLOs. IEEE Transactions on Knowledge and Data Engineering 26, 6 (2014), 1447--1463.
    [56]
    Li, C., Porto, D., Clement, A., Gehrke, J., Preguiça, N. M., and Rodrigues, R. Making Geo-Replicated Systems Fast as Possible, Consistent when Necessary. In OSDI (2012), vol. 12, pp. 265--278.
    [57]
    Lim, H. C., Babu, S., and Chase, J. S. Automated control for elastic storage. In Proceedings of the 7th international conference on Autonomic computing (2010), ACM, pp. 1--10.
    [58]
    Lin, M., Wierman, A., Andrew, L. L., and Thereska, E. Dynamic right-sizing for power-proportional data centers. IEEE/ACM Transactions on Networking (TON) 21, 5 (2013), 1378--1391.
    [59]
    Linden, G. Make Data Useful. http://goo.gl/DGKkzv. Slides for a talk for the course Data Mining (CS345) at Stanford University. Accessed 28-Aug-2018.
    [60]
    Liu, H. H., Viswanathan, R., Calder, M., Akella, A., Mahajan, R., Padhye, J., and Zhang, M. Efficiently Delivering Online Services over Integrated Infrastructure. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16) (2016), pp. 77--90.
    [61]
    Liu, Z., Lin, M., Wierman, A., Low, S. H., and Andrew, L. L. Greening Geographical Load Balancing. In Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems (2011), ACM, pp. 233--244.
    [62]
    Lloyd, W., Freedman, M. J., Kaminsky, M., and Andersen, D. G. Don't Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS. In Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles (2011), ACM, pp. 401--416.
    [63]
    Mace, J., Bodik, P., Fonseca, R., and Musuvathi, M. Retro: Targeted Resource Management in Multi-tenant Distributed Systems. In NSDI (2015), pp. 589--603.
    [64]
    Madhyastha, H. V., Isdal, T., Piatek, M., Dixon, C., Anderson, T., Krishnamurthy, A., and Venkataramani, A. iPlane: An Information Plane for Distributed Services. In Proceedings of the 7th symposium on Operating systems design and implementation (2006), USENIX Association, pp. 367--380.
    [65]
    Microsoft. Microsoft Azure - Linux Virtual Machines Pricing, https://azure.microsoft.com/en-us/pricing/details/virtual-machines/linux/Accessed 28-Aug-2018.
    [66]
    Mills, D. L. Internet time synchronization: The network time protocol. Communications, IEEE Transactions on 39, 10 (1991), 1482--1493.
    [67]
    Mitzenmacher, M. How Useful Is Old Information? IEEE Transactions on Parallel and Distributed Systems 11, 1 (2000), 6--20.
    [68]
    Nethercote, N., Stuckey, P. J., Becket, R., Brand, S., Duck, G. J., and Tack, G. MiniZinc: Towards a standard CP modelling language. In International Conference on Principles and Practice of Constraint Programming (2007), Springer, pp. 529--543.
    [69]
    Nguyen, H., Shen, Z., Gu, X., Subbiah, S., and Wilkes, J. AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service. In Proceedings of the 10th International Conference on Autonomic Computing (ICAC 13) (2013), pp. 69--82.
    [70]
    Novakovic, D., Vasic, N., Novakovic, S., Kostic, D., and Bianchini, R. Deepdive: Transparently Identifying and Managing Performance Interference in Virtualized Environments. In Proceedings of the 2013 USENIX Annual Technical Conference (2013), no. EPFL-CONF-185984.
    [71]
    Olteanu, V., Agache, A., Voinescu, A., and Raiciu, C. Stateless datacenter load-balancing with beamer. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI) (2018), vol. 18, pp. 125--139.
    [72]
    Padala, P., Hou, K.-Y., Shin, K. G., Zhu, X., Uysal, M., Wang, Z., Singhal, S., and Merchant, A. Automated Control of Multiple Virtualized Resources. In Proceedings of the 4th ACM European conference on Computer systems (2009), ACM, pp. 13--26.
    [73]
    Padala, P., Shin, K. G., Zhu, X., Uysal, M., Wang, Z., Singhal, S., Merchant, A., and Salem, K. Adaptive control of virtualized resources in utility computing environments. In ACM SIGOPS Operating Systems Review (2007), vol. 41, ACM, pp. 289--302.
    [74]
    Pang, J., Akella, A., Shaikh, A., Krishnamurthy, B., and Seshan, S. On the Responsiveness of DNS-based Network Control. In Proceedings of the 4th ACM SIGCOMM conference on Internet measurement (2004), ACM, pp. 21--26.
    [75]
    Patel, P., Bansal, D., Yuan, L., Murthy, A., Greenberg, A., Maltz, D. A., Kern, R., Kumar, H., Zikos, M., Wu, H., et al. Ananta: Cloud Scale Load Balancing. In ACM SIGCOMM Computer Communication Review (2013), vol. 43, ACM, pp. 207--218.
    [76]
    Pucha, H., Zhang, Y., Mao, Z. M., and Hu, Y. C. Understanding Network Delay Changes Caused by Routing Events. In ACM SIGMETRICS Performance Evaluation Review (2007), vol. 35, ACM, pp. 73--84.
    [77]
    Qureshi, A., Weber, R., Balakrishnan, H., Guttag, J., and Maggs, B. Cutting the electric bill for internet-scale systems. In ACM SIGCOMM computer communication review (2009), vol. 39, ACM, pp. 123--134.
    [78]
    Raghavan, B., Vishwanath, K., Ramabhadran, S., Yocum, K., and Snoeren, A. C. Cloud Control with Distributed Rate Limiting. ACM SIGCOMM Computer Communication Review 37, 4 (2007), 337--348.
    [79]
    Ranjan, S. Request redirection for dynamic content. In Content Delivery Networks. Springer, 2008, pp. 155--179.
    [80]
    Ranjan, S., Karrer, R., and Knightly, E. Wide Area Redirection of Dynamic Content by Internet Data Centers. In INFOCOM 2004. Twenty-third AnnualJoint Conference of the IEEE Computer and Communications Societies (2004), vol. 2, IEEE, pp. 816--826.
    [81]
    Rao, L., Liu, X., Xie, L., and Liu, W. Minimizing Electricity Cost: Optimization of Distributed Internet Data Centers in a Multi-Electricity-Market Environment. In INFOCOM, 2010 Proceedings IEEE (2010), IEEE, pp. 1--9.
    [82]
    Reda, W., and Bogdanov, K. L. Open Loop YCSB source code. https://github.com/kirillsc/ycsb/tree/openloop.
    [83]
    Roussopoulos, M., and Baker, M. Practical load balancing for content requests in peer-to-peer networks. Distributed Computing 18, 6 (2006), 421--434.
    [84]
    Schulte, C., Tack, G., and Lagerkvist, M. Z. Modeling and programming with gecode, 2010. http://www.gecode.org/doc-latest/MPG.pdf Accessed 17-Jan-2017.
    [85]
    Shankaranarayanan, P., Sivakumar, A., Rao, S., and Tawarmalani, M. Performance Sensitive Replication in Geo-Distributed Cloud Datastores. In 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (2014), IEEE, pp. 240--251.
    [86]
    Shen, Z., Subbiah, S., Gu, X., and Wilkes, J. CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems. In Proceedings of the 2nd ACM Symposium on Cloud Computing (2011), ACM, p. 5.
    [87]
    Souders, S. Velocity and the Bottom Line. http://radar.oreilly.com/2009/07/velocity-making-your-site-fast.html Accessed 1-May-2018.
    [88]
    Stanojevi, R., and Shorten, R. Fully Decentralized Emulation of Best-Effort and Processor Sharing Queues. ACM SIGMETRICS Performance Evaluation Review 36, 1 (2008), 383--394.
    [89]
    Suresh, P. L., Canini, M., Schmid, S., and Feldmann, A. C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection. In NSDI (2015), pp. 513--527.
    [90]
    Sverdlik, Y. Google to build and lease data centers in big cloud expansion. 22-apr-2016. Data Center Knowledge. http://goo.gl/hkxmXQ Accessed l-May-2018.
    [91]
    Tene, G., Iyengar, B., and Wolf, M. C4: The Continuously Concurrent Compacting Collector. ACM SIGPLAN Notices 46, 11 (2011), 79--88.
    [92]
    Terry, D. B., Prabhakaran, V., Kotla, R., Balakrishnan, M., Aguilera, M. K., and Abu-Libdeh, H. Consistency-Based Service Level Agreements for Cloud Storage. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (2013), ACM, pp. 309--324.
    [93]
    Thorsten von Eicken. Animoto's Facebook scale-up, 23-Apr-2008. Right Scale. http://goo.gl/UDNXS9 Accessed 28-Aug-2018.
    [94]
    Venkataramani, V., Amsden, Z., Bronson, N., Cabrera III, G., Chakka, P., Dimov, P., Ding, H., Ferris, J., Giardullo, A., Hoon, J., et al. TAO: How Facebook Serves the Social Graph. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (2012), ACM, pp. 791--792.
    [95]
    Vulimiri, A., Godfrey, P. B., Mittal, R., Sherry, J., Ratnasamy, S., and Shenker, S. Low Latency via Redundancy. In Proceedings of the ninth ACM conference on Emerging networking experiments and technologies (2013), ACM, pp. 283--294.
    [96]
    Wendell, P., Jiang, J. W., Freedman, M. J., and Rexford, J. Donar: Decentralized Server Selection for Cloud Services. ACM SIGCOMM Computer Communication Review 41, 4 (2011), 231--242.
    [97]
    Wu, Z., Butkiewicz, M., Perkins, D., Katz-Bassett, E., and Madhyastha, H. V. SPANStore: Cost-Effective Geo-Replicated Storage Spanning Multiple Cloud Services. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (2013), ACM, pp. 292--308.
    [98]
    Wu, Z., Yu, C., and Madhyastha, H. V. CosTLO: Cost-Effective Redundancy for Lower Latency Variance on Cloud Storage Services. In Proc. 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI) (2015).
    [99]
    Xu, W., Zhu, X., Singhal, S., and Wang, Z. Predictive Control for Dynamic Resource Allocation in Enterprise Data Centers. In Network Operations and Management Symposium, 2006. NOMS 2006. 10th IEEE/IFIP (2006), IEEE, pp. 115--126.
    [100]
    Zhang, Q., Cherkasova, L., and Smirni, E. A Regression-Based Analytic Model for Dynamic Resource Provisioning of Multi-Tier Applications. In Autonomic Computing, 2007. ICAC'07. Fourth International Conference on (2007), IEEE, pp. 27--27.
    [101]
    Zhou, Z., Liu, F., Xu, Y., Zou, R., Xu, H., Lui, I. C., and Jin, H. Carbon-Aware Load Balancing for Geo-Distributed Cloud Services. In 2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems (2013), IEEE, pp. 232--241.
    [102]
    Zhu, T., Kozuch, M. A., and Harchol-Balter, M. WorkloadCompactor: Reducing Datacenter Cost While Providing Tail Latency SLO Guarantees. In Proceedings of the 2017 Symposium on Cloud Computing (2017), ACM, pp. 598--610.
    [103]
    Zhu, X., Young, D., Watson, B. J., Wang, Z., Rolia, J., Singhal, S., Mckee, B., Hyser, C., Gmach, D., Gardner, R., et al. 1000 islands: an integrated approach to resource management for virtualized data centers. Cluster Computing 12, 1 (2009), 45--57.

    Cited By

    View all
    • (2024)TraceUpscaler: Upscaling Traces to Evaluate Systems at High LoadProceedings of the Nineteenth European Conference on Computer Systems10.1145/3627703.3629581(942-961)Online publication date: 22-Apr-2024
    • (2024)Multi-criteria Rank Based Geo-distributed Load Balancing for Cloud Computing2024 Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE)10.1109/ic-ETITE58242.2024.10493818(1-7)Online publication date: 22-Feb-2024
    • (2023)Toward Optimal Repair and Load Balance in Locally Repairable CodesProceedings of the 52nd International Conference on Parallel Processing10.1145/3605573.3605635(725-735)Online publication date: 7-Aug-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SoCC '18: Proceedings of the ACM Symposium on Cloud Computing
    October 2018
    546 pages
    ISBN:9781450360111
    DOI:10.1145/3267809
    Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 October 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Cloud Computing
    2. Distributed Systems
    3. Server Load Balancing
    4. Service Level Objectives
    5. Wide Area Networks

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    SoCC '18
    Sponsor:
    SoCC '18: ACM Symposium on Cloud Computing
    October 11 - 13, 2018
    CA, Carlsbad, USA

    Acceptance Rates

    Overall Acceptance Rate 169 of 722 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)151
    • Downloads (Last 6 weeks)21
    Reflects downloads up to 09 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)TraceUpscaler: Upscaling Traces to Evaluate Systems at High LoadProceedings of the Nineteenth European Conference on Computer Systems10.1145/3627703.3629581(942-961)Online publication date: 22-Apr-2024
    • (2024)Multi-criteria Rank Based Geo-distributed Load Balancing for Cloud Computing2024 Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE)10.1109/ic-ETITE58242.2024.10493818(1-7)Online publication date: 22-Feb-2024
    • (2023)Toward Optimal Repair and Load Balance in Locally Repairable CodesProceedings of the 52nd International Conference on Parallel Processing10.1145/3605573.3605635(725-735)Online publication date: 7-Aug-2023
    • (2023)Adaptive and Scalable Caching With Erasure Codes in Distributed Cloud-Edge Storage SystemsIEEE Transactions on Cloud Computing10.1109/TCC.2022.316866211:2(1840-1853)Online publication date: 1-Apr-2023
    • (2023)Architectural Vision for Quantum Computing in the Edge-Cloud Continuum2023 IEEE International Conference on Quantum Software (QSW)10.1109/QSW59989.2023.00021(88-103)Online publication date: Jul-2023
    • (2023)Industrial-Metadata Intelligent Service for Geo-Distributed File SystemIntelligent Industrial Internet Systems10.1007/978-981-99-5732-3_3(39-56)Online publication date: 21-Nov-2023
    • (2022)Optimal Caching for Low Latency in Distributed Coded Storage SystemsIEEE/ACM Transactions on Networking10.1109/TNET.2021.313321530:3(1132-1145)Online publication date: Jun-2022
    • (2022)Architecture of virtual edge data center with intelligent metadata service of a geo-distributed file systemJournal of Systems Architecture: the EUROMICRO Journal10.1016/j.sysarc.2022.102545128:COnline publication date: 1-Jul-2022
    • (2021)GeoCol: A Geo-distributed Cloud Storage System with Low Cost and Latency using Reinforcement Learning2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS51616.2021.00023(149-159)Online publication date: Jul-2021
    • (2020)Path persistence in the cloudACM SIGCOMM Computer Communication Review10.1145/3402413.340241650:2(11-23)Online publication date: 25-May-2020
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media