Modeling multi-attribute demand for sustainable cloud computing with copulae
Pages 2596 - 2602
Abstract
As cloud computing gains in popularity, understanding the patterns and structure of its loads is increasingly important in order to drive effective resource allocation, scheduling and pricing decisions. These efficiency increases are then associated with a reduction in the data center environmental footprint. Existing models have only treated a single resource type, such as CPU, or memory, at a time. We offer a sophisticated machine learning approach to capture the joint-distribution. We capture the relationship among multiple resources by carefully fitting both the marginal distributions of each resource type as well as the non-linear structure of their correlation via a copula distribution. We investigate several choices for both models by studying a public data set of Google datacenter usage. We show the Burr XII distribution to be a particularly effective choice for modeling the marginals and the Frank copula to be the best choice for stitching these together into a joint distribution. Our approach offers a significant fidelity improvement and generalizes directly to higher dimensions. In use, this improvement will translate directly to reductions in energy consumption.
References
[1]
Ashok K Agrawala, JM Mohr, and RM Bryant. An approach to the workload characterization problem. Computer, 9(6):18-32, 1976.
[2]
A. Beloglazov and R. Buyya. Energy efficient allocation of virtual machines in cloud data centers. In Proceedings of 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid 2010), pages 577-578, May 2010.
[3]
Anton Beloglazov, Jemal Abawajy, and Rajkumar Buyya. Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Future Generation Computer Systems, 28(5):755-768, 2012. Special Section: Energy efficiency in large-scale distributed systems.
[4]
Eike Christian Brechmann and Ulf Schepsmeier. Modeling dependence with C-and D-vine copulas: The R-package CDVine. Journal of Statistical Software, 52(3):1-27, 2013.
[5]
Irving W. Burr. Cumulative frequency functions. The Annals of Mathematical Statistics, 13(2):215- 232, 1942.
[6]
Irving W. Burr. On a general system of distributions: Iii. the sample range. Journal of the American Statistical Association, 63(322):636-643, 1968.
[7]
Su-Hui Chiang and Mary K Vernon. Characteristics of a large shared memory production workload. In Proceedings of the 7th International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP-01), pages 159-187, June 2001.
[8]
A. Clauset, C. Shalizi, and M. Newman. Power-law distributions in empirical data. SIAM Review, 51(4):661-703, 2009.
[9]
Ralph B D'Agostino. Goodness-of-Fit Techniques, volume 68. CRC press, 1986.
[10]
U.S. energy information administration frequently asked questions. U.S. EIA, 2015. http://www.eia.gov/tools/faqs/faq.cfm?id=97&t=3.
[11]
Marisabel Guevara, Benjamin Lubin, and Benjamin C. Lee. Market mechanisms for managing datacenters with heterogeneous microarchitectures. ACM Transactions on Compututer Systems, 32(1):3:1-3:31, Feb 2014.
[12]
Mor Harchol-Balter and Allen B. Downey. Exploiting process lifetime distributions for dynamic load balancing. ACM Transactions Computer Systems, 15(3):253-285, Aug 1997.
[13]
Will Leland and Teunis J. Ott. Load-balancing heuristics and process behavior. SIGMETRICS Performance Evaluation Revue, 14(1):54-69, May 1986.
[14]
Hui Li, David Groep, and Lex Wolters. Workload characteristics of a multi-cluster supercomputer. In Proceedings Of the 10th International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP-04), pages 176-193, June 2004.
[15]
Benjamin Lubin, Jeffrey O Kephart, Rajarshi Das, and David C Parkes. Expressive power-based resource allocation for data centers. In Proceedings Of the 21st International Joint Conference on Artificial Intelligence (IJCAI-09), pages 1451-1456, 2009.
[16]
America's data center efficiency assessment. National Resources Defense Council, 2013. http://www.nrdc.org/energy/data-center-efficiency-assessment.asp.
[17]
Eric W. Parsons and Kenneth C. Sevcik. Coordinated allocation of memory and processors in multiprocessors. SIGMETRICS Performance Evaluation Revue, 24(1):57-67, May 1996.
[18]
Charles Reiss, John Wilkes, and Joseph L Hellerstein. Google cluster-usage traces: Format + schema. Google Inc., White Paper, 2011.
[19]
Robert F. Rosin. Determining a computing center environment. Communications of the ACM, 8(7):463-468, Jul 1965.
[20]
A. Sklar. Fonctions de répartition à n dimensions et leurs marges, volume 8. Publications de l'Institut de Statistique de L'Université de Paris, 1959.
[21]
Pandu R. Tadikamalla. A look at the burr and related distributions. International Statistical Review, 48(3):337-344, 1980.
[22]
Pravin K Trivedi and David M Zimmer. Copula Modeling: an Introduction for Practitioners. Now Publishers Inc, 2007.
[23]
Edward S. Walter and Victor L. Wallace. Further analysis of a computing center environment. Communications of the ACM, 10(5):266-272, May 1967.
[24]
Jun Yan. Enjoy the joy of copulas: with a package copula. Journal of Statistical Software, 21(4):1-21, 2007.
Recommendations
A Semi-structured Overlay for Multi-attribute Range Queries in Cloud Computing
CSE '10: Proceedings of the 2010 13th IEEE International Conference on Computational Science and EngineeringIn recent years, Cloud computing is considered the next-generation computing paradigm. Cloud services could be broadly classified as software as a service (SaaS), platform as a service (PaaS) and infrastructure as a service (IaaS). However, most cloud ...
Cloud Computing Security: Amazon Web Service
ACCT '15: Proceedings of the 2015 Fifth International Conference on Advanced Computing & Communication TechnologiesCloud Computing is a recently emerged model which is becoming popular among almost all enterprises. It involves the concept of on demand services which means using the cloud resources on demand and we can scale the resources as per demand. Cloud ...
Comments
Information & Contributors
Information
Published In
Sponsors
- The International Joint Conferences on Artificial Intelligence, Inc. (IJCAI)
Publisher
AAAI Press
Publication History
Published: 25 July 2015
Qualifiers
- Article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Reflects downloads up to 10 Oct 2024