Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2832581.2832612guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Modeling multi-attribute demand for sustainable cloud computing with copulae

Published: 25 July 2015 Publication History

Abstract

As cloud computing gains in popularity, understanding the patterns and structure of its loads is increasingly important in order to drive effective resource allocation, scheduling and pricing decisions. These efficiency increases are then associated with a reduction in the data center environmental footprint. Existing models have only treated a single resource type, such as CPU, or memory, at a time. We offer a sophisticated machine learning approach to capture the joint-distribution. We capture the relationship among multiple resources by carefully fitting both the marginal distributions of each resource type as well as the non-linear structure of their correlation via a copula distribution. We investigate several choices for both models by studying a public data set of Google datacenter usage. We show the Burr XII distribution to be a particularly effective choice for modeling the marginals and the Frank copula to be the best choice for stitching these together into a joint distribution. Our approach offers a significant fidelity improvement and generalizes directly to higher dimensions. In use, this improvement will translate directly to reductions in energy consumption.

References

[1]
Ashok K Agrawala, JM Mohr, and RM Bryant. An approach to the workload characterization problem. Computer, 9(6):18-32, 1976.
[2]
A. Beloglazov and R. Buyya. Energy efficient allocation of virtual machines in cloud data centers. In Proceedings of 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid 2010), pages 577-578, May 2010.
[3]
Anton Beloglazov, Jemal Abawajy, and Rajkumar Buyya. Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Future Generation Computer Systems, 28(5):755-768, 2012. Special Section: Energy efficiency in large-scale distributed systems.
[4]
Eike Christian Brechmann and Ulf Schepsmeier. Modeling dependence with C-and D-vine copulas: The R-package CDVine. Journal of Statistical Software, 52(3):1-27, 2013.
[5]
Irving W. Burr. Cumulative frequency functions. The Annals of Mathematical Statistics, 13(2):215- 232, 1942.
[6]
Irving W. Burr. On a general system of distributions: Iii. the sample range. Journal of the American Statistical Association, 63(322):636-643, 1968.
[7]
Su-Hui Chiang and Mary K Vernon. Characteristics of a large shared memory production workload. In Proceedings of the 7th International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP-01), pages 159-187, June 2001.
[8]
A. Clauset, C. Shalizi, and M. Newman. Power-law distributions in empirical data. SIAM Review, 51(4):661-703, 2009.
[9]
Ralph B D'Agostino. Goodness-of-Fit Techniques, volume 68. CRC press, 1986.
[10]
U.S. energy information administration frequently asked questions. U.S. EIA, 2015. http://www.eia.gov/tools/faqs/faq.cfm?id=97&t=3.
[11]
Marisabel Guevara, Benjamin Lubin, and Benjamin C. Lee. Market mechanisms for managing datacenters with heterogeneous microarchitectures. ACM Transactions on Compututer Systems, 32(1):3:1-3:31, Feb 2014.
[12]
Mor Harchol-Balter and Allen B. Downey. Exploiting process lifetime distributions for dynamic load balancing. ACM Transactions Computer Systems, 15(3):253-285, Aug 1997.
[13]
Will Leland and Teunis J. Ott. Load-balancing heuristics and process behavior. SIGMETRICS Performance Evaluation Revue, 14(1):54-69, May 1986.
[14]
Hui Li, David Groep, and Lex Wolters. Workload characteristics of a multi-cluster supercomputer. In Proceedings Of the 10th International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP-04), pages 176-193, June 2004.
[15]
Benjamin Lubin, Jeffrey O Kephart, Rajarshi Das, and David C Parkes. Expressive power-based resource allocation for data centers. In Proceedings Of the 21st International Joint Conference on Artificial Intelligence (IJCAI-09), pages 1451-1456, 2009.
[16]
America's data center efficiency assessment. National Resources Defense Council, 2013. http://www.nrdc.org/energy/data-center-efficiency-assessment.asp.
[17]
Eric W. Parsons and Kenneth C. Sevcik. Coordinated allocation of memory and processors in multiprocessors. SIGMETRICS Performance Evaluation Revue, 24(1):57-67, May 1996.
[18]
Charles Reiss, John Wilkes, and Joseph L Hellerstein. Google cluster-usage traces: Format + schema. Google Inc., White Paper, 2011.
[19]
Robert F. Rosin. Determining a computing center environment. Communications of the ACM, 8(7):463-468, Jul 1965.
[20]
A. Sklar. Fonctions de répartition à n dimensions et leurs marges, volume 8. Publications de l'Institut de Statistique de L'Université de Paris, 1959.
[21]
Pandu R. Tadikamalla. A look at the burr and related distributions. International Statistical Review, 48(3):337-344, 1980.
[22]
Pravin K Trivedi and David M Zimmer. Copula Modeling: an Introduction for Practitioners. Now Publishers Inc, 2007.
[23]
Edward S. Walter and Victor L. Wallace. Further analysis of a computing center environment. Communications of the ACM, 10(5):266-272, May 1967.
[24]
Jun Yan. Enjoy the joy of copulas: with a package copula. Journal of Statistical Software, 21(4):1-21, 2007.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
IJCAI'15: Proceedings of the 24th International Conference on Artificial Intelligence
July 2015
4429 pages
ISBN:9781577357384

Sponsors

  • The International Joint Conferences on Artificial Intelligence, Inc. (IJCAI)

Publisher

AAAI Press

Publication History

Published: 25 July 2015

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Oct 2024

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media