Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A Sensor System for High-Fidelity Temperature Distribution Forecasting in Data Centers

Published: 15 December 2014 Publication History

Abstract

Data centers have become a critical computing infrastructure in the era of cloud computing. Temperature monitoring and forecasting are essential for preventing server shutdowns because of overheating and improving a data center’s energy efficiency. This article presents a novel cyber-physical approach for temperature forecasting in data centers, one that integrates Computational Fluid Dynamics (CFD) modeling, in situ wireless sensing, and real-time data-driven prediction. To ensure forecasting fidelity, we leverage the realistic physical thermodynamic models of CFD to generate transient temperature distribution and calibrate it using sensor feedback. Both simulated temperature distribution and sensor measurements are then used to train a real-time prediction algorithm. As a result, our approach reduces not only the computational complexity of online temperature modeling and prediction, but also the number of deployed sensors, which enables a portable, noninvasive thermal monitoring solution that does not rely on the infrastructure of a monitored data center. We extensively evaluated the proposed system on a rack of 15 servers and a testbed of five racks and 229 servers in a small-scale production data center. Our results show that our system can predict the temperature evolution of servers with highly dynamic workloads at an average error of 0.52○C, within a duration up to 10 minutes. Moreover, our approach can reduce the required number of sensors by 67% while maintaining desirable prediction fidelity.

References

[1]
Active Power, Inc. 2007. Data center thermal runaway: A review of cooling challenges in high density mission critical environments. http://new.activepower.com/documents/white_papers/.
[2]
Aperture Research Institute. 2007. Data center professionals turn to high-density computing as major boom continues. http://www.emersonnetworkpower.com/en-EMEA/Brands/Aperture/ApertureResearch Institute/Research/Documents/ari_high_density_4_01_07.pdf.
[3]
ASHRAE Technical Committee 9.9. 2011. 2011 thermal guidelines for data processing environments—Expanded data center classes and usage guidance. http://ecoinfo.cnrs.fr/IMG/pdf/ashrae_2011_thermal_ guidelines_data_center.pdf.
[4]
Cullen Bash and George Forman. 2007. Cool job alloction: Measuring the power savings of placing jobs at cooling-efficient locations in the data center. In Proceedings of USENIX Annual Technical Conference (USENIX). 1--6.
[5]
Cullen E. Bash, Chandrakant D. Patel, and Ratnesh K. Sharma. 2006. Dynamic thermal management of air cooled data centers. In Proceedings of the 10th Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronics Systems (ITherm).
[6]
Geoffrey C. Bell. 2012. Improving data center efficiency with rack or row cooling devices: Results of “Chill-Off 2” comparative testing. Federal Energy Management Program. http://datacenters.lbl.gov/sites/all/files/dc_chilloff2.pdf.
[7]
Susmit Biswas, Mohit Tiwari, Timothy Sherwood, Luke Theogarajan, and Frederic T. Chong. 2011. Fighting fire with fire: Modeling the datacenter-scale effects of targeted superlattice thermal management. In Proceedings of the 38th International Symposium on Computer Architecture. 331--340.
[8]
Jinzhu Chen, Rui Tan, Guoliang Xing, and Xiaorui Wang. 2014. PTEC: A system for predictive thermal and energy control in data centers. In Proceedings of the 35th IEEE Real-Time Systems Symposium (RTSS). 218--227.
[9]
Jeonghwan Choi, Youngjae Kim An, Jelena Srebric, Qian Wang, and Joonwon Lee. 2007. Modeling and managing thermal profiles of rack-mounted servers with thermostat. In Proceedings of the 13th International Symposium on High-Performance Computer Architecture. 205--215.
[10]
Supasate Choochaisri, Vit Niennattrakul, Saran Jenjaturong, Chalermek Intanagonwiwat, and Chotirat Ann Ratanamahatana. 2010. SENVM: Server environment monitoring and controlling system for small data center using wireless sensor network. http://arxiv.org/pdf/1105.6160.pdf.
[11]
Degree Controls, Inc. 2011. F333 Airflow Sensor User Guide.
[12]
Nosayba El-Sayed, Ioan Stefanovici, George Amvrosiadis, Andy A. Hwang, and Bianca Schroeder. 2012. Temperature management in data centers: Why some (might) like it hot. In Proceedings of the 12th ACM Sigmetrics/Performace Joint International Conference on Measurement and Modeling of Computer Systems. 163--174.
[13]
Emerson Network Power. 2011. State of the Data Center. Retrieved from http://www.emersonnetworkpower.com/.
[14]
Taliver Heath, Ana Paula Centeno, Pradeep George, Luiz Ramos, Yogesh Jaluria, and Ricardo Bianchini. 2006. Mercury and freon: Temperature emulation and management for server systems. In Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). 106--116.
[15]
Arther E. Hoerl and Robert W. Kennard. 1970. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 12, 1 (1970), 55--67.
[16]
Michael Jonas, Rose Robin Gilbert, Joshua Ferguson, Georgios Varsamopoulos, and Sandeep Gupta. 2012. A transient model for data center thermal prediction. In Proceedings of the 3rd International Green Computing Conference (IGCC). 1--10.
[17]
Jon Lenchner, Canturk Isci, Jeffrey Kephart, Christopher Mansley, Jonathan Connell, and Suzanne McIntosh. 2011. Toward data center self-diagnosis using a mobile robot. In Proceedings of the 8th IEEE International Conference on Autonomic Computing (ICAC). 81--90.
[18]
Philip Levis, Sam Madden, Joseph Polastre, Robert Szewczyk, Kamin Whitehouse, Alec Woo, David Gay, Jason Hill, Matt Welsh, Eric Brewer, and David Culler. 2005. TinyOS: An operating system for sensor networks. Ambient Intelligence (2005), 115--148.
[19]
Lei Li, Chieh-Jan Mike Liang, Jie Liu, Suman Nath, Andreas Terzis, and Christos Faloutsos. 2011. ThermoCast: A cyber-physical forecasting model for data centers. In Proceedings of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD). 1370--1378.
[20]
Chieh Jan Mike Liang, Jie Liu, Liqian Luo, Andreas Terzis, and Feng Zhao. 2009. RACNet: A high-fidelity data center sensing network. In Proceedings of the 7th ACM Conference on Embedded Networked Sensor Systems (SenSys). 15--28.
[21]
Chris Mansley, Jonathan Connell, Canturk Isci, Jonathan Lenchner, Jeffrey O. Kephart, Suzanne McIntosh, and Michael Schappert. 2011. Robotic mapping and monitoring of data centers. In Proceedings of IEEE International Conference on Robotics and Automation (ICRA). 5905--5910.
[22]
Memsic Corp. 2012. TelosB, Iris datasheets. http://www.memsic.com/wireless-sensor-networks/.
[23]
Justin Moore, Jeff Chasey, and Parthasarathy Ranganathanz. 2006. Weatherman: Automated, online, and predictive thermal mapping and management. In Proceedings of the 3rd IEEE International Conference on Autonomic Computing (ICAC). 155--164.
[24]
Justin Moore, Jeff Chasey, Parthasarathy Ranganathanz, and Ratnesh Sharmaz. 2005. Making scheduling “Cool”: Temperature-aware workload placement in data centers. In Proceedings of the USENIX Annual Technical Conference (USENIX). 5.
[25]
John Niemann. 2006. Best practices for designing data centers with the InfraStruXure InRow RC. Application note of American Power Conversion. http://www.apcmedia.com/salestools/JNIN-6N7SRZ/JNIN- 6N7SRZ_R0_EN.pdf?sdirect=true.
[26]
Luiz Ramos and Ricardo Bianchini. 2008. C-Oracle: Predictive thermal management for data centers. In IEEE 17th International Symposium on High Performance Computer Architecture (HPCA). 111--122.
[27]
Neil Rasmussen. 2011. Cooling options for rack equipment with side-to-side airflow. http://www.apcmedia.com/salestools/SADE-5TNRKJ/SADE-5TNRKJ_R1_EN.pdf?sdirect=true.
[28]
Umesh Singh, Amarendra K. Singh, S. Parvez, and Anand Sivasubramaniam. 2010. CFD-based operational thermal efficiency improvement of a production data center. In Proceedings of the 1st USENIX Workshop on Sustainable Information Technology (SustainIT). 6.
[29]
Alan Stuart, Keith Ord, and Steven Arnold. 2009. Kendall’s Advanced Theory of Statistics: Classical Inference and the Linear Model (6 ed.). John Wiley & Sons.
[30]
Qinghui Tang, Sandeep K. S. Gupta, and Georgios Varsamopoulos. 2008. Energy-efficient thermal-aware task scheduling for homogeneous high-performance computing data centers: A cyber-physical approach. In IEEE Transactions on Parallel and Distributed Systems, 19: 1458--1472.
[31]
Qinghui Tang, Tridib Mukherjee, Sandeep K. S. Gupta, and Phil Cayton. 2006. Sensor-based fast thermal evaluation model for energy efficient high-performance datacenters. In Proceedings of the 4th International Conference on Intelligent Sensing and Information Processing (ICISIP). 203--208.
[32]
U.S. Environmental Protection Agency. 2007. Report to Congress on Server and Data Center Energy Efficiency. http://hightech.lbl.gov/documents/data_centers/epa-datacenters.pdf.
[33]
L. J. P. Van der Maaten, E. O. Postma, and H. J. Van Den Herik. 2009. Dimensionality reduction: A comparative review. Journal of Machine Learning Research 10 (2009), 1--41.
[34]
P. K. Varshney. 1996. Distributed Detection and Data Fusion. Springer.
[35]
Xiaodong Wang, Xiaorui Wang, Guoliang Xing, Jinzhu Chen, Cheng-Xian Lin, and Yixin Chen. 2011. Towards optimal sensor placement for hot server detection in data centers. In Proceedings of the 31st International Conference on Distributed Computing Systems (ICDCS). 899--908.
[36]
John F. Wendt (Ed.). 1995. Computational Fluid Dynamics—An Introduction (3rd ed.). Springer.
[37]
WikiMedia Foundation. 2010. Global outage (cooling failure and DNS). http://blog.wikimedia.org/2010/03/24/global-outage-cooling-failure-and-dns/.
[38]
Nong Ye (Ed.). 2003. The Handbook of Data Mining. Lawrence Erlbaum Associates.

Cited By

View all
  • (2023)Active Acoustic Sensing for “Hearing” Temperature Under Acoustic InterferenceIEEE Transactions on Mobile Computing10.1109/TMC.2021.309679222:2(661-673)Online publication date: 1-Feb-2023
  • (2021)A Mobile Robotic System for Data Center Thermal Environment Measurement and Reconstruction2021 China Automation Congress (CAC)10.1109/CAC53003.2021.9728460(2755-2760)Online publication date: 22-Oct-2021
  • (2019)A network clock model for time awareness in the Internet of things and artificial intelligence applicationsThe Journal of Supercomputing10.1007/s11227-019-02774-075:8(4309-4328)Online publication date: 1-Aug-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Sensor Networks
ACM Transactions on Sensor Networks  Volume 11, Issue 2
February 2015
563 pages
ISSN:1550-4859
EISSN:1550-4867
DOI:10.1145/2656931
  • Editor:
  • Chenyang Lu
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 15 December 2014
Accepted: 01 June 2014
Revised: 01 December 2013
Received: 01 February 2013
Published in TOSN Volume 11, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Data center
  2. computational fluid dynamics
  3. cyber-physical system
  4. temperature prediction
  5. wireless sensor network

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Active Acoustic Sensing for “Hearing” Temperature Under Acoustic InterferenceIEEE Transactions on Mobile Computing10.1109/TMC.2021.309679222:2(661-673)Online publication date: 1-Feb-2023
  • (2021)A Mobile Robotic System for Data Center Thermal Environment Measurement and Reconstruction2021 China Automation Congress (CAC)10.1109/CAC53003.2021.9728460(2755-2760)Online publication date: 22-Oct-2021
  • (2019)A network clock model for time awareness in the Internet of things and artificial intelligence applicationsThe Journal of Supercomputing10.1007/s11227-019-02774-075:8(4309-4328)Online publication date: 1-Aug-2019
  • (2018)Energy efficient quality of service aware virtual machine migration in cloud computing2018 4th International Conference on Recent Advances in Information Technology (RAIT)10.1109/RAIT.2018.8389047(1-6)Online publication date: Mar-2018
  • (2018)SDN-based hybrid server and link load balancing in multipath distributed storage systemsNOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium10.1109/NOMS.2018.8406286(1-6)Online publication date: 23-Apr-2018
  • (2018)Low Rank Representation on SPD matrices with Log-Euclidean metricPattern Recognition10.1016/j.patcog.2017.07.00976:C(623-634)Online publication date: 1-Apr-2018
  • (2017)Dynamic Social-Aware Peer Selection Scheme for Cooperative Device-to-Device Communications2017 IEEE Wireless Communications and Networking Conference (WCNC)10.1109/WCNC.2017.7925733(1-6)Online publication date: Mar-2017
  • (2017)Sparse learning based fuzzy c-means clusteringKnowledge-Based Systems10.1016/j.knosys.2016.12.006119:C(113-125)Online publication date: 1-Mar-2017
  • (2016)A survey of transfer learning for collaborative recommendation with auxiliary dataNeurocomputing10.1016/j.neucom.2015.11.059177:C(447-453)Online publication date: 12-Feb-2016
  • (2016)BibliographyDeploying Wireless Sensor Networks10.1016/B978-1-78548-099-7.50011-8(121-137)Online publication date: 2016
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media