Abstract
Performance and power consumption are two important design objectives for data centers consisting of thousands or tens of thousands of disks (or disk arrays). To leverage the two objectives, in this study we propose a multi-version disk array (MDA). The main idea of MDA is to exploit the I/O workload characteristics to guide the replication strategy by replicating multiple versions of the popular data blocks and simply offloading the write data to the free space of the reserved version region, thus achieving high performance in the burst period and low power consumption in the idle period. Our prototype implementation of MDA and the performance evaluations show that the performance of MDA outperforms that of traditional RAID10 by up to 34.4% and 42.3% in terms of the average response time for the online transaction processing (OLTP) application I/O and search engine I/O, respectively. Moreover, the energy efficiency of MDA outperforms that of RAID10 by up to 48.7% and 36.4%, respective to the aforementioned measures.
Similar content being viewed by others
References
Arlitt, M., Williamson, C., 1996. Web Server Workload Characterization: the Search for Invariants. Proc. ACM SIGMETRICS Int. Conf. on Measurement and Modeling of Computer Systems, p.126–137. [doi:10.1145/233013.233034]
Bairavasundaram, L.N., Arpaci-Dusseau, A.C., Arpaci-Dusseau, R.H., Goodson, G.R., Schroeder, B., 2008. An analysis of data corruption in the storage stack. ACM Trans. Storage, 4(3):1–28. [doi:10.1145/1416944.1416947]
Batsakis, A., Burns, R., Kanevsky, A., Lentini, J., Talpey, T., 2008. AWOL: an Adaptive Write Optimizations Layer. Proc. 6th USENIX Conf. on File and Storage Technologies, p.67–80.
Colarelli, D., Grunwald, D., 2002. Massive Arrays of Idle Disks for Storage Archives. Proc. ACM/IEEE Conf. on Supercomputing, p.1–11. [doi:10.1109/SC.2002.10058]
Gomez, M.E., Santonja, V., 2002. Characterizing Temporal Locality in I/O Workload. Proc. Int. Symp. on Performance Evaluation of Computer and Telecommunication Systems, p.1–8.
Gray, J., Shenoy, P., 2000. Rules of Thumb in Data Engineering. Proc. 16th Int. Conf. on Data Engineering, p.3–10. [doi:10.1109/ICDE.2000.839382]
Gurumurthi, S., Sivasubramaniam, A., Kandemir, M., Franke, H., 2003. DRPM: Dynamic Speed Control for Power Management in Server Class Disks. Proc. 30th Annual Int. Symp. on Computer Architecture, p.169–179. [doi:10.1109/ISCA.2003.1206998]
Hennessy, J.L., Patterson, D.A., 2006. Computer Architecture: a Quantitative Approach (4th Ed.). Morgan Kaufmann, USA, p.38–44.
Hsieh, J.W., Kuo, T.W., Chang, L.P., 2006. Efficient identification of hot data for flash memory storage systems. ACM Trans. Storage, 2(1):22–40. [doi:10.1145/1138041.1138043]
Huang, H., Hung, W., Shin, K.G., 2005. FS2: Dynamic Data Replication in Free Disk Space for Improving Disk Performance and Energy Consumption. Proc. 20th ACM Symp. on Operating Systems Principles, p.263–276. [doi:10.1145/1095810.1095836]
Krioukov, A., Bairavasundaram, L.N., Goodson, G.R., Srinivasan, K., Thelen, R., Arpaci-Dusseau, A.C., Arpaci-Dusseau, R.H., 2008. Parity Lost and Parity Regained. Proc. 6th USENIX Conf. on File and Storage Technologies, p.127–141.
Li, D., Wang, J., 2004. EERAID: Energy Efficient Redundant and Inexpensive Disk Array. Proc. 11th Workshop on ACM SIGOPS European Workshop, p.1–14. [doi:10.1145/1133572.1133577]
Mao, B., Feng, D., Jiang, H., Wu, S., Chen, J., Zeng, L., 2008. GRAID: a Green RAID Storage Architecture with Improved Energy Efficiency and Reliability. Proc. Int. Symp. on Modeling, Analysis and Simulation of Computers and Telecommunication Systems, p.1–8. [doi:10.1109/MASCOT.2008.4770574]
Mi, N., Casale, G., Cherkasova, L., Smirni, E., 2008. Burstiness in multi-tier applications: symptoms, causes, and new models. LNCS, 5346:265–286. [doi:10.1007/978-3-540-89856-6_14]
Mogi, K., Kitsuregawa, M., 1996. Hot mirroring: a method of hiding parity update penalty and degradation during rebuilds for RAID5. ACM SIGMOD Rec., 25(2):183–194. [doi:10.1145/235968.233331]
Narayanan, D., Donnelly, A., Rowstron, A., 2008. Write off-loading: practical power management for enterprise storage. ACM Trans. Storage, 4(3):1–23. [doi:10.1145/1416944.1416949]
Orji, C.U., Solworth, J.A., 1993. Doubly distorted mirrors. ACM SIGMOD Rec., 22(2):307–316. [doi:10.1145/170036.170082]
Patterson, D., Gibson, G., Katz, R., 1988. A case for redundant arrays of inexpensive disks (RAID). ACM SIGMOD Rec., 17(3):109–116. [doi:10.1145/971701.50214]
Pinheiro, E., Bianchini, R., 2004. Energy Conservation Techniques for Disk Array-Based Servers. Proc. 18th Annual Int. Conf. on Supercomputing, p.68–78. [doi:10.1145/1006209.1006220]
Pinheiro, E., Bianchini, R., Dubnicki, C., 2006. Exploiting Redundancy to Conserve Energy in Storage Systems. Proc. Joint Int. Conf. on Measurement and Modeling of Computer Systems, p.15–26. [doi:10.1145/1140277.1140281]
Riska, A., Riedel, E., 2006. Disk Drive Level Workload Characterization. Proc. Annual USENIX Technical Conf., p.97–103.
Ruemmler, C., Wilkes, J., 1993. UNIX Disk Access Patterns. Proc. USENIX Winter Technical Conf., p.405–420.
Savage, S., Wilkes, J., 1996. AFRAID: a Frequently Redundant Array of Independent Disks. Proc. USENIX Annual Technical Conf., p.27–39.
Schmuck, F., Haskin, R., 2002. GPFS: a Shared-Disk File System for Large Computing Clusters. Proc. 1st USENIX Conf. on File and Storage Technologies, p.231–244.
Sivathanu, M., Bairavasundaram, L., Arpaci-Dusseau, A.C., Arpaci-Dusseau, R.H., 2004. Life or Death at Block-Level. Proc. 6th Conf. Symp. on Opearting Systems Design and Implementation, p.379–394.
Son, S.W., Chen, G., Kandemir, M., 2005. Disk Layout Optimization for Reducing Energy Consumption. Proc. 19th Annual Int. Conf. on Supercomputing, p.274–283. [doi:10.1145/1088149.1088186]
Storer, M.W., Greenan, K.M., Miller, E.L., Voruganti, K., 2008. Pergamum: Replacing Tape with Energy Efficient, Reliable, Disk-Based Archival Storage. Proc. 6th USENIX Conf. on File and Storage Technologies, p.1–16.
Tian, L., Feng, D., Jiang, H., Zhou, K., Zeng, L., Chen, J., Wang, Z., Song, Z., 2007. PRO: a Popularity-Based Multi-threaded Reconstruction Optimization for RAID-Structured Storage Systems. Proc. 5th USENIX Conf. on File and Storage Technologies, p.277–290.
UMass Trace Repository, 2002. OLTP Application I/O and Search Engine I/O. Available from http://traces.cs.umass.edu/index.php/storage/storage.
Wang, J., Hu, Y., 2002. WOLF-a Novel Reordering Write Buffer to Boost the Performance of Log-Structured File Systems. Proc. 1st USENIX Conf. on File and Storage Technologies, p.47–60.
Weddle, C., Oldham, M., Qian, J., Wang, A.A., Reiher, P., Kuenning, G., 2005. PARAID: a gear-shifting power-aware RAID. ACM Trans. Storage, 3(3):245–260. [doi:10.1145/1288783.1289721]
Weil, S., Brandt, S., Miller, E., Long, D., Maltzahn, C., 2006. Ceph: a Scalable, High-Performance Distributed File System. Proc. 7th Symp. on Operating Systems Design and Implementation, p.307–320.
Wilkes, J., Golding, R., Staelin, C., Sullivan, T., 1996. The HP AutoRAID hierarchical storage system. ACM Trans. Comput. Syst., 14(1):108–136. [doi:10.1145/225535.225539]
Xie, T., Sun, Y., 2008. Sacrificing Reliability for Energy Saving: Is It Worthwhile for Disk Arrays? IEEE Int. Symp. on Parallel and Distributed Processing, p.1–12. [doi:10.1109/IPDPS.2008.4536247]
Zhang, C., Krishnamurthy, X., Yu, A., Wang, R.Y., 2002. Configuring and Scheduling an Eager-Writing Disk Array for a Transaction Processing Workload. Proc. 1st USENIX Conf. on File and Storage Technologies, p.289–304.
Zhu, Q., Chen, Z., Tan, L., Zhou, Y., Keeton, K., Wilkes, J., 2005. Hibernator: helping disk arrays sleep through the winter. ACM SIGOPS Oper. Syst. Rev., 39(5):177–190. [doi:10.1145/1095809.1095828]
Author information
Authors and Affiliations
Corresponding author
Additional information
Project supported by the National Natural Science Foundation of China (No. 61100033), the US National Science Foundation (Nos. NSF-CNS-1016609 and NSF-IIS-0916859), and the Changjiang Innovative Group of Education of China (No. IRT0725)
Rights and permissions
About this article
Cite this article
Mao, B., Wu, Sz. & Feng, D. Beyond mirroring: multi-version disk arraywith improved performance and energy efficiency. J. Zhejiang Univ. - Sci. C 12, 721–729 (2011). https://doi.org/10.1631/jzus.C1000407
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1631/jzus.C1000407