Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

LDM: Log Disk Mirroring with Improved Performance and Reliability for SSD-Based Disk Arrays

Published: 20 May 2016 Publication History

Abstract

With the explosive growth in data volume, the I/O bottleneck has become an increasingly daunting challenge for big data analytics. Economic forces, driven by the desire to introduce flash-based Solid-State Drives (SSDs) into the high-end storage market, have resulted in hybrid storage systems in the cloud. However, a single flash-based SSD cannot satisfy the performance, reliability, and capacity requirements of enterprise or HPC storage systems in the cloud. While an array of SSDs organized in a RAID structure, such as RAID5, provides the potential for high storage capacity and bandwidth, reliability and performance problems will likely result from the parity update operations. In this article, we propose a Log Disk Mirroring scheme (LDM) to improve the performance and reliability of SSD-based disk arrays. LDM is a hybrid disk array architecture that consists of several SSDs and two hard disk drives (HDDs). In an LDM array, the two HDDs are mirrored as a write buffer that temporally absorbs the small write requests. The small and random write data are written on the mirroring buffer by using the logging technique that sequentially appends new data. The small write data are merged and destaged to the SSD-based disk array during the system idle periods. Our prototype implementation of the LDM array and the performance evaluations show that the LDM array significantly outperforms the pure SSD-based disk arrays by a factor of 20.4 on average, and outperforms HPDA by a factor of 5.0 on average. The reliability analysis shows that the MTTDL of the LDM array is 2.7 times and 1.7 times better than that of pure SSD-based disk arrays and HPDA disk arrays.

References

[1]
N. Agrawal, V. Prabhakaran, T. Wobber, J. Davis, M. Manasse, and R. Panigrahy. 2008. Design tradeoffs for SSD performance. In Proceedings of the 2008 USENIX Annual Technical Conference (USENIX’08). 57--70.
[2]
M. Balakrishnan, A. Kadav, V. Prabhakaran, and D. Malkhi. 2010a. Differential RAID: Rethinking RAID for SSD reliability. In Proceedings of the 5th European Conference on Computer systems (EuroSys’10). 15--26.
[3]
M. Balakrishnan, A. Kadav, V. Prabhakaran, and D. Malkhi. 2010b. Differential RAID: Rethinking RAID for SSD reliability. ACM Transactions on Storage 6, 2 (2010), 1--22.
[4]
A. M. Caulfield, J. Coburn, T. Mollov, A. De, A. Akel, J. He, A. Jagatheesan, R. K. Gupta, A. Snavely, and S. Swanson. 2010. Understanding the impact of emerging non-volatile memories on high-performance, IO-intensive computing. In Proceedings of the 2010 International Conference for High Performance Computing, Networking, Storage and Analysis (SC’10). 1--11.
[5]
J. C. W. Chan, Q. Ding, P. P. C. Lee, and H. H. W. Chan. 2014. Parity logging with reserved space: Towards efficient updates and recovery in erasure-coded clustered storage. In Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST’14). 163--176.
[6]
F. Chen, D. A. Koufaty, and X. Zhang. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In Proceedings of the 11th ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’09). 181--192.
[7]
C. Chung and H. Hsu. 2014. Partial parity cache and data cache management method to improve the performance of an SSD-based RAID. IEEE Transactions on Very Large Scale Integration Systems 22, 7 (2014), 1470--1480.
[8]
C. Dirik and B. Jacob. 2009. The performance of PC solid-state disks as a function of bandwidth, concurrency, device architecture, and system organization. In Proceedings of the 36th International Symposium on Computer Architecture (ISCA’09). 279--289.
[9]
SanDisk Solid State Driver. 2015. https://itblog.sandisk.com/truth-ssds-hdd-vendors-do-not-want-you-to-know/.
[10]
Y. Du, F. Liu, Z. Chen, and X. Ma. 2011. WeLe-RAID: A SSD-based RAID for system endurance and performance. In Proceedings of the 8th IFIP International Conference on Network and Parallel Computing (NPC’11). 248--262.
[11]
EMC Symmetrix DMX Architecture. 2010. Retrieved from https://www.emc.com/collateral/hardware/solution-overview/c1011-symm-dmx-architecture-prod-desc-gd.pdf.
[12]
R. Golding, P. Bosch, and C. Staelin. 1995. Idleness is not sloth. In Proceedings of the USENIX Technical Conference (USENIX’95). 201--212.
[13]
K. Greenan, D. D. E. Long, E. L. Miller, T. Schwarz, and A. Wildani. 2009. Building flexible, fault-tolerant flash-based storage systems. In Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep’09).
[14]
L. M. Grupp, J. D. Davis, and S. Swanson. 2012. The bleak future of NAND flash memory. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12).
[15]
S. Im and D. Shin. 2011. Flash-aware RAID techniques for dependable and high-performance flash memory SSD. IEEE Transactions on Computers 1, 60 (2011), 80--92.
[16]
N. Jeremic, G. Mühl, A. Busse, and J. Richling. 2011. The pitfalls of deploying solid-state drive RAIDs. In Proceedings of the 4th Annual International Conference on Systems and Storage (SYSTOR’11). 1--13.
[17]
H. Kao, J. Paris, D. D. E. Long, and T. Schwarz. 2013. A flexible simulation tool for estimating data loss risks in storage arrays. In Proceedings of the 29th IEEE Symposium on Massive Storage Systems and Technologies (MSST’13). 1--5.
[18]
Y. Kim, A. Gupta, B. Urgaonkar, P. Berman, and A. Sivasubramaniam. 2011a. HybridStore: A cost-efficient, high-performance storage system combining SSDs and HDDs. In Proceedings of the 19th Annual IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS’11). 227--236.
[19]
Y. Kim, S. Oral, G. M. Shipman, J. Lee, D. A. Dillow, and F. Wang. 2011b. Harmonia: A globally coordinated garbage collector for arrays of solid-state drives. In Proceedings of the 27th IEEE Symposium on Mass Storage Systems and Technologies (MSST’11). 1--12.
[20]
L. Lin, Y. Zhu, J. Yue, Z. Cai, and B. Segee. 2011. Hot random off-loading: A hybrid storage system with dynamic data migration. In Proceedings of the 19th Annual Meeting of the IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOSTS’11). 318--325.
[21]
Logical Volume Manager (Linux). 2015. Retrieved from https://en.wikipedia.org/wiki/Logical_Volume_Manager_ (Linux).
[22]
B. Mao, H. Jiang, D. Feng, S. Wu, J. Chen, L. Zeng, and L. Tian. 2010. HPDA: A hybrid parity-based disk array for enhanced performance and reliability. In Proceedings of 24th International Parallel & Distributed Processing Symposium (IPDPS’’10). 1--12.
[23]
B. Mao, H. Jiang, S. Wu, Y. Fu, and L. Tian. 2014. Read performance optimization for deduplication-based storage systems in the cloud. ACM Transactions on Storage 10, 2 (2014), 1--22.
[24]
B. Mao, H. Jiang, S. Wu, L. Tian, D. Feng, J. Chen, and L. Zeng. 2012. HPDA: A hybrid parity-based disk array for enhanced performance and reliability. ACM Transactions on Storage 8, 1 (2012), Article 4.
[25]
B. Mao and S. Wu. 2015. Exploiting request characteristics and internal parallelism to improve SSD performance. In Proceedings of the 33rd IEEE International Conference on Computer Design (ICCD’15). 476--479.
[26]
B. Mao, S. Wu, and H. Jiang. 2015. Improving storage availability in cloud-of-clouds with hybrid redundant data distribution. In Proceedings of the 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS’’15). 633--642.
[27]
J. Meza, Q. Wu, S. Kumar, and O. Mutlu. 2015. A large-scale study of flash memory failures in the field. In Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’15). 177--190.
[28]
Microsoft Enterprise Traces. 2009. Retrieved from http://iotta.snia.org/traces/list/BlockIO.
[29]
C. Min, K. Kim, H. Cho, S. Lee, and Y. Eom. 2012. SFS: Random write considered harmful in solid state drives. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12).
[30]
S. Moon and A. L. Narasimha Reddy. 2013. Don’t Let RAID raid the lifetime of your SSD array. In Proceedings of the 5th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’13).
[31]
D. Narayanan, E. Thereska, A. Donnelly, S. Elnikety, and A. Rowstron. 2009. Migrating server storage to SSDs: Analysis of tradeoffs. In Proceedings of the 4th European Conference on Computer Systems (EuroSys’09). 145--158.
[32]
D. A. Patterson, G. Gibson, and R. H. Katz. 1988. A case for redundant arrays of inexpensive disks (RAID). In Proceedings of the International Conference on Management of Data (SIGMOD’88). 109--116.
[33]
M. Rosenblum and J. Ousterhout. 1992. The design and implementation of a log-structured file system. ACM Transactions on Computer Systems 10, 1 (1992), 26--52.
[34]
Samsung Report. 2008. Retrieved from http://news.cnet.com/8301-13924_3-9876557-64.html.
[35]
B. Schroeder and G. A. Gibson. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’07). 1--16.
[36]
Solid State Storage Initiative. 2010. Retreived from http://www.snia.org/forums/sssi.
[37]
G. Soundararajan, V. Prabhakaran, M. Balakrishnan, and T. Wobber. 2010. Extending SSD lifetimes with disk-based write caches. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST’10). 101--114.
[38]
UMass Trace Repository. 2010. Retrieved from http://traces.cs.umass.edu/index.php/Storage/Storage.
[39]
Virtualizing Storage for Scale, Resiliency, and Efficiency. 2012. Retrieved from http://blogs.msdn.com/b/b8/archive/2012/01/05/virtualizing-storage-for-scale-resiliency-and-efficiency.aspx.
[40]
Y. Wang, W. Wang, T. Xie, W. Pan, Y. Gao, and Y. Ouyang. 2014. CR5M: A mirroring-powered channel-RAID5 architecture for an SSD. In Proceedings of 30th Symposium on Mass Storage Systems and Technologies (MSST’14). 1--10.
[41]
S. Wu, H. Jiang, D. Feng, L. Tian, and B. Mao. 2009. WorkOut: I/O workload outsourcing for boosting the raid reconstruction performance. In Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST’09). 239--252.
[42]
S. Wu, H. Jiang, D. Feng, L. Tian, and B. Mao. 2011. Improving availability of RAID-structured storage systems by workload outsourcing. IEEE Transactions on Computers 60, 1 (2011), 64--79.
[43]
S. Wu, H. Jiang, and B. Mao. 2012. IDO: Intelligent data outsourcing with improved raid reconstruction performance in large-scale data centers. In Proceedings of the 26th USENIX Large Installation System Administration (LISA’12). 17--32.
[44]
S. Wu, H. Jiang, and B. Mao. 2015. Proactive data migration for improved storage availability in large-scale data centers. IEEE Transactions on Computers 64, 9 (2015), 2637--2651.
[45]
S. Wu, W. Yang, B. Mao, and Y. Lin. 2015. MC-RAIS: Multi-chunk redundant array of independent SSDs with improved performance. In Proceedings of the 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP’15). 18--32.
[46]
T. Xie and Y. Sun. 2010. Dynamic data reallocation in hybrid disk arrays. IEEE Transactions on Parallel and Distributed Systems 21, 9 (2010), 1330--1341.
[47]
Q. Yang and J. Ren. 2011. I-CASH: Intelligently coupled array of SSD and HDD. In Proceedings of the 17th International Symposium on High Performance Computer Architecture (HPCA’11). 278--289.
[48]
L. Yi, J. Shu, J. Ou, and W. Zheng. 2013. CG-Resync: Conversion-guided resynchronization for a SSD-based RAID array. In Proceedings of the 31st International Conference on Computer Design (ICCD’13). 455--458.
[49]
L. Zeng, D. Feng, J. Chen, Q. Wei, B. Veeravalli, and W. Liu. 2012. HRAID6ML: A hybrid RAID6 storage architecture with mirrored logging. In Proceedings of the 28th IEEE Conference on Massive Data Storage (MSST’12). 1--6.
[50]
Y. Zhang, A. C. Arpaci-Dusseau, and R. H. Arpaci-Dusseau. 2013. Warped mirrors for flash. In Proceedings of the 29th IEEE Symposium on Massive Storage Systems and Technologies (MSST’13). 1--12.

Cited By

View all
  • (2024)Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDsACM Transactions on Storage10.1145/362799220:1(1-32)Online publication date: 30-Jan-2024
  • (2024)Privacy-Enhanced Cooperative Storage Scheme for Contact-Free Sensory Data in AIoT with Efficient SynchronizationACM Transactions on Sensor Networks10.1145/361799820:4(1-19)Online publication date: 11-May-2024
  • (2024)SSRAID: A Stripe-Queued and Stripe-Threaded Merging I/O Strategy to Improve Write Performance of Serial Interface SSD RAIDIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.344308335:10(1841-1853)Online publication date: Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Storage
ACM Transactions on Storage  Volume 12, Issue 4
August 2016
213 pages
ISSN:1553-3077
EISSN:1553-3093
DOI:10.1145/2940403
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 May 2016
Accepted: 01 February 2016
Revised: 01 November 2015
Received: 01 October 2014
Published in TOS Volume 12, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. SSD-based disk arrays
  2. disk buffer
  3. log technique
  4. performance evaluation
  5. reliability analysis

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • U.S. NSF
  • National Natural Science Foundation of China
  • Fundamental Research Funds for the Central Universities
  • Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)2
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDsACM Transactions on Storage10.1145/362799220:1(1-32)Online publication date: 30-Jan-2024
  • (2024)Privacy-Enhanced Cooperative Storage Scheme for Contact-Free Sensory Data in AIoT with Efficient SynchronizationACM Transactions on Sensor Networks10.1145/361799820:4(1-19)Online publication date: 11-May-2024
  • (2024)SSRAID: A Stripe-Queued and Stripe-Threaded Merging I/O Strategy to Improve Write Performance of Serial Interface SSD RAIDIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.344308335:10(1841-1853)Online publication date: Oct-2024
  • (2022)Short Tail: taming tail latency for erasure-code-based in-memory systemsShortTail:降低纠删码内存存储系统的尾部延迟Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.210056623:11(1646-1657)Online publication date: 1-Jun-2022
  • (2022)Mlog: Multi-log Write Buffer upon Ultra-fast SSD RAIDProceedings of the 51st International Conference on Parallel Processing10.1145/3545008.3545034(1-11)Online publication date: 29-Aug-2022
  • (2022)A modeling approach for estimating performance and energy consumption of storage systemsJournal of Computer and System Sciences10.1016/j.jcss.2022.04.001128:C(86-106)Online publication date: 1-Sep-2022
  • (2020)GC-Steering: GC-aware Request Steering and Parallel Reconstruction Optimizations for SSD-based RAIDsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2020.2974346(1-1)Online publication date: 2020
  • (2019)RFPLProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337887(1-10)Online publication date: 5-Aug-2019
  • (2019)Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit CompositionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/332633215:2s(1-20)Online publication date: 29-Jul-2019
  • (2019)DCR: Deterministic Crash Recovery for NAND Flash Storage SystemsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2018.287817938:12(2201-2214)Online publication date: 19-Nov-2019
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media