Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

LDM: Log Disk Mirroring with Improved Performance and Reliability for SSD-Based Disk Arrays

Published: 20 May 2016 Publication History
  • Get Citation Alerts
  • Abstract

    With the explosive growth in data volume, the I/O bottleneck has become an increasingly daunting challenge for big data analytics. Economic forces, driven by the desire to introduce flash-based Solid-State Drives (SSDs) into the high-end storage market, have resulted in hybrid storage systems in the cloud. However, a single flash-based SSD cannot satisfy the performance, reliability, and capacity requirements of enterprise or HPC storage systems in the cloud. While an array of SSDs organized in a RAID structure, such as RAID5, provides the potential for high storage capacity and bandwidth, reliability and performance problems will likely result from the parity update operations. In this article, we propose a Log Disk Mirroring scheme (LDM) to improve the performance and reliability of SSD-based disk arrays. LDM is a hybrid disk array architecture that consists of several SSDs and two hard disk drives (HDDs). In an LDM array, the two HDDs are mirrored as a write buffer that temporally absorbs the small write requests. The small and random write data are written on the mirroring buffer by using the logging technique that sequentially appends new data. The small write data are merged and destaged to the SSD-based disk array during the system idle periods. Our prototype implementation of the LDM array and the performance evaluations show that the LDM array significantly outperforms the pure SSD-based disk arrays by a factor of 20.4 on average, and outperforms HPDA by a factor of 5.0 on average. The reliability analysis shows that the MTTDL of the LDM array is 2.7 times and 1.7 times better than that of pure SSD-based disk arrays and HPDA disk arrays.

    References

    [1]
    N. Agrawal, V. Prabhakaran, T. Wobber, J. Davis, M. Manasse, and R. Panigrahy. 2008. Design tradeoffs for SSD performance. In Proceedings of the 2008 USENIX Annual Technical Conference (USENIX’08). 57--70.
    [2]
    M. Balakrishnan, A. Kadav, V. Prabhakaran, and D. Malkhi. 2010a. Differential RAID: Rethinking RAID for SSD reliability. In Proceedings of the 5th European Conference on Computer systems (EuroSys’10). 15--26.
    [3]
    M. Balakrishnan, A. Kadav, V. Prabhakaran, and D. Malkhi. 2010b. Differential RAID: Rethinking RAID for SSD reliability. ACM Transactions on Storage 6, 2 (2010), 1--22.
    [4]
    A. M. Caulfield, J. Coburn, T. Mollov, A. De, A. Akel, J. He, A. Jagatheesan, R. K. Gupta, A. Snavely, and S. Swanson. 2010. Understanding the impact of emerging non-volatile memories on high-performance, IO-intensive computing. In Proceedings of the 2010 International Conference for High Performance Computing, Networking, Storage and Analysis (SC’10). 1--11.
    [5]
    J. C. W. Chan, Q. Ding, P. P. C. Lee, and H. H. W. Chan. 2014. Parity logging with reserved space: Towards efficient updates and recovery in erasure-coded clustered storage. In Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST’14). 163--176.
    [6]
    F. Chen, D. A. Koufaty, and X. Zhang. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In Proceedings of the 11th ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’09). 181--192.
    [7]
    C. Chung and H. Hsu. 2014. Partial parity cache and data cache management method to improve the performance of an SSD-based RAID. IEEE Transactions on Very Large Scale Integration Systems 22, 7 (2014), 1470--1480.
    [8]
    C. Dirik and B. Jacob. 2009. The performance of PC solid-state disks as a function of bandwidth, concurrency, device architecture, and system organization. In Proceedings of the 36th International Symposium on Computer Architecture (ISCA’09). 279--289.
    [9]
    SanDisk Solid State Driver. 2015. https://itblog.sandisk.com/truth-ssds-hdd-vendors-do-not-want-you-to-know/.
    [10]
    Y. Du, F. Liu, Z. Chen, and X. Ma. 2011. WeLe-RAID: A SSD-based RAID for system endurance and performance. In Proceedings of the 8th IFIP International Conference on Network and Parallel Computing (NPC’11). 248--262.
    [11]
    EMC Symmetrix DMX Architecture. 2010. Retrieved from https://www.emc.com/collateral/hardware/solution-overview/c1011-symm-dmx-architecture-prod-desc-gd.pdf.
    [12]
    R. Golding, P. Bosch, and C. Staelin. 1995. Idleness is not sloth. In Proceedings of the USENIX Technical Conference (USENIX’95). 201--212.
    [13]
    K. Greenan, D. D. E. Long, E. L. Miller, T. Schwarz, and A. Wildani. 2009. Building flexible, fault-tolerant flash-based storage systems. In Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep’09).
    [14]
    L. M. Grupp, J. D. Davis, and S. Swanson. 2012. The bleak future of NAND flash memory. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12).
    [15]
    S. Im and D. Shin. 2011. Flash-aware RAID techniques for dependable and high-performance flash memory SSD. IEEE Transactions on Computers 1, 60 (2011), 80--92.
    [16]
    N. Jeremic, G. Mühl, A. Busse, and J. Richling. 2011. The pitfalls of deploying solid-state drive RAIDs. In Proceedings of the 4th Annual International Conference on Systems and Storage (SYSTOR’11). 1--13.
    [17]
    H. Kao, J. Paris, D. D. E. Long, and T. Schwarz. 2013. A flexible simulation tool for estimating data loss risks in storage arrays. In Proceedings of the 29th IEEE Symposium on Massive Storage Systems and Technologies (MSST’13). 1--5.
    [18]
    Y. Kim, A. Gupta, B. Urgaonkar, P. Berman, and A. Sivasubramaniam. 2011a. HybridStore: A cost-efficient, high-performance storage system combining SSDs and HDDs. In Proceedings of the 19th Annual IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS’11). 227--236.
    [19]
    Y. Kim, S. Oral, G. M. Shipman, J. Lee, D. A. Dillow, and F. Wang. 2011b. Harmonia: A globally coordinated garbage collector for arrays of solid-state drives. In Proceedings of the 27th IEEE Symposium on Mass Storage Systems and Technologies (MSST’11). 1--12.
    [20]
    L. Lin, Y. Zhu, J. Yue, Z. Cai, and B. Segee. 2011. Hot random off-loading: A hybrid storage system with dynamic data migration. In Proceedings of the 19th Annual Meeting of the IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOSTS’11). 318--325.
    [21]
    Logical Volume Manager (Linux). 2015. Retrieved from https://en.wikipedia.org/wiki/Logical_Volume_Manager_ (Linux).
    [22]
    B. Mao, H. Jiang, D. Feng, S. Wu, J. Chen, L. Zeng, and L. Tian. 2010. HPDA: A hybrid parity-based disk array for enhanced performance and reliability. In Proceedings of 24th International Parallel & Distributed Processing Symposium (IPDPS’’10). 1--12.
    [23]
    B. Mao, H. Jiang, S. Wu, Y. Fu, and L. Tian. 2014. Read performance optimization for deduplication-based storage systems in the cloud. ACM Transactions on Storage 10, 2 (2014), 1--22.
    [24]
    B. Mao, H. Jiang, S. Wu, L. Tian, D. Feng, J. Chen, and L. Zeng. 2012. HPDA: A hybrid parity-based disk array for enhanced performance and reliability. ACM Transactions on Storage 8, 1 (2012), Article 4.
    [25]
    B. Mao and S. Wu. 2015. Exploiting request characteristics and internal parallelism to improve SSD performance. In Proceedings of the 33rd IEEE International Conference on Computer Design (ICCD’15). 476--479.
    [26]
    B. Mao, S. Wu, and H. Jiang. 2015. Improving storage availability in cloud-of-clouds with hybrid redundant data distribution. In Proceedings of the 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS’’15). 633--642.
    [27]
    J. Meza, Q. Wu, S. Kumar, and O. Mutlu. 2015. A large-scale study of flash memory failures in the field. In Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’15). 177--190.
    [28]
    Microsoft Enterprise Traces. 2009. Retrieved from http://iotta.snia.org/traces/list/BlockIO.
    [29]
    C. Min, K. Kim, H. Cho, S. Lee, and Y. Eom. 2012. SFS: Random write considered harmful in solid state drives. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12).
    [30]
    S. Moon and A. L. Narasimha Reddy. 2013. Don’t Let RAID raid the lifetime of your SSD array. In Proceedings of the 5th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’13).
    [31]
    D. Narayanan, E. Thereska, A. Donnelly, S. Elnikety, and A. Rowstron. 2009. Migrating server storage to SSDs: Analysis of tradeoffs. In Proceedings of the 4th European Conference on Computer Systems (EuroSys’09). 145--158.
    [32]
    D. A. Patterson, G. Gibson, and R. H. Katz. 1988. A case for redundant arrays of inexpensive disks (RAID). In Proceedings of the International Conference on Management of Data (SIGMOD’88). 109--116.
    [33]
    M. Rosenblum and J. Ousterhout. 1992. The design and implementation of a log-structured file system. ACM Transactions on Computer Systems 10, 1 (1992), 26--52.
    [34]
    Samsung Report. 2008. Retrieved from http://news.cnet.com/8301-13924_3-9876557-64.html.
    [35]
    B. Schroeder and G. A. Gibson. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’07). 1--16.
    [36]
    Solid State Storage Initiative. 2010. Retreived from http://www.snia.org/forums/sssi.
    [37]
    G. Soundararajan, V. Prabhakaran, M. Balakrishnan, and T. Wobber. 2010. Extending SSD lifetimes with disk-based write caches. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST’10). 101--114.
    [38]
    UMass Trace Repository. 2010. Retrieved from http://traces.cs.umass.edu/index.php/Storage/Storage.
    [39]
    Virtualizing Storage for Scale, Resiliency, and Efficiency. 2012. Retrieved from http://blogs.msdn.com/b/b8/archive/2012/01/05/virtualizing-storage-for-scale-resiliency-and-efficiency.aspx.
    [40]
    Y. Wang, W. Wang, T. Xie, W. Pan, Y. Gao, and Y. Ouyang. 2014. CR5M: A mirroring-powered channel-RAID5 architecture for an SSD. In Proceedings of 30th Symposium on Mass Storage Systems and Technologies (MSST’14). 1--10.
    [41]
    S. Wu, H. Jiang, D. Feng, L. Tian, and B. Mao. 2009. WorkOut: I/O workload outsourcing for boosting the raid reconstruction performance. In Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST’09). 239--252.
    [42]
    S. Wu, H. Jiang, D. Feng, L. Tian, and B. Mao. 2011. Improving availability of RAID-structured storage systems by workload outsourcing. IEEE Transactions on Computers 60, 1 (2011), 64--79.
    [43]
    S. Wu, H. Jiang, and B. Mao. 2012. IDO: Intelligent data outsourcing with improved raid reconstruction performance in large-scale data centers. In Proceedings of the 26th USENIX Large Installation System Administration (LISA’12). 17--32.
    [44]
    S. Wu, H. Jiang, and B. Mao. 2015. Proactive data migration for improved storage availability in large-scale data centers. IEEE Transactions on Computers 64, 9 (2015), 2637--2651.
    [45]
    S. Wu, W. Yang, B. Mao, and Y. Lin. 2015. MC-RAIS: Multi-chunk redundant array of independent SSDs with improved performance. In Proceedings of the 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP’15). 18--32.
    [46]
    T. Xie and Y. Sun. 2010. Dynamic data reallocation in hybrid disk arrays. IEEE Transactions on Parallel and Distributed Systems 21, 9 (2010), 1330--1341.
    [47]
    Q. Yang and J. Ren. 2011. I-CASH: Intelligently coupled array of SSD and HDD. In Proceedings of the 17th International Symposium on High Performance Computer Architecture (HPCA’11). 278--289.
    [48]
    L. Yi, J. Shu, J. Ou, and W. Zheng. 2013. CG-Resync: Conversion-guided resynchronization for a SSD-based RAID array. In Proceedings of the 31st International Conference on Computer Design (ICCD’13). 455--458.
    [49]
    L. Zeng, D. Feng, J. Chen, Q. Wei, B. Veeravalli, and W. Liu. 2012. HRAID6ML: A hybrid RAID6 storage architecture with mirrored logging. In Proceedings of the 28th IEEE Conference on Massive Data Storage (MSST’12). 1--6.
    [50]
    Y. Zhang, A. C. Arpaci-Dusseau, and R. H. Arpaci-Dusseau. 2013. Warped mirrors for flash. In Proceedings of the 29th IEEE Symposium on Massive Storage Systems and Technologies (MSST’13). 1--12.

    Cited By

    View all
    • (2024)Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDsACM Transactions on Storage10.1145/362799220:1(1-32)Online publication date: 30-Jan-2024
    • (2024)Privacy-Enhanced Cooperative Storage Scheme for Contact-Free Sensory Data in AIoT with Efficient SynchronizationACM Transactions on Sensor Networks10.1145/361799820:4(1-19)Online publication date: 11-May-2024
    • (2022)Short Tail: taming tail latency for erasure-code-based in-memory systemsShortTail:降低纠删码内存存储系统的尾部延迟Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.210056623:11(1646-1657)Online publication date: 1-Jun-2022
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Storage
    ACM Transactions on Storage  Volume 12, Issue 4
    August 2016
    213 pages
    ISSN:1553-3077
    EISSN:1553-3093
    DOI:10.1145/2940403
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 May 2016
    Accepted: 01 February 2016
    Revised: 01 November 2015
    Received: 01 October 2014
    Published in TOS Volume 12, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. SSD-based disk arrays
    2. disk buffer
    3. log technique
    4. performance evaluation
    5. reliability analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • U.S. NSF
    • National Natural Science Foundation of China
    • Fundamental Research Funds for the Central Universities
    • Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)19
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 09 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDsACM Transactions on Storage10.1145/362799220:1(1-32)Online publication date: 30-Jan-2024
    • (2024)Privacy-Enhanced Cooperative Storage Scheme for Contact-Free Sensory Data in AIoT with Efficient SynchronizationACM Transactions on Sensor Networks10.1145/361799820:4(1-19)Online publication date: 11-May-2024
    • (2022)Short Tail: taming tail latency for erasure-code-based in-memory systemsShortTail:降低纠删码内存存储系统的尾部延迟Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.210056623:11(1646-1657)Online publication date: 1-Jun-2022
    • (2022)Mlog: Multi-log Write Buffer upon Ultra-fast SSD RAIDProceedings of the 51st International Conference on Parallel Processing10.1145/3545008.3545034(1-11)Online publication date: 29-Aug-2022
    • (2022)A modeling approach for estimating performance and energy consumption of storage systemsJournal of Computer and System Sciences10.1016/j.jcss.2022.04.001128:C(86-106)Online publication date: 1-Sep-2022
    • (2020)GC-Steering: GC-aware Request Steering and Parallel Reconstruction Optimizations for SSD-based RAIDsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2020.2974346(1-1)Online publication date: 2020
    • (2019)RFPLProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337887(1-10)Online publication date: 5-Aug-2019
    • (2019)Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit CompositionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/332633215:2s(1-20)Online publication date: 29-Jul-2019
    • (2019)DCR: Deterministic Crash Recovery for NAND Flash Storage SystemsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2018.287817938:12(2201-2214)Online publication date: 19-Nov-2019
    • (2019)FvRS: Efficiently identifying performance-critical data for improving performance of big data processingFuture Generation Computer Systems10.1016/j.future.2018.09.00391(157-166)Online publication date: Feb-2019
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media