Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1995896.1995902acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article

Hystor: making the best use of solid state drives in high performance storage systems

Published: 31 May 2011 Publication History

Abstract

With the fast technical improvement, flash memory based Solid State Drives (SSDs) are becoming an important part of the computer storage hierarchy to significantly improve performance and energy efficiency. However, due to its relatively high price and low capacity, a major system research issue to address is on how to make SSDs play their most effective roles in a high-performance storage system in cost- and performance-effective ways.
In this paper, we will answer several related questions with insights based on the design and implementation of a high performance hybrid storage system, called Hystor. We make the best use of SSDs in storage systems by achieving a set of optimization objectives from both system deployment and algorithm design perspectives. Hystor manages both SSDs and hard disk drives (HDDs) as one single block device with minimal changes to existing OS kernels. By monitoring I/O access patterns at runtime, Hystor can effectively identify blocks that (1) can result in long latencies or (2) are semantically critical (e.g. file system metadata), and stores them in SSDs for future accesses to achieve a significant performance improvement. In order to further leverage the exceptionally high performance of writes in the state-of-the-art SSDs, Hystor also serves as a write-back buffer to speed up write requests. Our measurements on Hystor implemented in the Linux kernel 2.6.25.8 show that it can take advantage of the performance merits of SSDs with only a few lines of changes to the stock Linux kernel. Our system study shows that in a highly effective hybrid storage system, SSDs should play a major role as an independent storage where the best suitable data are adaptively and timely migrated in and retained, and it can also be effective to serve as a write-back buffer.

References

[1]
N. Agrawal, V. Prabhakaran, T. Wobber, J. D. Davis, M. Manasse, and R. Panigrahy. Design tradeoffs for SSD performance. In Proceedings of USENIX'08, Boston, MA, June 2008.
[2]
Blktrace. http://linux.die.net/man/8/blktrace.
[3]
A. M. Caulfield, L. M. Grupp, and S. Swanson. Gordon: using flash memory to build fast, power-efficient clusters for data-intensive applications. In Proceedings of ASPLOS'09, Washington, D.C., March 2009.
[4]
F. Chen, S. Jiang, and X. Zhang. SmartSaver: Turning flash drive into a disk energy saver for mobile computers. In Proceedings of ISLPED'06, Tegernsee, Germany, Oct. 2006.
[5]
F. Chen, D. A. Koufaty, and X. Zhang. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In Proceedings of SIGMETRICS/Performance'09, Seattle, WA, June 2009.
[6]
F. Chen, R. Lee, and X. Zhang. Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing. In Proceedings of HPCA'11, San Antonio, Texas, Feb 12-16 2011.
[7]
F. Chen, T. Luo, and X. Zhang. CAFTL: A content-aware flash translation layer enhancing the lifespan of flash memory based solid state drives. In Proceedings of FAST'11, San Jose, CA, Feb 15-17 2011.
[8]
C. Dirik and B. Jacob. The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device, architecture, and system organization. In Proceedings of ISCA'09, Austin, TX, June 2009.
[9]
E. Gal and S. Toledo. Algorithms and data structures for flash memories. In ACM Computing Survey'05, volume 37(2), pages 138--163, 2005.
[10]
A. Gupta, Y. Kim, and B. Urgaonkar. DFTL: A flash translation layer employing demand-based selective caching of page-level address mappings. In Proceedings of ASPLOS'09, Washington, D.C., March 2009.
[11]
J. Handy. Flash memory vs. hard disk drives - which will win? http://www.storagesearch.com/semico-art1.html.
[12]
L. Huang and T. Chieuh. Experiences in building a software-based SATF scheduler. In Tech. Rep. ECSL-TR81, 2001.
[13]
Intel. Intel X25-E extreme SATA solid-state drive. http://www.intel.com/design/flash/nand/extreme, 2008.
[14]
S. Jiang, F. Chen, and X. Zhang. CLOCK-Pro: An Effective Improvement of the CLOCK Replacement. In Proceedings of USENIX'05, Anaheim, CA, April 2005.
[15]
S. Jiang, X. Ding, F. Chen, E. Tan, and X. Zhang. DULO: An effective buffer cache management scheme to exploit both temporal and spatial localities. In Proceedings of FAST'05, San Francisco, CA, December 2005.
[16]
T. Kgil, D. Roberts, and T. Mudge. Improving NAND flash based disk caches. In Proceedings of ISCA'08, Beijing, China, June 2008.
[17]
I. Koltsidas and S. D. Viglas. Flashing up the storage layer. In Proceedings of VLDB'08, Auckland, New Zealand, August 2008.
[18]
A. Leventhal. Flash storage memory. In Communications of the ACM, volume 51(7), pages 47--51, July 2008.
[19]
B. Marsh, F. Douglis, and P. Krishnan. Flash memory file caching for mobile computers. In Proceedings of the 27th Hawaii Conference on Systems Science, Wailea, HI, Jan 1994.
[20]
J. Matthews, S. Trika, D. Hensgen, R. Coulson, and K. Grimsrud. Intel Turbo Memory: Nonvolatile disk caches in the storage hierarchy of mainstream computer systems. In ACM Transactions on Storage, volume 4, May 2008.
[21]
M. P. Mesnier. Intel open storage toolkit. http://www.sourceforge.org/projects/intel-iscsi.
[22]
M. P. Mesnier and J. B. Akers. Differentiated storage services. SIGOPS Oper. Syst. Rev., 45:45--53, February 2011.
[23]
Microsoft. Microsoft Windows Readyboost. http://www.microsoft.com/windows/windows-vista/features/readyboost.aspx, 2008.
[24]
A. Patrizio. UCSD plans first flash-based supercomputer. http://www.internetnews.com/hardware/article.php/3847456, November 2009.
[25]
D. A. Patterson, G. Gibson, and R. H. Katz. A case for redundant arrays of inexpensive disks (RAID). In Proceedings of SIGMOD'88, Chicago, IL, June 1988.
[26]
H. Payer, M. A. Sanvido, Z. Z. Bandic, and C. M. Kirsch. Combo Drive: Optimizing cost and performance in a heterogeneous storage device. In Proceedings of the 1st Workshop on integrating solid-state memory into the storage hierarchy (WISH'09), 2009.
[27]
F. I. Popovici, A. C. Arpaci-Dusseau, and R. H. Arpaci-Dusseau. Robust, portable I/O scheduling with the disk mimic. In Proceedings of USENIX'03, San Antonio, TX, June 2003.
[28]
Postmark. A new file system benchmark. http://www.netapp.com/tech_library/3022.html, 1997.
[29]
T. Pritchett and M. Thottethodi. SieveStore: A highly-selective, ensemble-level disk cache for cost-performance. In Proceedings of ISCA'10, Saint-Malo, France, June 2010.
[30]
J. Ren and Q. Yang. I-CASH: Intelligently coupled array of ssd and hdd. In Proceedings of HPCA'11, San Antonio, Texas, Feb 2011.
[31]
S. Shah and B. D. Noble. A study of e-mail patterns. In Software Practice and Experience, volume 37(14), 2007.
[32]
G. Soundararajan, V. Prabhakaran, M. Balakrishnan, and T. Wobber. Extending SSD lifetimes with disk-based write caches. In Proceedings of FAST'10, San Jose, CA, February 2010.
[33]
Transaction Processing Performance Council. TPC Benchmark H. http://www.tpc. org/tpch, 2008.
[34]
A. A. Wang, P. Reiher, G. J. Popek, and G. H. Kuenning. Conquest: Better performance through a Disk/Persistent-RAM hybrid file system. In Proceedings of the USENIX'02, Monterey, CA, June 2002.
[35]
J. Wilkes, R. Golding, C. Staelin, and T. Sullivan. The HP AutoRAID hierarchical storage system. In ACM Tran. on Computer Systems, volume 14, pages 108--136, Feb 1996.

Cited By

View all
  • (2025)CDA-GC: An effective cache data allocation for garbage collection in flash-based solid-state drivesIntegration10.1016/j.vlsi.2025.102359102(102359)Online publication date: May-2025
  • (2024)NOMADProceedings of the 18th USENIX Conference on Operating Systems Design and Implementation10.5555/3691938.3691940(19-35)Online publication date: 10-Jul-2024
  • (2024)A Detailed Analysis of Issues with Solid-State DevicesRobotics and Automation in Industry 4.010.2174/9789815223491124010013(168-188)Online publication date: 13-Oct-2024
  • Show More Cited By

Index Terms

  1. Hystor: making the best use of solid state drives in high performance storage systems

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICS '11: Proceedings of the international conference on Supercomputing
May 2011
398 pages
ISBN:9781450301022
DOI:10.1145/1995896
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. hard disk drive
  2. hybrid storage system
  3. solid state drive

Qualifiers

  • Research-article

Conference

ICS '11
Sponsor:
ICS '11: International Conference on Supercomputing
May 31 - June 4, 2011
Arizona, Tucson, USA

Acceptance Rates

Overall Acceptance Rate 629 of 2,180 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)51
  • Downloads (Last 6 weeks)5
Reflects downloads up to 06 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)CDA-GC: An effective cache data allocation for garbage collection in flash-based solid-state drivesIntegration10.1016/j.vlsi.2025.102359102(102359)Online publication date: May-2025
  • (2024)NOMADProceedings of the 18th USENIX Conference on Operating Systems Design and Implementation10.5555/3691938.3691940(19-35)Online publication date: 10-Jul-2024
  • (2024)A Detailed Analysis of Issues with Solid-State DevicesRobotics and Automation in Industry 4.010.2174/9789815223491124010013(168-188)Online publication date: 13-Oct-2024
  • (2024)Fast Online Reconstruction for SSD-Based RAID-5 Storage SystemsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2023.334803343:6(1886-1899)Online publication date: Jun-2024
  • (2024)HA-CSD: Host and SSD Coordinated Compression for Capacity and Performance2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS57955.2024.00078(825-838)Online publication date: 27-May-2024
  • (2023)Hybrid Block Storage for Efficient Cloud Volume ServiceACM Transactions on Storage10.1145/359644619:4(1-25)Online publication date: 3-Oct-2023
  • (2023)Venice: Improving Solid-State Drive Parallelism at Low Cost via Conflict-Free AccessesProceedings of the 50th Annual International Symposium on Computer Architecture10.1145/3579371.3589071(1-16)Online publication date: 17-Jun-2023
  • (2023)HyF2FS: A Filesystem to Fully Exploit the Parallelism of Hybrid Storage2023 IEEE 41st International Conference on Computer Design (ICCD)10.1109/ICCD58817.2023.00049(267-274)Online publication date: 6-Nov-2023
  • (2023)Cache eviction for SSD-HDD hybrid storage based on sequential packingJournal of Systems Architecture: the EUROMICRO Journal10.1016/j.sysarc.2023.102930141:COnline publication date: 1-Aug-2023
  • (2022)Building a Fast and Efficient LSM-tree Store by Integrating Local Storage with Cloud StorageACM Transactions on Architecture and Code Optimization10.1145/352745219:3(1-26)Online publication date: 25-May-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media