Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3132847.3133120acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Machine Learning based Performance Modeling of Flash SSDs

Published: 06 November 2017 Publication History

Abstract

Flash memory based solid state drives(SSDs) have alleviated the I/O bottleneck by exploiting its data parallel design. In an enterprise environment, Flash SSD used in the form of a hybrid storage architecture to achieve the better performance with lower cost. In this architecture, I/O load balancing is one of the important factors. However, the internal parallelism distorts the performance measures of the flash SSDs. Despite the criticality of load balancing on I/O intensive environments, these studies have rarely been addressed. In this paper, we examine the effectiveness of applying classification method using machine learning techniques to the I/O saturation estimation by using Linux kernel I/O statistics instead of the utilization measure that is currently used for HDDs. We conclude that machine learning techniques that we employed (Support Vector Machine and LASSO Generalized Linear Model) performs well compared to the existing utilization measure even we cannot collect the internal information of the flash SSDs.

References

[1]
CHEN, CL Philip; ZHANG, Chun-Yang. Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Information Sciences, 2014, 275: 314--347.
[2]
Katz, R. H., et al. Disk system architectures for high performance computing. University of California, Berkeley, Computer Science Division, 1989.
[3]
Baccelli, F. and Foss, S. On the saturation rule for the stability of queues. Journal of Applied Probability(1995), 494--507.
[4]
Agrawal, N., et al. Design Tradeoffs for SSD Performance. In USENIX Annual Technical Conference (USENIX '08) (San Diego, CA, June 14--19, 2008). 57--70.
[5]
HAAS, X. Y. H. R.; HU, X. The fundamental limit of flash random write performance: Understanding, analysis and performance modelling. IBM Research Report, 2010/3/31, Tech. Rep, 2010.
[6]
Iostat. http://man7.org/linux/man-pages/man1/iostat.1.html
[7]
V. Vapnik. Statistical Learning Theory. Wiley, NY, 1998.
[8]
Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B. 1996;58:267--288.
[9]
KOHAVI, Ron, et al. A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Ijcai. 1995. p. 1137--1145.
[10]
F. Chen, R. Lee, and X. Zhang. Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing. In HPCA, pages 266--277, 2011.
[11]
fio.https://github.com/axboe/fio

Cited By

View all
  • (2023)Enabling Multi-tenancy on SSDs with Accurate IO Interference ModelingProceedings of the 2023 ACM Symposium on Cloud Computing10.1145/3620678.3624657(216-232)Online publication date: 30-Oct-2023
  • (2020)Fast Performance Estimation and Design Space Exploration of SSD Using AI TechniquesEmbedded Computer Systems: Architectures, Modeling, and Simulation10.1007/978-3-030-60939-9_1(1-17)Online publication date: 5-Jul-2020

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
November 2017
2604 pages
ISBN:9781450349185
DOI:10.1145/3132847
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Flash SSD
  2. Load balancing
  3. Machine Learning

Qualifiers

  • Short-paper

Funding Sources

  • Institute for Information & communications Technology Promotion

Conference

CIKM '17
Sponsor:

Acceptance Rates

CIKM '17 Paper Acceptance Rate 171 of 855 submissions, 20%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Enabling Multi-tenancy on SSDs with Accurate IO Interference ModelingProceedings of the 2023 ACM Symposium on Cloud Computing10.1145/3620678.3624657(216-232)Online publication date: 30-Oct-2023
  • (2020)Fast Performance Estimation and Design Space Exploration of SSD Using AI TechniquesEmbedded Computer Systems: Architectures, Modeling, and Simulation10.1007/978-3-030-60939-9_1(1-17)Online publication date: 5-Jul-2020

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media