Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3064176.3064207acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

ROS: A Rack-based Optical Storage System with Inline Accessibility for Long-Term Data Preservation

Published: 23 April 2017 Publication History
  • Get Citation Alerts
  • Abstract

    The combination of the explosive growth in digital data and the need to preserve much of this data in the long term has made it an imperative to find a more cost-effective way than HDD arrays and more easily accessible way than tape libraries to store massive amounts of data. While modern optical discs are capable of guaranteeing more than 50-year data preservation without migration, individual optical disks' lack of the performance and capacity relative to HDDs or tapes has significantly limited their use in datacenters. This paper presents a Rack-scale Optical disc library System, or ROS in short, that provides a PB-level total capacity and inline accessibility on thousands of optical discs built within a 42U Rack. A rotatable roller and robotic arm separating and fetching the discs are designed to improve disc placement density and simplify the mechanical structure. A hierarchical storage system based on SSD, hard disks and optical discs are presented to hide the delay of mechanical operation. On the other hand, an optical library file system is proposed to schedule mechanical operation and organize data on the tiered storage with a POSIX user interface to provide an illusion of inline data accessibility. We evaluate ROS on a few key performance metrics including operation delays of the mechanical structure and software overhead in a prototype PB-level ROS system. The results show that ROS stacked on Samba and FUSE can provide almost 323MB/s read and 236MB/s write throughput, about 53ms file write and 15ms read latency via 10GbE network for external users, exhibiting its inline accessibility. Besides, ROS is able to effectively hide and virtualize internal complex operational behaviors and be easily deployable in datacenters.

    References

    [1]
    O. S. T. Association. Universal disk format specification. www.osta.org/specs/pdf/udf250.pdf, 2003.
    [2]
    S. Balakrishnan, R. Black, A. Donnelly, P. England, A. Glass, D. Harper, S. Legtchenko, A. Ogus, E. Peterson, and A. Rowstron. Pelican: A building block for exascale cold data storage. In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14), pages 351--365, 2014.
    [3]
    S. Boyd, A. Horvath, and D. Dornfeld. Life-cycle assessment of nand flash memory. Semiconductor Manufacturing, IEEE Transactions on, 24(1):117--124, 2011.
    [4]
    P. Corp. Data archiver lb-dh8 series. Technical report, 2016.
    [5]
    D. Crockford. Javascript object notation. http://www.json.org/, 2016.
    [6]
    G. Deepika. Holographic versatile disc. In Innovations in Emerging Technology (NCOIET), 2011 National Conference on, pages 145--146. IEEE, 2011.
    [7]
    G. R. Ganger and M. F. Kaashoek. Embedded inodes and explicit grouping: Exploiting disk bandwidth for small files. In USENIX Annual Technical Conference, pages 1--17, 1997.
    [8]
    V. T. George Amvrosiadis. filebench. https://github.com/filebench/filebench/wiki, 2016.
    [9]
    B. Godard, J. Schmidtke, J.-J. Cassiman, and S. Aymé. Data storage and dna banking for biomedical research: informed consent, confidentiality, quality issues, ownership, return of benefits. a professional perspective. European Journal of Human Genetics, 11:S88--S122, 2003.
    [10]
    M. Grawinkel, L. Nagel, M. Mäsker, F. Padua, A. Brinkmann, and L. Sorth. Analysis of the ecmwf storage landscape. In 13th USENIX Conference on File and Storage Technologies (FAST 15), pages 15--27, 2015.
    [11]
    M. Gu, X. Li, and Y. Cao. Optical storage arrays: a perspective for future big data storage. Light: Science & Applications,3 (5):e177, 2014.
    [12]
    P. Gupta, A. Wildani, E. L. Miller, D. S. H. Rosenthal, and D. D. E. Long. Effects of prolonged media usage and long-term planning on archival systems. In International Conference on Massive Storage Systems and Technologies, 2016.
    [13]
    C. R. Hertel. Implementing CIFS: The Common Internet File System. Prentice Hall Professional, 2004.
    [14]
    N. Kishore and S. Sharma. Secured data migration from enterprise to cloud storage-analytical survey. BVICAM's International Journal of Information Technology, 8(1), 2016.
    [15]
    S. Kumar and T. R. McCaffrey. Engineering economics at a hard disk drive manufacturer. Technovation, 23(9):749--755, 2003.
    [16]
    R. Miller. Inside facebook's blu-ray cold storage data center. http://datacenterfrontier.com/inside-facebooks-blu-ray-cold-storage-data-center/, 2014.
    [17]
    H. Minemura, K. Watanabe, K. Adachi, and R. Tamura. High-speed write/read techniques for blu-ray write-once discs. Japanese journal of applied physics, 45(2S):1213, 2006.
    [18]
    B. Nikoobakht and M. A. El-Sayed. Preparation and growth mechanism of gold nanorods (nrs) using seed-mediated growth method. Chemistry of Materials, 15(10):1957--1962, 2003.
    [19]
    Y. Okazaki, K. Hara, T. Kawashima, A. Sato, and T. Hirano. Estimating the archival life of metal particulate tape. Magnetics, IEEE Transactions on, 28(5):2365--2367, 1992.
    [20]
    D. Pease, A. Amir, L. V. Real, B. Biskeborn, M. Richmond, and A. Abe. The linear tape file system. In Mass Storage Systems and Technologies (MSST), 2010 IEEE 26th Symposium on, pages 1--8. IEEE, 2010.
    [21]
    A. Rajgarhia and A. Gehani. Performance and extension of user space file systems. In Proceedings of the 2010 ACM Symposium on Applied Computing, pages 206--213. ACM, 2010.
    [22]
    P. Rattan et al. Disaster management and electronic storage media: An overview. International Journal of Information Dissemination and Technology, 2(1):1, 2012.
    [23]
    A. Rosenthal, P. Mork, M. H. Li, J. Stanford, D. Koester, and P. Reynolds. Cloud computing: a new business paradigm for biomedical information sharing. Journal of biomedical informatics, 43(2):342--353, 2010.
    [24]
    Sony. Sony everspan. Technical report, 2016.
    [25]
    C. Thompson. Optical disc system for long term archiving of multi-media content. In Systems, Signals and Image Processing (IWSSIP), 2014 International Conference on, pages 11--14. IEEE, 2014.
    [26]
    W. Wamsteker, I. Skillen, J. Ponz, A. De La Fuente, M. Barylak, and I. Yurrita. Ines: astronomy data distribution for the future. Astrophysics and Space Science, 273(1-4):155--161, 2000.
    [27]
    A. Watanabe. Optical library system for long-term preservation with extended error correction coding. In 29th IEEE conference on massive data storage, 2013.
    [28]
    N. Yamamoto, O. Tatebe, and S. Sekiguchi. Parallel and distributed astronomical data analysis on grid datafarm. In Grid Computing, 2004. Proceedings. Fifth IEEE/ACM International Workshop on, pages 461--166. IEEE, 2004.
    [29]
    P. Zijlstra, J. W. Chon, and M. Gu. Five-dimensional optical recording mediated by surface plasmons in gold nanorods. Nature, 459(7245):410--413, 2009.

    Cited By

    View all
    • (2023)Project Silica: Towards Sustainable Cloud Archival Storage in GlassProceedings of the 29th Symposium on Operating Systems Principles10.1145/3600006.3613208(166-181)Online publication date: 23-Oct-2023
    • (2020)Batch-file Operations to Optimize Massive Files AccessingACM Transactions on Storage10.1145/339428616:3(1-25)Online publication date: 16-Jul-2020
    • (2019)Cold Storage Data ArchivesProceedings of the 15th International Workshop on Data Management on New Hardware10.1145/3329785.3329921(1-7)Online publication date: 1-Jul-2019
    • Show More Cited By
    1. ROS: A Rack-based Optical Storage System with Inline Accessibility for Long-Term Data Preservation

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      EuroSys '17: Proceedings of the Twelfth European Conference on Computer Systems
      April 2017
      648 pages
      ISBN:9781450349383
      DOI:10.1145/3064176
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 23 April 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      EuroSys '17
      Sponsor:
      EuroSys '17: Twelfth EuroSys Conference 2017
      April 23 - 26, 2017
      Belgrade, Serbia

      Acceptance Rates

      Overall Acceptance Rate 241 of 1,308 submissions, 18%

      Upcoming Conference

      EuroSys '25
      Twentieth European Conference on Computer Systems
      March 30 - April 3, 2025
      Rotterdam , Netherlands

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Project Silica: Towards Sustainable Cloud Archival Storage in GlassProceedings of the 29th Symposium on Operating Systems Principles10.1145/3600006.3613208(166-181)Online publication date: 23-Oct-2023
      • (2020)Batch-file Operations to Optimize Massive Files AccessingACM Transactions on Storage10.1145/339428616:3(1-25)Online publication date: 16-Jul-2020
      • (2019)Cold Storage Data ArchivesProceedings of the 15th International Workshop on Data Management on New Hardware10.1145/3329785.3329921(1-7)Online publication date: 1-Jul-2019
      • (2019)The five-minute rule 30 years later and its impact on the storage hierarchyCommunications of the ACM10.1145/331816362:11(114-120)Online publication date: 24-Oct-2019
      • (2019)BFO: Batch-File Operations on Massive Files for Consistent Performance Improvement2019 35th Symposium on Mass Storage Systems and Technologies (MSST)10.1109/MSST.2019.00-17(38-50)Online publication date: May-2019
      • (2019)Data Longevity and CompatibilityEncyclopedia of Big Data Technologies10.1007/978-3-319-77525-8_331(559-563)Online publication date: 20-Feb-2019
      • (2019)Cheap Data Analytics on Cold StorageEncyclopedia of Big Data Technologies10.1007/978-3-319-77525-8_147(435-443)Online publication date: 20-Feb-2019
      • (2018)Data Longevity and CompatibilityEncyclopedia of Big Data Technologies10.1007/978-3-319-63962-8_331-1(1-5)Online publication date: 1-May-2018
      • (2018)Cheap Data Analytics on Cold StorageEncyclopedia of Big Data Technologies10.1007/978-3-319-63962-8_147-1(1-8)Online publication date: 5-Feb-2018

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media