Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2287076.2287095acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
poster

A system-aware optimized data organization for efficient scientific analytics

Published: 18 June 2012 Publication History

Abstract

Large-scale scientific applications on High End Computing systems produce a large volume of highly complex datasets. Such data imposes a grand challenge to conventional storage systems for the need of efficient I/O solutions during both the simulation runtime and data post-processing phases. With the mounting needs of scientific discovery, the read performance of large-scale simulations has becomes a critical issue for the HPC community. In this study, we propose a system-aware optimized data organization strategy that can organize data blocks of multidimensional scientific data efficiently based on simulation output and the underlying storage systems, thereby enabling efficient scientific analytics. Our experimental results demonstrate a performance speedup up to 72 times for the combustion simulation S3D, compared to the logically contiguous data layout.

References

[1]
Adaptable I/O System. http://www.nccs.gov/user-support/center-projects/adios.
[2]
J. H. Chen et al. Terascale direct numerical simulations of turbulent combustion using S3D. Comp. Sci. & Disc., 2(1):015001 (31pp), 2009.
[3]
NCCS. http://www.nccs.gov/computing-resources/.
[4]
T. Shimada, T. Tsuji, and K. Higuchi. A storage scheme for multidimensional data alleviating dimension dependency. In Digital Information Management, 2008. ICDIM 2008. Third International Conference on, pages 662--668, nov. 2008.
[5]
Y. Tian, S. Klasky, H. Abbasi, J. Lofstead, N. P. R. Grout, Q. Liu, Y. Wang, and W. Yu. Edo: Improving read performance for scientific applications through elastic data organization. In CLUSTER'11: Proceedings of the 2011 IEEE International Conference on Cluster Computing, Washington, DC, USA, 2011. IEEE Computer Society.
[6]
Y. Tian and W. Yu. Finding the optimized chunking for multidimensional array on large-scale systems. Technical Report AU-CSSE-PASL/12-TR01, Auburn University, 2012.
[7]
W. Yu and J. Vetter. ParColl: Partitioned Collective I/O on the Cray XT. In International Conference on Parallel Processing (ICPP'08), Portland, OR, 2008.
[8]
W. Yu, J. Vetter, and H. Oral. Performance characterization and optimization of parallel I/O on the cray XT. IPDPS, pages 1--11, April 2008.

Cited By

View all
  • (2019)Understanding Performance-Quality Trade-offs in Scientific Visualization Workflows with Lossy Compression2019 IEEE/ACM 5th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-5)10.1109/DRBSD-549595.2019.00006(1-7)Online publication date: Nov-2019
  • (2016)LandrushProceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2016.58(32-41)Online publication date: 16-May-2016
  • (2012)SMART-IOProceedings of the 2012 IEEE 20th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems10.1109/MASCOTS.2012.30(181-188)Online publication date: 7-Aug-2012

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HPDC '12: Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
June 2012
308 pages
ISBN:9781450308052
DOI:10.1145/2287076
  • General Chair:
  • Dick Epema,
  • Program Chairs:
  • Thilo Kielmann,
  • Matei Ripeanu

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 June 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data layout
  2. i/o

Qualifiers

  • Poster

Conference

HPDC'12
Sponsor:

Acceptance Rates

HPDC '12 Paper Acceptance Rate 23 of 143 submissions, 16%;
Overall Acceptance Rate 166 of 966 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Understanding Performance-Quality Trade-offs in Scientific Visualization Workflows with Lossy Compression2019 IEEE/ACM 5th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-5)10.1109/DRBSD-549595.2019.00006(1-7)Online publication date: Nov-2019
  • (2016)LandrushProceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2016.58(32-41)Online publication date: 16-May-2016
  • (2012)SMART-IOProceedings of the 2012 IEEE 20th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems10.1109/MASCOTS.2012.30(181-188)Online publication date: 7-Aug-2012

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media