Export Citations
1 Results for: Book/Issue: PDSW '15: Proceedings of the 10th Parallel Data Storage Workshop
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
Searched The ACM Guide to Computing Literature (3,834,922 records)|Limit your search to The ACM Full-Text Collection (773,172 records)
Showing 1 - 1of1 Results
- research-articleNovember 2015
BAD-check: bulk asynchronous distributed checkpointing
PDSW '15: Proceedings of the 10th Parallel Data Storage WorkshopPages 19–24https://doi.org/10.1145/2834976.2834981Leadership-scale scientific simulations running as tens of thousands of tightly-coupled MPI processes are vulnerable to interruption due to a single process or node failure. Due to the dependence of each state calculation on the successful completion of ...