Deep Optimizer States: Towards Scalable Training of Transformer Models using Interleaved Offloading
Published In
- Chair: Jiannong Cao
- Program Chair: Zhi Jin
- Program Co-chair: Valerio Schiavoni
- Workshop Chair: Janick Edinger
In-Cooperation
- IFIP
- Usenix
Publisher
Association for Computing Machinery
New York, NY, United States
Qualifiers
- Research-article