default search action
37th ISC 2022: Hamburg, Germany
- Ana Lucia Varbanescu, Abhinav Bhatele, Piotr Luszczek, Marc Baboulin:
High Performance Computing - 37th International Conference, ISC High Performance 2022, Hamburg, Germany, May 29 - June 2, 2022, Proceedings. Lecture Notes in Computer Science 13289, Springer 2022, ISBN 978-3-031-07311-3
Architecture, Networks, and Storage
- Qinghua Zhou, Pouya Kousha, Quentin Anthony, Kawthar Shafie Khorassani, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters. 3-25 - Yuval Shpigelman, Gilad Shainer, Richard L. Graham, Yong Qin, Gerardo Cisneros-Stoianowski, Craig B. Stunkel:
NVIDIA's Quantum InfiniBand Network Congestion Control Technology and Its Impact on Application Performance. 26-43 - Marjan Fariborz, Mahyar Samani, Pouya Fotouhi, Roberto Proietti, Il-Min Yi, Venkatesh Akella, Jason Lowe-Power, Samuel Palermo, S. J. Ben Yoo:
LLM: Realizing Low-Latency Memory by Exploiting Embedded Silicon Photonics for Irregular Workloads. 44-64 - Jesmin Jahan Tithi, Fabio Checconi, Douglas Doerfler, Fabrizio Petrini:
SU3_Bench on a Programmable Integrated Unified Memory Architecture (PIUMA) and How that Differs from Standard NUMA CPUs. 65-84
Machine Learning, AI, and Emerging Technologies
- Pouya Kousha, Arpan Jain, Ayyappa Kolli, Prasanna Sainath, Hari Subramoni, Aamir Shafi, Dhabaleswar K. Panda:
"Hey CAI" - Conversational AI Enabled User Interface for HPC Tools. 87-108 - Arpan Jain, Aamir Shafi, Quentin Anthony, Pouya Kousha, Hari Subramoni, Dhabaleswar K. Panda:
Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters. 109-130
HPC Algorithms and Applications
- Peter Munch, Karl Ljungkvist, Martin Kronbichler:
Efficient Application of Hanging-Node Constraints for Matrix-Free High-Order FEM Computations on CPU and GPU. 133-152 - Baojiu Li, Holger Schulz, Tobias Weinzierl, Han Zhang:
Dynamic Task Fusion for a Block-Structured Finite Volume Solver over a Dynamically Adaptive Mesh with Local Time Stepping. 153-173 - Yi-Hua Chung, Cheng-Jhih Shih, Shih-Hao Hung:
Accelerating Simulated Quantum Annealing with GPU and Tensor Cores. 174-191 - Ioannis Sakiotis, Kamesh Arumugam, Marc F. Paterno, Desh Ranjan, Balsa Terzic, Mohammad Zubair:
m-Cubes: An Efficient and Portable Implementation of Multi-dimensional Integration for GPUs. 192-209
Performance Modeling, Evaluation, and Analysis
- Onur Cankur, Abhinav Bhatele:
Comparative Evaluation of Call Graph Generation by Profiling Tools. 213-232 - Mohammad Alaul Haque Monil, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
MAPredict: Static Analysis Driven Memory Access Prediction Framework for Modern CPUs. 233-255 - Nicolas Denoyelle, Swann Perarnau, Kamil Iskra, Balazs Gerofi:
Rapid Execution Time Estimation for Heterogeneous Memory Systems Through Differential Tracing. 256-274 - Ana Luisa Veroneze Solórzano, Lucas Mello Schnorr:
Understanding Distributed Deep Learning Performance by Correlating HPC and Machine Learning Measurements. 275-292 - Oliver Hacker, Matthias Korch, Johannes Seiferth:
A Motivating Case Study on Code Variant Selection by Reinforcement Learning. 293-312
Programming Environments and System Software
- Atmn Patel, Johannes Doerfert:
Remote OpenMP Offloading. 315-333 - Raju Ram, Daniel Grünewald, Nicolas R. Gauger:
Hybrid Parallel ILU Preconditioner in Linear Solver Library GaspiLS. 334-353 - Alexandre F. Boyer, Christophe Haen, Federico Stagni, David R. C. Hill:
A Subset of the CERN Virtual Machine File System: Fast Delivering of Complex Software Stacks for Supercomputing Resources. 354-371
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.