Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–7 of 7 results for author: Cojean, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2103.10116  [pdf, other

    cs.DC cs.MS cs.PF

    Porting a sparse linear algebra math library to Intel GPUs

    Authors: Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt

    Abstract: With the announcement that the Aurora Supercomputer will be composed of general purpose Intel CPUs complemented by discrete high performance Intel GPUs, and the deployment of the oneAPI ecosystem, Intel has committed to enter the arena of discrete high performance GPUs. A central requirement for the scientific computing community is the availability of production-ready software stacks and a glimps… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: preprint, not submitted

  2. arXiv:2011.08879  [pdf, other

    cs.DC cs.MS cs.PF cs.SE

    Ginkgo -- A Math Library designed for Platform Portability

    Authors: Terry Cojean, Yu-Hsiang "Mike" Tsai, Hartwig Anzt

    Abstract: The first associations to software sustainability might be the existence of a continuous integration (CI) framework; the existence of a testing framework composed of unit tests, integration tests, and end-to-end tests; and also the existence of software documentation. However, when asking what is a common deathblow for a scientific software product, it is often the lack of platform and performance… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Submitted to Parallel Computing Journal (PARCO)

  3. arXiv:2008.08478  [pdf, other

    cs.MS cs.PF

    Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse Linear Algebra Computations

    Authors: Yuhsiang Mike Tsai, Terry Cojean, Hartwig Anzt

    Abstract: GPU accelerators have become an important backbone for scientific high performance computing, and the performance advances obtained from adopting new GPU hardware are significant. In this paper we take a first look at NVIDIA's newest server line GPU, the A100 architecture part of the Ampere generation. Specifically, we assess its performance for sparse linear algebra operations that form the backb… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

  4. arXiv:2007.06674  [pdf, other

    cs.MS math.NA

    A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic

    Authors: Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin Carson, Terry Cojean, Jack Dongarra, Mark Gates, Thomas Grützmacher, Nicholas J. Higham, Sherry Li, Neil Lindquist, Yang Liu, Jennifer Loe, Piotr Luszczek, Pratik Nayak, Sri Pranesh, Siva Rajamanickam, Tobias Ribizel, Barry Smith, Kasia Swirydowicz, Stephen Thomas, Stanimire Tomov, Yaohung M. Tsai, Ichitaro Yamazaki, Urike Meier Yang

    Abstract: Within the past years, hardware vendors have started designing low precision special function units in response to the demand of the Machine Learning community and their demand for high compute power in low precision formats. Also the server-line products are increasingly featuring low-precision special function units, such as the NVIDIA tensor cores in ORNL's Summit supercomputer providing more t… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: Technical report as a part of the Exascale computing project (ECP)

    ACM Class: G.1.3; G.4

  5. arXiv:2006.16852  [pdf, other

    cs.MS

    Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing

    Authors: Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yuhsiang Mike Tsai, Enrique S. Quintana-Ortí

    Abstract: In this paper, we present Ginkgo, a modern C++ math library for scientific high performance computing. While classical linear algebra libraries act on matrix and vector objects, Ginkgo's design principle abstracts all functionality as "linear operators", motivating the notation of a "linear operator algebra library". Ginkgo's current focus is oriented towards providing sparse linear algebra functi… ▽ More

    Submitted 1 July, 2020; v1 submitted 30 June, 2020; originally announced June 2020.

    Comments: Preprint submitted to ACM Transactions on Mathematical Software

    ACM Class: D.2; G.1.3; G.4

  6. arXiv:2006.14290  [pdf, other

    cs.MS

    Preparing Ginkgo for AMD GPUs -- A Testimonial on Porting CUDA Code to HIP

    Authors: Yuhsiang M. Tsai, Terry Cojean, Tobias Ribizel, Hartwig Anzt

    Abstract: With AMD reinforcing their ambition in the scientific high performance computing ecosystem, we extend the hardware scope of the Ginkgo linear algebra package to feature a HIP backend for AMD GPUs. In this paper, we report and discuss the porting effort from CUDA, the extension of the HIP framework to add missing features such as cooperative groups, the performance price of compiling HIP code for A… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: Preprint submitted to HeteroPar

  7. arXiv:2003.05361  [pdf, other

    cs.DC cs.MS

    Evaluating Abstract Asynchronous Schwarz solvers on GPUs

    Authors: Pratik Nayak, Terry Cojean, Hartwig Anzt

    Abstract: With the commencement of the exascale computing era, we realize that the majority of the leadership supercomputers are heterogeneous and massively parallel even on a single node with multiple co-processors such as GPUs and multiple cores on each node. For example, ORNLs Summit accumulates six NVIDIA Tesla V100s and 42 core IBM Power9s on each node. Synchronizing across all these compute resources… ▽ More

    Submitted 5 May, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Preprint submitted to IJHPCA