Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–3 of 3 results for author: Goli, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2301.05125  [pdf, other

    cs.GR

    Adaptive Dynamic Global Illumination

    Authors: Sayantan Datta, Negar Goli, Jerry Zhang

    Abstract: We present an adaptive extension of probe based global illumination solution that enhances the response to dynamic changes in the scene while while also enabling an order of magnitude increase in probe count. Our adaptive sampling strategy carefully places samples in regions where we detect time varying changes in radiosity either due to a change in lighting, geometry or both. Even with large numb… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Project page: https://sayan1an.github.io/adgi.html

  2. arXiv:1811.08933  [pdf, other

    cs.DC

    Analyzing Machine Learning Workloads Using a Detailed GPU Simulator

    Authors: Jonathan Lew, Deval Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor Aamodt

    Abstract: Most deep neural networks deployed today are trained using GPUs via high-level frameworks such as TensorFlow and PyTorch. This paper describes changes we made to the GPGPU-Sim simulator to enable it to run PyTorch by running PTX kernels included in NVIDIA's cuDNN library. We use the resulting modified simulator, which has been made available publicly with this paper, to study some simple deep lear… ▽ More

    Submitted 26 January, 2019; v1 submitted 18 November, 2018; originally announced November 2018.

    Comments: Source code available at: https://github.com/gpgpu-sim/gpgpu-sim_distribution/tree/dev

  3. arXiv:1811.08309  [pdf, other

    cs.MS cs.AR

    Modeling Deep Learning Accelerator Enabled GPUs

    Authors: Md Aamir Raihan, Negar Goli, Tor Aamodt

    Abstract: The efficacy of deep learning has resulted in its use in a growing number of applications. The Volta graphics processor unit (GPU) architecture from NVIDIA introduced a specialized functional unit, the "tensor core", that helps meet the growing demand for higher performance for deep learning. In this paper we study the design of the tensor cores in NVIDIA's Volta and Turing architectures. We furth… ▽ More

    Submitted 20 February, 2019; v1 submitted 18 November, 2018; originally announced November 2018.