Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–8 of 8 results for author: Samajdar, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.01752  [pdf, other

    cs.AR cs.AI cs.LG

    RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU

    Authors: Geonhwa Jeong, Eric Qin, Ananda Samajdar, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna

    Abstract: As AI-based applications become pervasive, CPU vendors are starting to incorporate matrix engines within the datapath to boost efficiency. Systolic arrays have been the premier architectural choice as matrix engines in offload accelerators. However, we demonstrate that incorporating them inside CPUs can introduce under-utilization and stalls due to limited register storage to amortize the fill and… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: This paper is accepted to DAC 2021

  2. arXiv:2108.08295  [pdf, other

    cs.LG cs.AI cs.AR

    AIRCHITECT: Learning Custom Architecture Design and Mapping Space

    Authors: Ananda Samajdar, Jan Moritz Joseph, Matthew Denton, Tushar Krishna

    Abstract: Design space exploration is an important but costly step involved in the design/deployment of custom architectures to squeeze out maximum possible performance and energy efficiency. Conventionally, optimizations require iterative sampling of the design space using simulation or heuristic tools. In this paper we investigate the possibility of learning the optimization task using machine learning an… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

  3. arXiv:2101.04799  [pdf, other

    cs.AR cs.LG

    Self-Adaptive Reconfigurable Arrays (SARA): Using ML to Assist Scaling GEMM Acceleration

    Authors: Ananda Samajdar, Michael Pellauer, Tushar Krishna

    Abstract: With increasing diversity in Deep Neural Network(DNN) models in terms of layer shapes and sizes, the research community has been investigating flexible/reconfigurable accelerator substrates. This line of research has opened up two challenges. The first is to determine the appropriate amount of flexibility within an accelerator array that that can trade-off the performance benefits versus the area… ▽ More

    Submitted 23 April, 2022; v1 submitted 12 January, 2021; originally announced January 2021.

  4. arXiv:2012.12563  [pdf, other

    cs.AR

    Architecture, Dataflow and Physical Design Implications of 3D-ICs for DNN-Accelerators

    Authors: Jan Moritz Joseph, Ananda Samajdar, Lingjun Zhu, Rainer Leupers, Sung-Kyu Lim, Thilo Pionteck, Tushar Krishna

    Abstract: The everlasting demand for higher computing power for deep neural networks (DNNs) drives the development of parallel computing architectures. 3D integration, in which chips are integrated and connected vertically, can further increase performance because it introduces another level of spatial parallelism. Therefore, we analyze dataflows, performance, area, power and temperature of such 3D-DNN-acce… ▽ More

    Submitted 18 February, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

  5. arXiv:2008.11881  [pdf, other

    cs.NE cs.DC cs.LG

    CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices

    Authors: Parth Mannan, Ananda Samajdar, Tushar Krishna

    Abstract: Recent advancements in machine learning algorithms, especially the development of Deep Neural Networks (DNNs) have transformed the landscape of Artificial Intelligence (AI). With every passing day, deep learning based methods are applied to solve new problems with exceptional results. The portal to the real world is the edge. The true impact of AI can only be fully realized if we can have AI agent… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: Accepted and appears in ISPASS 2020

  6. arXiv:1811.02883  [pdf, other

    cs.DC cs.AR

    SCALE-Sim: Systolic CNN Accelerator Simulator

    Authors: Ananda Samajdar, Yuhao Zhu, Paul Whatmough, Matthew Mattina, Tushar Krishna

    Abstract: Systolic Arrays are one of the most popular compute substrates within Deep Learning accelerators today, as they provide extremely high efficiency for running dense matrix multiplications. However, the research community lacks tools to insights on both the design trade-offs and efficient mapping strategies for systolic-array based accelerators. We introduce Systolic CNN Accelerator Simulator (SCALE… ▽ More

    Submitted 1 February, 2019; v1 submitted 16 October, 2018; originally announced November 2018.

  7. arXiv:1808.01363  [pdf, other

    cs.NE

    GeneSys: Enabling Continuous Learning through Neural Network Evolution in Hardware

    Authors: Ananda Samajdar, Parth Mannan, Kartikay Garg, Tushar Krishna

    Abstract: Modern deep learning systems rely on (a) a hand-tuned neural network topology, (b) massive amounts of labeled training data, and (c) extensive training over large-scale compute resources to build a system that can perform efficient image classification or speech recognition. Unfortunately, we are still far away from implementing adaptive general purpose intelligent systems which would need to lear… ▽ More

    Submitted 13 September, 2018; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: This work is accepted and will appear in MICRO-51

  8. arXiv:1803.11232  [pdf, other

    cs.CV

    Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision

    Authors: Yuhao Zhu, Anand Samajdar, Matthew Mattina, Paul Whatmough

    Abstract: Continuous computer vision (CV) tasks increasingly rely on convolutional neural networks (CNN). However, CNNs have massive compute demands that far exceed the performance and energy constraints of mobile devices. In this paper, we propose and develop an algorithm-architecture co-designed system, Euphrates, that simultaneously improves the energy-efficiency and performance of continuous vision task… ▽ More

    Submitted 29 March, 2018; originally announced March 2018.