Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–7 of 7 results for author: Yuan, G

Searching in archive eess. Search in all archives.
.
  1. arXiv:2208.05163  [pdf, other

    cs.CV cs.LG eess.IV

    Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization

    Authors: Zhengang Li, Mengshu Sun, Alec Lu, Haoyu Ma, Geng Yuan, Yanyue Xie, Hao Tang, Yanyu Li, Miriam Leeser, Zhangyang Wang, Xue Lin, Zhenman Fang

    Abstract: Vision transformers (ViTs) are emerging with significantly improved accuracy in computer vision tasks. However, their complex architecture and enormous computation/storage demand impose urgent needs for new hardware accelerator design methodology. This work proposes an FPGA-aware automatic ViT acceleration framework based on the proposed mixed-scheme quantization. To the best of our knowledge, thi… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: Published in FPL2022

  2. arXiv:2206.01244  [pdf, other

    cs.CV eess.IV

    Real-Time Portrait Stylization on the Edge

    Authors: Yanyu Li, Xuan Shen, Geng Yuan, Jiexiong Guan, Wei Niu, Hao Tang, Bin Ren, Yanzhi Wang

    Abstract: In this work we demonstrate real-time portrait stylization, specifically, translating self-portrait into cartoon or anime style on mobile devices. We propose a latency-driven differentiable architecture search method, maintaining realistic generative quality. With our framework, we obtain $10\times$ computation reduction on the generative model and achieve real-time video stylization on off-the-sh… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

  3. arXiv:2108.08910  [pdf, other

    eess.IV cs.AI cs.CV cs.LG cs.NE

    Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search

    Authors: Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David Kaeli, Bin Ren, Xue Lin, Yanzhi Wang

    Abstract: Though recent years have witnessed remarkable progress in single image super-resolution (SISR) tasks with the prosperous development of deep neural networks (DNNs), the deep learning methods are confronted with the computation and memory consumption issues in practice, especially for resource-limited platforms such as mobile devices. To overcome the challenge and facilitate the real-time deploymen… ▽ More

    Submitted 14 February, 2023; v1 submitted 18 August, 2021; originally announced August 2021.

  4. arXiv:1912.05416  [pdf, other

    eess.SP cs.CV cs.DC cs.ET cs.LG cs.NE

    A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation

    Authors: Geng Yuan, Xiaolong Ma, Sheng Lin, Zhengang Li, Caiwen Ding

    Abstract: The computing wall and data movement challenges of deep neural networks (DNNs) have exposed the limitations of conventional CMOS-based DNN accelerators. Furthermore, the deep structure and large model size will make DNNs prohibitive to embedded systems and IoT devices, where low power consumption are required. To address these challenges, spin orbit torque magnetic random-access memory (SOT-MRAM)… ▽ More

    Submitted 24 November, 2019; originally announced December 2019.

  5. arXiv:1908.10017  [pdf, other

    eess.SP cs.AR cs.ET cs.LG cs.NE

    Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation

    Authors: Xiaolong Ma, Geng Yuan, Sheng Lin, Caiwen Ding, Fuxun Yu, Tao Liu, Wujie Wen, Xiang Chen, Yanzhi Wang

    Abstract: The state-of-art DNN structures involve intensive computation and high memory storage. To mitigate the challenges, the memristor crossbar array has emerged as an intrinsically suitable matrix computation and low-power acceleration framework for DNN applications. However, the high accuracy solution for extreme model compression on memristor crossbar array architecture is still waiting for unravelin… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

  6. arXiv:1806.04716  [pdf

    eess.SP physics.ins-det

    Application of FPGA Acceleration in ADC Performance Calibration

    Authors: Guangyuan Yuan, Zhe cao, Shuwen Wang, Shubin Liu, Qi An

    Abstract: In recent years, high speed and high resolution analog-to-digital converter (ADC) is widely employed in many physical experiments, especially in high precision time and charge measurement. The rapid increasing amount of digitized data demands faster computing. FPGA acceleration has an attracting prospect in data process for its stream process and parallel process feature. In this paper, an ADC per… ▽ More

    Submitted 9 June, 2018; originally announced June 2018.

    Comments: 2 pages, 4 figures, 21st IEEE Real Time Conference

  7. arXiv:1802.09879  [pdf, other

    math.NA eess.IV math.OC

    L0TV: A Sparse Optimization Method for Impulse Noise Image Restoration

    Authors: Ganzhao Yuan, Bernard Ghanem

    Abstract: Total Variation (TV) is an effective and popular prior model in the field of regularization-based image processing. This paper focuses on total variation for removing impulse noise in image restoration. This type of noise frequently arises in data acquisition and transmission due to many reasons, e.g. a faulty sensor or analog-to-digital converter errors. Removing this noise is an important task i… ▽ More

    Submitted 28 December, 2018; v1 submitted 27 February, 2018; originally announced February 2018.

    Comments: to appear in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)