Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3239576.3239609acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicaipConference Proceedingsconference-collections
research-article

A Deep Learning Prediction Process Based on Low-power Heterogeneous Multi Core Architecture

Published: 16 June 2018 Publication History

Abstract

With the rapid development of machine learning both in theory and practice in the past decade. And recently, it is widely used in applications and cloud services. As the emerging field of machine learning, deep learning shows excellent ability in solving complex learning problems. In this paper, we designed a deep learning prediction process based on low-power heterogeneous multi core architecture. Firstly, the fundamental principle of image recognition method based on deep learning reviewed as the basis of the research. Secondly, a set of key algorithm design to parallel access and process image for object detection based on Parallella multi core platform was proposed to improve the detection speed and the computational resource efficiency on single node. Thirdly, Rockchip RK3288 SoC with 4 Arm Cortex-A17 cores hardware platform, Xilinx Zynq and Adapteva Epiphany combined heterogeneous multi core hardware platform was introduced. Some key designs based on Parallella board's architecture to achieve image recognition was proposed to improve the recognition speed and the computational resource efficiency. Finally, The experimental results that based on Parallella board indicate that the proposed image recognition system can achieve nearly 14.8 times speedup than dual-core Arm which was integrated in Parallella board with similar accuracy and achieve 8.6 times speedup than RK3288 board which has the newest series of high-performance Arm core CPU as the control included 4 Arm Cortex-A17 cores.

References

[1]
Tanabe Y, Sumiyoshi M, Nishiyama M, et al. A 464GOPS 620GOPS/W heterogeneous multi-core SoC for image-recognition applications{C}// Solid-State Circuits Conference Digest of Technical Papers. IEEE, 2012:222--223.
[2]
Loten T, Green R. Embedded computer vision framework on a multimedia processor{C}// Image and Vision Computing New Zealand, 2008. Ivcnz 2008. International Conference. IEEE, 2009:1--5.
[3]
Mannan M A, Kassim A, Ho C S, et al. Software implementation of a computer-vision-based tool condition monitoring system{J}. Technical Paper - Society of Manufacturing Engineers. MR, 2002.
[4]
Hegde G, Siddhartha, Ramasamy N, et al. "CaffePresso: An optimized library for Deep Learning on embedded accelerator-based platforms" International Conference on Compilers, Architectures and Synthesis for Embedded Systems. 2016. ACM.
[5]
Ashiquzzaman A, Tushar A K. Handwritten Arabic numeral recognition using deep learning neural networks{C}// IEEE International Conference on Imaging, Vision & Pattern Recognition. IEEE, 2017:1--4.
[6]
T. Chen, Z. Du, N. Sun, J. Wang, C. Wu, Y. Chen, et al., "DianNao: a small-footprint high-throughput accelerator for ubiquitous machine learning," in Proceedings of the 19th internation conference on Architectural support for programming languages and operating systems, 2014, pp. 269--284.
[7]
Yu Q, Wang C, Ma X, et al. A Deep Learning Prediction Process Accelerator Based FPGA{J}. Proceedings of the Annual Acm Symposium on Theory of Computing, 2015:585--594."
[8]
L. Cavigelli, D. Gschwend, C. Mayer, S. Willi, B. Muheim, and L. Benini. Origami: A convolutional network accelerator. In Proceedings of the 25th Edition on Great Lake Symposium on VLSI, GLSVLSI '15, pages 199--204, New York, NY, USA, 2015. ACM.
[9]
Rath AK, Dehuri S. Non-dominated sorting genetic algorithms for heterogeneous embedded system design{C}//International Conference on Intelligent Sensing and Information Processing. IEEE, 2006:46--50.
[10]
Architecture Support for Task Out-of-order Execution in MPSoCs," IEEE Transactions on Computers, pp. 1--1.2014.
[11]
Xilinx Inc, "Zynq-7000 All Programmable SoC Technical Reference
[12]
Adapteva Inc, "Epiphany E16G3 datasheet," Revision 14.03.11, 2013.
[13]
Adapteva Inc, "Parallella-1.x Reference Manual," Revision 14.09.09, 2014.
[14]
Adapteva Inc, "Epiphany E16G3 datasheet," Revision 14.03.11, 2013.

Index Terms

  1. A Deep Learning Prediction Process Based on Low-power Heterogeneous Multi Core Architecture

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICAIP '18: Proceedings of the 2nd International Conference on Advances in Image Processing
    June 2018
    261 pages
    ISBN:9781450364607
    DOI:10.1145/3239576
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • University of Electronic Science and Technology of China: University of Electronic Science and Technology of China
    • Southwest Jiaotong University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 June 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Accelerator
    2. Arm Cortex core
    3. Epiphany
    4. multi core
    5. the deep learning prediction process

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICAIP '18

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 86
      Total Downloads
    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media