Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2228360.2228568acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article

Platform 2012, a many-core computing accelerator for embedded SoCs: performance evaluation of visual analytics applications

Published: 03 June 2012 Publication History

Abstract

P2012 is an area- and power-efficient many-core computing accelerator based on multiple globally asynchronous, locally synchronous processor clusters. Each cluster features up to 16 processors with independent instruction streams sharing a multi-banked one-cycle access L1 data memory, a multi-channel DMA engine and specialized hardware for synchronization and aggressive power management. P2012 is 3D stacking ready and can be customized to achieve extreme area and energy efficiency by adding domain-specific HW IPs to the cluster. The first P2012 SoC prototype in 28nm CMOS will sample in Q3, featuring four 16-processor clusters, a 1MB L2 memory and delivering 80GOPS (with 32 bit single precision floating point support) in 18mm2 with 2W power consumption (worst-case). P2012 can run standard OpenCL™ and proprietary Native Programming Model SW components to achieve the highest level of control on application-to-resource mapping. A dedicated version of the OpenCV vision library is provided in the P2012 SW Development Kit to enable visual analytics acceleration. This paper will discuss preliminary performance measurements of common feature extraction and tracking algorithms, parallelized on P2012, versus sequential execution on ARM CPUs.

References

[1]
F. Arnaud, S. Colquhoun, A. L. Mareau, S. Kohler, S. Jeannot, F. Hasbani, R. Paulin, S. Cremer, C. Charbuillet, G. Druais, P. Scheer, 2011. Technology-Circuit Convergence for Full-SOC Platform in 28 nm and Beyond. International Electron Devices Meeting.
[2]
Y Thonnart, P. Vivet, F. Clermidy, DATE 2010. A fully-asynchronous low-power framework for GALS NoC integration.
[3]
L. Benini, E. Flamand, D. Fuin, D. Melpignano, DATE 2012. P2012: Building an Ecosystem for a Scalable, Modular and high-efficiency Embedded Computing Accelerator.
[4]
MIND Component framework project - Online: mind.ow2.org
[5]
Khronos OpenCL - Online: http://www.khronos.org/opencl/
[6]
FAST corner detection--Online: http://www.edwardrosten.com/work/fast.html
[7]
Jason Clemons, Haishan Zhu, Silvio Savarese, and Todd Austin, 2011. MEVBench, An Embedded Vision Benchmarking Suite IEEE Interational Symposium on Workload Characterization. Online: http://www.eecs.umich.edu/mevbench/
[8]
David G. Lowe, 2004. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60, 2 (2004), pp. 91--110.
[9]
The VLFeat open source library--Online: www.vlfeat.org
[10]
In Kyu Park et al., 2011. Design and Performance Evaluation of Image Processing Algorithms. IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 1 (Jan 2011)
[11]
A. Ensor, S. Hall, 2011. GPU-based Image Analysis on Mobile Devices. Twenty-sixth International Conference Image and Vision Computing New
[12]
Zealand (IVCNZ 2011)
[13]
Khronos Vision--Online: http://www.khronos.org/vision
[14]
Zynq-7000 Extensible Processing Platform--Online: www.xilinx.com
[15]
Arria V FPGA SX SoC--Online: www.altera.com
[16]
Jean-Yves Bouguet, Pyramidal Implementation of the Lucas Kanade Feature Tracker.--Online: http://robots.stanford.edu/cs223b04/algo_tracking.pdf.

Cited By

View all
  • (2022)FLIA: Architecture of Collaborated Mobile GPU and FPGA Heterogeneous ComputingElectronics10.3390/electronics1122375611:22(3756)Online publication date: 16-Nov-2022
  • (2021)The Design of a 2D Graphics Accelerator for Embedded SystemsElectronics10.3390/electronics1004046910:4(469)Online publication date: 15-Feb-2021
  • (2021)Inter-kernel communication facility of a distributed operating system for NoC-based lightweight manycoresJournal of Parallel and Distributed Computing10.1016/j.jpdc.2021.04.002154(1-15)Online publication date: Aug-2021
  • Show More Cited By

Index Terms

  1. Platform 2012, a many-core computing accelerator for embedded SoCs: performance evaluation of visual analytics applications

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DAC '12: Proceedings of the 49th Annual Design Automation Conference
    June 2012
    1357 pages
    ISBN:9781450311991
    DOI:10.1145/2228360
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 June 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. 3D stacking
    2. SoC
    3. computer vision
    4. feature extraction
    5. low-power
    6. many-core
    7. process aware

    Qualifiers

    • Research-article

    Conference

    DAC '12
    Sponsor:
    DAC '12: The 49th Annual Design Automation Conference 2012
    June 3 - 7, 2012
    California, San Francisco

    Acceptance Rates

    Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

    Upcoming Conference

    DAC '25
    62nd ACM/IEEE Design Automation Conference
    June 22 - 26, 2025
    San Francisco , CA , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)18
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 15 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)FLIA: Architecture of Collaborated Mobile GPU and FPGA Heterogeneous ComputingElectronics10.3390/electronics1122375611:22(3756)Online publication date: 16-Nov-2022
    • (2021)The Design of a 2D Graphics Accelerator for Embedded SystemsElectronics10.3390/electronics1004046910:4(469)Online publication date: 15-Feb-2021
    • (2021)Inter-kernel communication facility of a distributed operating system for NoC-based lightweight manycoresJournal of Parallel and Distributed Computing10.1016/j.jpdc.2021.04.002154(1-15)Online publication date: Aug-2021
    • (2021)LWMPI: An MPI library for NoC‐based lightweight manycore processors with on‐chip memory constraintsConcurrency and Computation: Practice and Experience10.1002/cpe.669335:17Online publication date: 8-Nov-2021
    • (2019)Queue Based Memory Management Unit for Heterogeneous MPSoCs2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)10.23919/DATE.2019.8715129(1297-1300)Online publication date: Mar-2019
    • (2019)Alleviating Scalability Limitation of Accelerator-Based PlatformsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2018.284663238:7(1317-1330)Online publication date: Jul-2019
    • (2019)On the Performance and Isolation of Asymmetric Microkernel Design for Lightweight Manycores2019 IX Brazilian Symposium on Computing Systems Engineering (SBESC)10.1109/SBESC49506.2019.9046080(1-8)Online publication date: Nov-2019
    • (2019)A Memory-Optimized and Energy-Efficient CNN Acceleration Architecture Based on FPGA2019 IEEE 28th International Symposium on Industrial Electronics (ISIE)10.1109/ISIE.2019.8781162(2137-2141)Online publication date: Jun-2019
    • (2019)An Interconnect-Centric Approach to the Flexible Partitioning and Isolation of Many-Core Accelerators for Fog Computing2019 XXXIV Conference on Design of Circuits and Integrated Systems (DCIS)10.1109/DCIS201949030.2019.8959943(1-6)Online publication date: Nov-2019
    • (2019)StreamDriveJournal of Signal Processing Systems10.1007/s11265-018-1351-191:3-4(275-301)Online publication date: 1-Mar-2019
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media