Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A high performance processor architecture for multimedia applications

Published: 01 February 2018 Publication History

Abstract

In this paper, an efficient sub-word parallelism (SWP)-enabled Reduced instruction-set Computer (RISC) architecture is proposed. The proposed architecture can perform efficiently for both conventional and multimedia-oriented applications. Speed-up for multimedia applications is achieved by adding the customized SWP instructions in RISC processor core. Rather than operating on a single data, customized instructions perform parallel computations on multiple pixels, packed in word-size registers. The sub-word-sizes in SWP instructions are selected, based upon the pixel sizes (8, 10, 12, 16-bit) in modern multimedia applications. The SWP-RISC processor is designed and implemented on two different CMOS technology nodes (90nm and 45nm). The performance of processor is characterized for different multimedia applications and compared with the state-of-the-art TMS320C64X processor.

References

[1]
R.N.M. Watson, P.G. Neumann, J. Woodruff, M. Roe, J. Anderson, J. Baldwin, Capability hardware enhanced RISC instructions: CHERI instruction-set architecture (version 6), University of Cambridge, Computer Laboratory, 2017.
[2]
M. Rashid, L. Apvrille, R. Pacalet, Application specific processors for multimedia applications, 2008.
[3]
M. Rashid, L. Apvrille, R. Pacalet, Evaluation of ASIPs design with LISATek, 2008.
[4]
K.N. Huu, A.V.D. Duc, T. Vu, Q. Minh, D. Do, V. Lvu, A proposed RISC instruction-set architecture for the MAC unit of 32-bit VLIW DSP processor core, 2014.
[5]
S. Khan, E. Casseau, D. Menard, Re-configurable SWP operator for multimedia processing, 2009.
[6]
S. Khan, E. Casseau, D. Menard, High performance discrete cosine transform operator using multimedia oriented sub-word parallelism, 2015.
[7]
D. F.Liu, L. Zhao, G. Chen, W. Yu, An enhanced FDTD method using SSE instruction-set, 2012.
[8]
Y. Fukazawa, K. Watanabe, Y. Iminoura, T. Kondo, T. Sasaki, SIMD-based datapath with efficient operation structure for motion estimation, 2016.
[9]
S.D. Kim, M.H. Sunwoo, MESIP: A configurable and data reusable motion estimation specific instruction-set processor, IEEE Trans. Circuits Syst. Video Technol., 23 (2013) 1767-1780.
[10]
S.H.F. Chen, H.C.W. Hsiao, Fast detection of microaneurysms in color fundus images, 2016.
[11]
M. Rashid, F. Ferrandi, K. Bertels, HARTES design flow for heterogeneous platforms, 2009.
[12]
M. Gautschi, P. D.Schiavone, A. Traber, I. Loi, A. Pullini, D. Rossi, A near-threshold RISC-v core with DSP extensions for scalable iot endpoint devices, 2017.
[13]
W. Hussain, H. Hoffmann, T. Ahonen, J. Nurmi, Power mitigation by performance equalization in a heterogeneous re-configurable multi-core architecture, J Signal Process Syst (2016) 1-11.
[14]
Processor Designer Tool from Synopsys, Web link: www.synopsys.com/IP/ProcssorIP/asip/procssor-designer/.
[15]
Digital Signal Processor TMS320C64x from Texas Instruments, web link: www.ti.com/lit/ug/spru395b/spru395b.pdf.
[16]
C. Brunelli, F. Garzia, D. Rossi, J. Nurmi, A coarse-grain re-configurable architecture for multimedia applications supporting sub-word and floating-point calculations, J Syst Archit, 56 (2010) 38-47.
[17]
P.V. Bhandarkar, S.S. Limaye, Modelling with LISA-a powerful machine description language, IJCSET, 3 (2013) 1-7.
[18]
S. Khan, E. Casseau, High performance motion estimation operator using multimedia oriented sub-word parallelism, J. Commun. Comput. (2012) 1-14.
[19]
N.I. Cho, S.U. Lee, DCT algorithms for VLSI parallel implementations, IEEE Trans. Acoustics, Speech Signal Process., 38 (2002) 121-127.
[20]
B.K. Mohanty, P.K. Meher, A high-performance FIR filter architecture for fixed and re-configurable applications, IEEE Trans. Very Large Scale Integration (VLSI) Syst., 24 (2016).
[21]
D. Chen, S. Wan, F.S. Bao, EEG-Based seizure detection using discrete wavelet transform through full-level decomposition, Nov 2015.
[22]
N.S. Bhat, Design and ASIC implementation of 2-d DWT IDWT, 2011.
[23]
CSMC technologies FAB Ltd. China, www.csmc.com.
[24]
Cadence: EDA tools and IPs provider, www.cadence.com.
[25]
Joint Photographic Experts Group (JPEG), www.jpeg.org.
[26]
Joint Photographic Experts Group 2000 (JPEG2000), www.jpeg2000.org.
[27]
The Moving Picture Experts Group (MPEG), www.mpeg.chiariglione.org.
[28]
Video compression format H.264, www.openh264.org.
[29]
M. Rashid, M. Imran, A.R. Jafri, Comparative analysis of flexible cryptographic implementations, 2016.

Index Terms

  1. A high performance processor architecture for multimedia applications
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Computers and Electrical Engineering
    Computers and Electrical Engineering  Volume 66, Issue C
    February 2018
    530 pages

    Publisher

    Pergamon Press, Inc.

    United States

    Publication History

    Published: 01 February 2018

    Author Tags

    1. Application-specific instruction-set processor (ASIP)
    2. Multimedia processing
    3. Re-configurable system
    4. Reduced instruction-set computer (RISC)
    5. Sub-word parallelism

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 22 Sep 2024

    Other Metrics

    Citations

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media