Multimedia Execution Hardware Accelerator

Hakkennes, Edwin; Vassiliadis, Stamatis

doi:10.1023/A:1011117608815


Abstract

In this paper we show that some expressions frequently used in multimedia applications can be formulated as a general add-multiply-add operation. We further show a hardwired implementation of the Add-Multiply-Add instruction which is no more complex than the multiplier implementation. Furthermore, we show that two frequently used motion estimation operations, the Sum and Mean of Absolute Differences, can be implemented in hardware with approximately the same cycle time as the multiplication. We also show that our approach can be extended easily to compute the Sum and Mean of Absolute Differences of a 16×16 pixel block in no more than four machine cycles. Additionally, we propose a hardwired codec mechanism for the Paeth predictor used in the Portable Network Graphics (PNG) standard that requires at most two general-purpose ALU cycles. We extend the Paeth unit to include the median, maximum and minimum operations on three inputs with no additional cycle time, and we also extend the Add-Multiply-Add unit to include the mean of three numbers. Finally, we propose a multimedia hardware accelerator to accommodate all the proposed operations. The proposed unit is an extension of the multiply pipeline with ALU extensions, with no extra stages added. The unit operates on 32 instructions in total.
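As a software reference for the operations named in the abstract, the following Python sketch may help. It is not the authors' hardware design. The Paeth predictor follows the definition in the PNG specification. The operand ordering of `add_multiply_add`, written here as (a + b) * c + d, is an assumption, since the abstract does not give the exact form. All function names are illustrative.

```python
def add_multiply_add(a, b, c, d):
    """Assumed form of a general add-multiply-add: (a + b) * c + d."""
    return (a + b) * c + d


def sad(block_a, block_b):
    """Sum of Absolute Differences between two equally sized pixel blocks."""
    return sum(
        abs(x - y)
        for row_a, row_b in zip(block_a, block_b)
        for x, y in zip(row_a, row_b)
    )


def mad(block_a, block_b):
    """Mean of Absolute Differences: the SAD divided by the pixel count."""
    pixel_count = sum(len(row) for row in block_a)
    return sad(block_a, block_b) / pixel_count


def paeth_predictor(a, b, c):
    """Paeth predictor as defined in the PNG specification.

    a is the left neighbour, b the neighbour above, c the upper-left one.
    The neighbour closest to the initial estimate a + b - c is returned,
    with ties broken in the order a, b, c.
    """
    p = a + b - c
    pa = abs(p - a)
    pb = abs(p - b)
    pc = abs(p - c)
    if pa <= pb and pa <= pc:
        return a
    if pb <= pc:
        return b
    return c


def median3(a, b, c):
    """Median of three inputs: the total minus the minimum and the maximum."""
    return a + b + c - min(a, b, c) - max(a, b, c)
```

For a 16×16 block, `sad` accumulates 256 absolute differences. The paper claims a hardware unit can produce this result in no more than four machine cycles, whereas the loop above only illustrates the arithmetic being performed.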

Author information

Authors and Affiliations

Department of Electrical Engineering, Delft University of Technology, Mekelweg 4, 2628 CD, Delft, The Netherlands
Edwin Hakkennes & Stamatis Vassiliadis

About this article

Cite this article

Hakkennes, E., Vassiliadis, S. Multimedia Execution Hardware Accelerator. The Journal of VLSI Signal Processing-Systems for Signal, Image, and Video Technology 28, 221–234 (2001). https://doi.org/10.1023/A:1011117608815

Published: 01 July 2001
Issue Date: July 2001
DOI: https://doi.org/10.1023/A:1011117608815
