Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1572769.1572775acmconferencesArticle/Chapter ViewAbstractPublication PageshpgConference Proceedingsconference-collections
research-article

CFU: multi-purpose configurable filtering unit for mobile multimedia applications on graphics hardware

Published: 01 August 2009 Publication History

Abstract

In order to increase the capability of mobile GPUs in image/video processing, a multi-purpose configurable filtering unit (CFU), which is a new configurable unit for image filtering on stream processing architecture, is proposed in this paper. CFU is located in the texture unit of a GPU and can efficiently execute many kinds of filtering operations by directly accessing multi-bank texture cache and specially-designed data-paths. The following programmabilities are supported in our proposed CFU. First, different sampling point windows can be selected by programmers. Besides, the arithmetic type of the filter can be chosen. Not only original texture filtering functions and finite impulse response (FIR) filters, morphological operations in computer vision are also embedded in CFU. Furthermore, the weighting coefficients of FIR filters and morphological operations can be defined by programmers. Simulation results show that in average, compared with conventional texture unit, 25.35% of processing time in H.264/AVC motion compensation and 58.6% of processing time in video segmentation can be reduced with the assistance of CFU.

References

[1]
Akenine-Möller, T., and Ström, J. 2003. Graphics for the masses: A hardware rasterization architecture for mobile phones. ACM Transactions on Graphics 22, 3 (July), 801--808.
[2]
Chien, S.-Y., Huang, Y.-W., Hsieh, B.-Y., Ma, S.-Y., and Chen, L.-G. 2004. Fast video segmentation algorithm with shadow cancellation, global motion compensation, and adaptive threshold techniques. IEEE Transactions on Multimedia 6, 5 (Oct.), 732--748.
[3]
Colantoni, P., and Nabil Boukala, J. D. R. 2003. Fast and accurate color image processing using 3D graphics cards. In Proceedings of Vision, Modeling, and Visualization.
[4]
Fernando, R., and Kilgard, M. J. 2003. The Cg Tutorial. Addison-Wesley.
[5]
Hakura, Z., and Gupta, A. 1997. The design and analysis of cache architecture for texture mapping. In Proceedings of 24th International Symposium of Computer Architecture, 108--120.
[6]
Han, B., and Zhou, B. 2006. Efficient video decoding on GPUs by point based rendering. In Proceedings of the ACM SIGGRAPH/Eurographics conference on Graphics hardware, 79--86.
[7]
Haralick, R. M., and Shapiro, L. G. 1992. Computer and Robot Vision. Addison Wesley Longman Publishing Company.
[8]
Hirvonen, A., and Leppänen, T. 2005. H.263 video decoding on programmable graphics hardware. In Proceedings of IEEE International Symposium on Circuits and Systems, 902--907.
[9]
Igehy, H., Eldridge, M., and Proudfoot, K. 1998. Prefetching in a texture cache architecture. In Proceedings of the ACM SIGGRAPH/Eurographics conference on Graphics hardware.
[10]
ITU-T International Telecommunication Union. 1995. Draft ITU-T Recommendation H.263(Video Coding for Low Bitrate Communication). KPN Research.
[11]
Kelly, F., and Kokaram, A. 2004. Fast image interpolation for motion estimation using graphics hardware. In Proceedings of the IS&T/SPIE Electronic Imaging, 184--194.
[12]
Lee, C.-Y., Lin, Y.-C., Wu, C.-L., Chang, C.-H., Tsao, Y.- M., and Chien, S.-Y. 2007. Multi-pass and frame parallel algorithms of motion estimation in H.264/AVC for generic GPU. In IEEE International Conference on Multimedia and Expo, 1603--1606.
[13]
Meinds, K., and Barenbrug, B. 2002. Resample hardware for 3D graphics. In Proceedings of the ACM SIGGRAPH/Eurographics conference on Graphics hardware, 17--26.
[14]
Owens, J. D., Kapasi, U. J., Mattson, P., Towles, B., Serebrin, B., Rixner, S., and Dally, W. J. 2002. Media processing applications on the Imagine stream processor. In Proceedings of the IEEE International Conference on Computer Design, 295--302.
[15]
Owens, J. D., Luebke, D., Govindaraju, N., Harris, M., Krüger, J., Lefohn, A. E., and Purcell, T. J. 2007. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 26, 1 (Mar.), 80--113.
[16]
Ström, J., and Akenine-Möller, T. 2005. iPACKMAN: high-quality, low-complexity texture compression for mobile phones. In Proceedings of the ACM SIGGRAPH/Eurographics conference on Graphics hardware, 63--70.
[17]
Sun, C.-H., Tsao, Y.-M., and Chien, S.-Y. 2009. High-quality mipmapping texture compression with alpha maps for graphics processing units. IEEE Transactions on Multimedia 11, 4 (June), 589--599.
[18]
Wiegand, T. 2003. Draft ITU-T recommendation and final draft international standard of joint video specification(ITU-T Rec. H.264/ISO/IEC 14496-10 AVC). Joint Video Team(JVT) of ISO/IEC MPEG and ITU-T VCEG, JVT-G050rl, May.
[19]
Williams, L. 1983. Pyramidal parametrics. In Proceedings of ACM SIGGRAPH.
[20]
Yang, R., and Welch, G. 2002. Fast image segmentation and smoothing using commodity graphics hardware. Journal of Graphics Tools 7, 4 (Dec.), 91--100.

Cited By

View all
  • (2019)Configurable Texture Unit for Convolutional Neural Networks on Graphics Processing Units2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS)10.1109/AICAS.2019.8771629(234-238)Online publication date: Mar-2019
  • (2013)HD video decoding scheme based on mobile heterogeneous system architecture2013 IEEE International Conference on Acoustics, Speech and Signal Processing10.1109/ICASSP.2013.6638159(2761-2765)Online publication date: May-2013
  • (2012)A Reconfigurable Heterogeneous Multimedia Processor for IC-Stacking on Si-InterposerIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2011.217120922:4(589-604)Online publication date: 1-Apr-2012
  • Show More Cited By

Index Terms

  1. CFU: multi-purpose configurable filtering unit for mobile multimedia applications on graphics hardware

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      HPG '09: Proceedings of the Conference on High Performance Graphics 2009
      August 2009
      185 pages
      ISBN:9781605586038
      DOI:10.1145/1572769
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 August 2009

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. configurable filtering unit
      2. streaming architecture
      3. texture filtering

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      HPG 2009
      Sponsor:
      HPG 2009: High Performance Graphics
      August 1 - 3, 2009
      Louisiana, New Orleans

      Acceptance Rates

      Overall Acceptance Rate 15 of 44 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 01 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)Configurable Texture Unit for Convolutional Neural Networks on Graphics Processing Units2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS)10.1109/AICAS.2019.8771629(234-238)Online publication date: Mar-2019
      • (2013)HD video decoding scheme based on mobile heterogeneous system architecture2013 IEEE International Conference on Acoustics, Speech and Signal Processing10.1109/ICASSP.2013.6638159(2761-2765)Online publication date: May-2013
      • (2012)A Reconfigurable Heterogeneous Multimedia Processor for IC-Stacking on Si-InterposerIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2011.217120922:4(589-604)Online publication date: 1-Apr-2012
      • (2011)Variable bit rate GPU texture decompressionProceedings of the Twenty-second Eurographics conference on Rendering10.1111/j.1467-8659.2011.01989.x(1299-1308)Online publication date: 27-Jun-2011
      • (2011)Power-Efficient Hardware Architecture of K-Means Clustering With Bayesian-Information-Criterion Processor for Multimedia Processing ApplicationsIEEE Journal on Emerging and Selected Topics in Circuits and Systems10.1109/JETCAS.2011.21652311:3(357-368)Online publication date: Sep-2011
      • (2011)A 275mW heterogeneous multimedia processor for IC-stacking on Si-interposer2011 IEEE International Solid-State Circuits Conference10.1109/ISSCC.2011.5746249(128-130)Online publication date: Feb-2011
      • (2011)A 172.6mW 43.8GFLOPS energy-efficient scalable eight-core 3D graphics processor for mobile multimedia applicationsIEEE Asian Solid-State Circuits Conference 201110.1109/ASSCC.2011.6123602(405-408)Online publication date: Nov-2011

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media