Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

A multiple-window video embedding transcoder based on H.264/AVC standard

Published: 01 June 2007 Publication History

Abstract

This paper proposes a low-complexity multiple-window video embedding transcoder (MW-VET) based on H.264/AVC standard for various applications that require video embedding services including picture-in-picture (PIP), multichannel mosaic, screen-split, pay-per-view, channel browsing, commercials and logo insertion, and other visual information embedding services. The MW-VET embeds multiple foreground pictures at macroblock-aligned positions. It improves the transcoding speed with three block level adaptive techniques including slice group based transcoding (SGT), reduced frame memory transcoder (RFMT), and syntax level bypassing (SLB). The SGT utilizes prediction from the slice-aligned data partitions in the original bitstreams such that the transcoder simply merges the bitstreams by parsing. When the prediction comes from the newly covered area without slice-group data partitions, the pixels at the affected macroblocks are transcoded with the RFMT based on the concept of partial reencoding to minimize the number of refined blocks. The RFMT employs motion vector remapping (MVR) and intra mode switching (IMS) to handle intercoded blocks and intracoded blocks, respectively. The pixels outside the macroblocks that are affected by newly covered reference frame are transcoded by the SLB. Experimental results show that, as compared to the cascaded pixel domain transcoder (CPDT) with the highest complexity, our MW-VET can significantly reduce the processing complexity by 25 times and retain the rate-distortion performance close to the CPDT. At certain bit rates, the MW-VET can achieve up to 1.5 dB quality improvement in peak signal-to-noise-ratio (PSNR).

References

[1]
{1} ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG4-AVC), "Advanced Video Coding for Generic Audiovisual Services," v1, May 2003; v2, January 2004; v3, September 2004; v4, July 2005.
[2]
{2} S. Wenger, "H.264/AVC over IP," IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, no. 7, pp. 645-656, 2003.
[3]
{3} M. D. Nava and C. Del-Toso, "A short overview of the VDSL system requirements," IEEE Communications Magazine, vol. 40, no. 12, pp. 82-90, 2002.
[4]
{4} S. Naimpally, L. Johnson, T. Darby, R. Meyer, L. Phillips, and J. Vantrease, "Integrated digital IDTV receiver with features," IEEE Transactions on Consumer Electronics, vol. 34, no. 3, pp. 410-419, 1988.
[5]
{5} D. Gillies, R. Schweer, and H. Zibold, "VLSI realisations for picture in picture and flicker free television display," IEEE Transactions on Consumer Electronics, vol. 34, no. 1, pp. 253-261, 1988.
[6]
{6} M. Burkert, F. Frieling, U. Langenkamp, U. Libal, M. Mende, and G. Scheffler, "IC set for a picture-in-picture system with on-chip memory," IEEE Transactions on Consumer Electronics, vol. 36, no. 1, pp. 23-31, 1990.
[7]
{7} C. A. Mancini and C. P. Markhauser, "Microprocessor controlled picture in picture system," IEEE Transactions on Consumer Electronics, vol. 36, no. 3, pp. 375-379, 1990.
[8]
{8} M. Honzawa, M. Koyama, T. Hibino, H. Miyashita, and Y. Shiine, "New picture in picture LSI enhanced functionality for high picture quality," IEEE Transactions on Consumer Electronics , vol. 36, no. 3, pp. 387-394, 1990.
[9]
{9} L. D. Johnson, J. N. Pratt, and D. C. Greene, "Low cost picture-in-picture for color TV receivers," IEEE Transactions on Consumer Electronics, vol. 36, no. 3, pp. 380-386, 1990.
[10]
{10} S. Tsuchida and C. Yoshida, "Multi-picture system for high resolution wide aspect ratio screen," IEEE Transactions on Consumer Electronics, vol. 37, no. 3, pp. 313-319, 1991.
[11]
{11} G. W. Perkins, R. C. Hathaway, S. W. Lai, et al., "A low cost, monolithic, color picture-in-picture device," IEEE Transactions on Consumer Electronics, vol. 40, no. 3, pp. 306-316, 1994.
[12]
{12} A. Rick, T. Herfet, and S. J. Prange, "Digital color decoder for PIP-applications," IEEE Transactions on Consumer Electronics, vol. 42, no. 3, pp. 716-720, 1996.
[13]
{13} M. Brett and D. Wendel, "High performance picture-in-picture (PIP) IC using embedded DRAM technology," IEEE Transactions on Consumer Electronics, vol. 45, no. 3, pp. 698-705, 1999.
[14]
{14} M. Schu, G. Scheffer, C. Tuschen, and A. Stolze, "System on silicon-IC for motion compensated scan rate conversion, picture-in-picture processing, split screen applications and display processing," IEEE Transactions on Consumer Electronics , vol. 45, no. 3, pp. 842-850, 1999.
[15]
{15} M. Schu, D. Wendel, C. Tuschen, M. Hahn, and U. Langenkamp, "System-on-silicon solution for high quality consumer video processing--the next generation," IEEE Transactions on Consumer Electronics, vol. 47, no. 3, pp. 412-419, 2001.
[16]
{16} C. Hentschel, R. J. Bril, Y. Chen, R. Braspenning, and T.-H. Lan, "Video quality-of-service for consumer terminals--a novel system for programmable components," IEEE Transactions on Consumer Electronics, vol. 49, no. 4, pp. 1367-1377, 2003.
[17]
{17} I. Ahmad, X. Wei, Y. Sun, and Y.-Q. Zhang, "Video transcoding: an overview of various techniques and research issues," IEEE Transactions on Multimedia, vol. 7, no. 5, pp. 793-804, 2005.
[18]
{18} S.-F. Chang and D. G. Messerschmitt, "Compositing motion-compensated video within the network," in Proceedings of the 4th IEEE ComSoc International Workshop on Multimedia Communications (MULTIMEDIA '92), pp. 40-56, Monterey, Calif, USA, April 1992.
[19]
{19} S.-F. Chang and D. G. Messerschmitt, "Manipulation and compositing of MC-DCT compressed video," IEEE Journal on Selected Areas in Communications, vol. 13, no. 1, part 2, pp. 1-11, 1995.
[20]
{20} Y. Noguchi, D. G. Messerschmitt, and S.-F. Chang, "MPEG video compositing in the compressed domain," in Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '96), vol. 2, pp. 596-599, Atlanta, Ga, USA, May 1996.
[21]
{21} B. Yu and K. Nahrstedt, "Internet-based interactive HDTV," Multimedia Systems, vol. 9, no. 5, pp. 477-489, 2004.
[22]
{22} Y.-P. Tan and H. Sun, "Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard," IEEE Transactions on Consumer Electronics, vol. 50, no. 3, pp. 887-894, 2004.
[23]
{23} C.-H. Li, C.-N. Wang, and T. Chiang, "A fast downsizing video transcoding based on H.264/AVC standard," in Proceedings of the 5th IEEE Pacific Rim Conference on Multimedia (PCM '04), pp. 215-223, Tokyo, Japan, November-December 2004.
[24]
{24} H. Shen, X. Sun, F. Wu, H. Li, and S. Li, "A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision," in Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), vol. 1, pp. 2017-2020, Toronto, Ontario, Canada, July 2006.
[25]
{25} N. Merhav and V. Bhaskaran, "A fast algorithm for DCT-domain inverse motion compensation," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '96), vol. 4, pp. 2307-2310, Atlanta, Ga, USA, May 1996.
[26]
{26} J. Song and B.-L. Yeo, "A fast algorithm for DCT-domain inverse motion compensation based on shared information in a macroblock," IEEE Transactions on Circuits and Systems for Video Technology, vol. 10, no. 5, pp. 767-775, 2000.
[27]
{27} S. Liu and A. C. Bovik, "Local bandwidth constrained fast inverse motion compensation for DCT-domain video transcoding," IEEE Transactions on Circuits and Systems for Video Technology , vol. 12, no. 5, pp. 309-319, 2002.
[28]
{28} H. Shen, X. Sun, F. Wu, H. Li, and S. Li, "A fast downsizing video transcoder for H.264/AVC with rate-distortion optimal mode decision," in Proceedings of IEEE International Conference on Multimedia and Expo (ICME '06), pp. 2017-2020, Toronto, Ontario, Canada, July 2006.
[29]
{29} J. Bialkowski, M. Barkouwsky, F. Leschka, and A. Kaup, "Low-complexity transcoding of inter coded video frames from H.264 to H.263," in Proceedings of IEEE International Conference on Image Processing (ICIP '06), pp. 837-840, Atlanta, Ga, USA, October 2006.
[30]
{30} J. H. Hur and Y. L. Lee, "H.264 to MPEG-4 transcoding using block type information," in Proceedings of IEEE International Conference Region 10 (TENCON '05), pp. 1-6, Melbourne, Australia, November 2005.
[31]
{31} Y.-P. Tan and H. Sun, "Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard," IEEE Transactions on Consumer Electronics, vol. 50, no. 3, pp. 887-894, 2004.
[32]
{32} P. Zhang, Y. Lu, Q. Huang, and W. Gao, "Mode mapping method for H.264/AVC spatial downscaling transcoding," in Proceedings of International Conference on Image Processing (ICIP '04), vol. 4, pp. 2781-2784, Singapore, October 2004.
[33]
{33} I.-H. Shin, Y.-L. Lee, and H.-W. Park, "Motion estimation for frame-rate reduction in H.264 transcoding," in Proceedings of the 2nd IEEE Workshop on Software Technologies for Future Embedded and Ubiquitous Systems (WSTFES '04), vol. 4, pp. 63-67, Vienna, Austria, May 2004.
[34]
{34} D. Lefol and D. Bull, "Mode refinement algorithm for H.264 inter frame requantization," in Proceedings of IEEE International Conference on Image Processing (ICIP '06), pp. 845-848, Atlanta, Ga, USA, October 2006.
[35]
{35} J. Zhang, A. Perkis, and N. D. Georganas, "H.264/AVC and transcoding for multimedia adaptation," in Proceedings of the 6th COST 276 Workshop, Thessaloniki, Greece, May 2004.
[36]
{36} X. Xiu, L. Zhuo, and L. Shen, "A H.264 bit rate transcoding scheme based on PID controller," in Proceedings of IEEE International Symposium on Communications and Information Technologies (ISCIT '05), vol. 2, pp. 1074-1077, Beijing, China, October 2005.
[37]
{37} D. Lefol, D. Bull, and N. Canagarajah, "Performance evaluation of transcoding algorithms for H.264," IEEE Transactions on Consumer Electronics, vol. 52, no. 1, pp. 215-222, 2006.
[38]
{38} C.-H. Li, C.-N. Wang, and T. Chiang, "A low complexity picture-in-picture transcoder for video-on-demand," in Proceedings of IEEE International Conference on Wireless Networks, Communications and Mobile Computing (WirelessCom '05), vol. 2, pp. 1382-1387, Maui, Hawaii, USA, June 2005.
[39]
{39} C.-H. Li, H. Lin, C.-N. Wang, and T. Chiang, "A fast H.264-based picture-in-picture (PIP) transcoder," in Proceedings of IEEE International Conference on Multimedia and Expo (ICME '04), vol. 3, pp. 1691-1694, Taipei, Taiwan, June 2004.
[40]
{40} A. Levi and H. Stark, "Restoration from phase and magnitude by generalized projections," in Image Recovery Theory and Application, pp. 277-319, Academic Press, Orlando, Fla, USA, 1987.
[41]
{41} S.-H. Wang, W.-H. Peng, Y. He, et al., "A software-hardware co-implementation of MPEG-4 advanced video coding (AVC) decoder with block level pipelining," The Journal of VLSI Signal Processing, vol. 41, no. 1, pp. 93-110, 2005.
[42]
{42} C. Chen, P.-H. Wu, and H. Chen, "Transform-domain intra prediction for H.264," in Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS '05), vol. 2, pp. 1497-1500, Kobe, Japan, May 2005.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image EURASIP Journal on Advances in Signal Processing
EURASIP Journal on Advances in Signal Processing  Volume 2007, Issue 2
June 2007
332 pages

Publisher

Hindawi Limited

London, United Kingdom

Publication History

Published: 01 June 2007

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)4
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media