Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Wyner-Ziv coding of video with unsupervised motion vector learning

Published: 01 June 2008 Publication History

Abstract

Distributed source coding theory has long promised a new method of encoding video that is much lower in complexity than conventional methods. In the distributed framework, the decoder is tasked with exploiting the redundancy of the video signal. Among the difficulties in realizing a practical codec has been the problem of motion estimation at the decoder. In this paper, we propose a technique for unsupervised learning of forward motion vectors during the decoding of a frame with reference to its previous reconstructed frame. The technique, described for both pixel-domain and transform-domain coding, is an instance of the expectation maximization algorithm. The performance of our transform-domain motion learning video codec improves as GOP size grows. It is better than using motion-compensated temporal interpolation by 0.5dB when GOP size is 2, and by even more when GOP size is larger. It performs within about 0.25dB of a codec that knows the motion vectors through an oracle, but is hundreds of orders of magnitude less complex than a corresponding brute-force decoder motion search approach would be.

References

[1]
A. Aaron, S. Rane, B. Girod, Wyner-Ziv video coding with hash-based motion compensation at the receiver, in: Proceedings of the IEEE International Conference on Image Processing, Singapore, 2004.
[2]
A. Aaron, R. Zhang, B. Girod, Wyner-Ziv coding of motion video, in: Proceedings of the Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, 2002.
[3]
D. Chen, D. Varodayan, Unsupervised learning of motion for distributed video coding, {www.stanford.edu/~dmchen/}, 2008.
[4]
D. Chen, D. Varodayan, M. Flierl, B. Girod, Distributed stereo image coding with improved disparity and noise estimation, in: Proceedings of the IEEE International Conference on Acoustic, Speech and Signal Processing, Las Vegas, NV, 2008.
[5]
D. Chen, D. Varodayan, M. Flierl, B. Girod, Wyner-Ziv coding of multiview images with unsupervised learning of disparity and Gray code, in: Proceedings of the IEEE International Conference on Image Processing, San Diego, CA, 2008.
[6]
Dempster, A., Laird, N. and Rubin, D., Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. Ser. B. v39 i1. 1-38.
[7]
Girod, B., Aaron, A., Rane, S. and Rebollo-Monedero, D., Distributed video coding. Proc. IEEE. v93 i1. 71-83.
[8]
Guillemot, C., Pereira, F., Torres, L., Ebrahimi, T., Leonardi, R. and Ostermann, J., Distributed monoview and multiview video coding. IEEE Signal Proc. Mag. v24 i5. 67-76.
[9]
ITU-T, I. JTC1, Digital compression and coding of continuous-tone still images, ISO/IEC 10918-1-ITU-T Recommendation T.81 (JPEG).
[10]
Kschischang, F.R., Frey, B.J. and Loeliger, H.-A., Factor graphs and the sum-product algorithm. IEEE Trans. Inf. Theory. v47 i2. 498-519.
[11]
Liveris, A., Xiong, Z. and Georghiades, C., Compression of binary sources with side information at the decoder using LDPC codes. IEEE Commun. Lett. v6 i10. 440-442.
[12]
F. Pereira, J. Ascenso, C. Brites, Studying the GOP size impact on the performance of a feedback channel-based Wyner-Ziv video codec, in: Proceedings of the IEEE Pacific Rim Symposium on Image and Video Technology, Santiago, Chile, 2007.
[13]
R. Puri, K. Ramchandran, PRISM: a new robust video coding architecture based on distributed compression principles, in: Proceedings of the Allerton Conference on Communications, Control and Computing, Allerton, IL, 2002.
[14]
Slepian, D. and Wolf, J.K., Noiseless coding of correlated information sources. IEEE Trans. Inf. Theory. v19 i4. 471-480.
[15]
D. Varodayan, A. Aaron, B. Girod, Rate-adaptive distributed source coding using low-density parity-check codes, in: Proceedings of the Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, 2005.
[16]
Varodayan, D., Aaron, A. and Girod, B., Rate-adaptive codes for distributed source coding. EURASIP Signal Process. J. v86 i11. 3123-3130.
[17]
D. Varodayan, Y.-C. Lin, A. Mavlankar, M. Flierl, B. Girod, Wyner-Ziv coding of stereo images with unsupervised learning of disparity, in: Proceedings of the Picture Coding Symposium, Lisbon, Portugal, 2007.
[18]
D. Varodayan, A. Mavlankar, M. Flierl, B. Girod, Distributed coding of random dot stereograms with unsupervised learning of disparity, in: Proceedings of the IEEE International Workshop on Multimedia Signal Processing, Victoria, BC, Canada, 2006.
[19]
D. Varodayan, A. Mavlankar, M. Flierl, B. Girod, Distributed grayscale stereo image coding with unsupervised learning of disparity, in: Proceedings of the IEEE Data Compression Conference, Snowbird, UT, 2007.
[20]
Wyner, A.D. and Ziv, J., The rate-distortion function for source coding with side information at the decoder. IEEE Trans. Inf. Theory. v22 i1. 1-10.

Cited By

View all
  • (2019)Successive refinement of side information frames in distributed video codingMultimedia Tools and Applications10.1007/s11042-019-7249-578:15(20697-20722)Online publication date: 1-Aug-2019
  • (2018)Multi-resolution extreme learning machine-based side information estimation in distributed video codingMultimedia Tools and Applications10.1007/s11042-018-5921-977:20(27301-27335)Online publication date: 1-Oct-2018
  • (2018)A joint correlation noise estimation and decoding algorithm for distributed video codingMultimedia Tools and Applications10.1007/s11042-017-4635-877:6(7327-7355)Online publication date: 1-Mar-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Image Communication
Image Communication  Volume 23, Issue 5
June, 2008
68 pages

Publisher

Elsevier Science Inc.

United States

Publication History

Published: 01 June 2008

Author Tags

  1. Expectation maximization
  2. Wyner-Ziv video coding

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Successive refinement of side information frames in distributed video codingMultimedia Tools and Applications10.1007/s11042-019-7249-578:15(20697-20722)Online publication date: 1-Aug-2019
  • (2018)Multi-resolution extreme learning machine-based side information estimation in distributed video codingMultimedia Tools and Applications10.1007/s11042-018-5921-977:20(27301-27335)Online publication date: 1-Oct-2018
  • (2018)A joint correlation noise estimation and decoding algorithm for distributed video codingMultimedia Tools and Applications10.1007/s11042-017-4635-877:6(7327-7355)Online publication date: 1-Mar-2018
  • (2016)Low complexity encoder for feedback-channel-free distributed video coding using deep convolutional neural networks at the decoderProceedings of the Tenth Indian Conference on Computer Vision, Graphics and Image Processing10.1145/3009977.3009986(1-7)Online publication date: 18-Dec-2016
  • (2016)Graphics processing unit-accelerated joint-bitplane belief propagation algorithm in DSCThe Journal of Supercomputing10.1007/s11227-016-1736-572:6(2351-2375)Online publication date: 1-Jun-2016
  • (2015)Improved Tampering Detection for Image Authentication Based on Image PartitioningWireless Personal Communications: An International Journal10.1007/s11277-015-2594-984:1(69-85)Online publication date: 1-Sep-2015
  • (2015)Decoder side information generation techniques in Wyner-Ziv video codingMultimedia Tools and Applications10.1007/s11042-013-1718-z74:6(1777-1803)Online publication date: 1-Mar-2015
  • (2014)Support vector regression for rate prediction in distributed video codingIntelligent Data Analysis10.5555/2639304.263931318:3(465-477)Online publication date: 1-May-2014
  • (2014)Distributed Video Coding using Local Rank TransformProceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing10.1145/2683483.2683501(1-8)Online publication date: 14-Dec-2014
  • (2014)Progressively refined wyner-ziv video coding for visual sensorsACM Transactions on Sensor Networks10.1145/253027910:2(1-34)Online publication date: 31-Jan-2014
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media