article

Wyner-Ziv coding of video with unsupervised motion vector learning

Authors:

David Varodayan,

Bernd GirodAuthors Info & Claims

Image Communication, Volume 23, Issue 5

Pages 369 - 378

https://doi.org/10.1016/j.image.2008.04.009

Published: 01 June 2008 Publication History

Abstract

Distributed source coding theory has long promised a new method of encoding video that is much lower in complexity than conventional methods. In the distributed framework, the decoder is tasked with exploiting the redundancy of the video signal. Among the difficulties in realizing a practical codec has been the problem of motion estimation at the decoder. In this paper, we propose a technique for unsupervised learning of forward motion vectors during the decoding of a frame with reference to its previous reconstructed frame. The technique, described for both pixel-domain and transform-domain coding, is an instance of the expectation maximization algorithm. The performance of our transform-domain motion learning video codec improves as GOP size grows. It is better than using motion-compensated temporal interpolation by 0.5dB when GOP size is 2, and by even more when GOP size is larger. It performs within about 0.25dB of a codec that knows the motion vectors through an oracle, but is hundreds of orders of magnitude less complex than a corresponding brute-force decoder motion search approach would be.

References

[1]

A. Aaron, S. Rane, B. Girod, Wyner-Ziv video coding with hash-based motion compensation at the receiver, in: Proceedings of the IEEE International Conference on Image Processing, Singapore, 2004.

[2]

A. Aaron, R. Zhang, B. Girod, Wyner-Ziv coding of motion video, in: Proceedings of the Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, 2002.

[3]

D. Chen, D. Varodayan, Unsupervised learning of motion for distributed video coding, {www.stanford.edu/~dmchen/}, 2008.

[4]

D. Chen, D. Varodayan, M. Flierl, B. Girod, Distributed stereo image coding with improved disparity and noise estimation, in: Proceedings of the IEEE International Conference on Acoustic, Speech and Signal Processing, Las Vegas, NV, 2008.

[5]

D. Chen, D. Varodayan, M. Flierl, B. Girod, Wyner-Ziv coding of multiview images with unsupervised learning of disparity and Gray code, in: Proceedings of the IEEE International Conference on Image Processing, San Diego, CA, 2008.

[6]

Dempster, A., Laird, N. and Rubin, D., Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. Ser. B. v39 i1. 1-38.

[7]

Girod, B., Aaron, A., Rane, S. and Rebollo-Monedero, D., Distributed video coding. Proc. IEEE. v93 i1. 71-83.

[8]

Guillemot, C., Pereira, F., Torres, L., Ebrahimi, T., Leonardi, R. and Ostermann, J., Distributed monoview and multiview video coding. IEEE Signal Proc. Mag. v24 i5. 67-76.

[9]

ITU-T, I. JTC1, Digital compression and coding of continuous-tone still images, ISO/IEC 10918-1-ITU-T Recommendation T.81 (JPEG).

[10]

Kschischang, F.R., Frey, B.J. and Loeliger, H.-A., Factor graphs and the sum-product algorithm. IEEE Trans. Inf. Theory. v47 i2. 498-519.

[11]

Liveris, A., Xiong, Z. and Georghiades, C., Compression of binary sources with side information at the decoder using LDPC codes. IEEE Commun. Lett. v6 i10. 440-442.

[12]

F. Pereira, J. Ascenso, C. Brites, Studying the GOP size impact on the performance of a feedback channel-based Wyner-Ziv video codec, in: Proceedings of the IEEE Pacific Rim Symposium on Image and Video Technology, Santiago, Chile, 2007.

[13]

R. Puri, K. Ramchandran, PRISM: a new robust video coding architecture based on distributed compression principles, in: Proceedings of the Allerton Conference on Communications, Control and Computing, Allerton, IL, 2002.

[14]

Slepian, D. and Wolf, J.K., Noiseless coding of correlated information sources. IEEE Trans. Inf. Theory. v19 i4. 471-480.

[15]

D. Varodayan, A. Aaron, B. Girod, Rate-adaptive distributed source coding using low-density parity-check codes, in: Proceedings of the Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, 2005.

[16]

Varodayan, D., Aaron, A. and Girod, B., Rate-adaptive codes for distributed source coding. EURASIP Signal Process. J. v86 i11. 3123-3130.

[17]

D. Varodayan, Y.-C. Lin, A. Mavlankar, M. Flierl, B. Girod, Wyner-Ziv coding of stereo images with unsupervised learning of disparity, in: Proceedings of the Picture Coding Symposium, Lisbon, Portugal, 2007.

[18]

D. Varodayan, A. Mavlankar, M. Flierl, B. Girod, Distributed coding of random dot stereograms with unsupervised learning of disparity, in: Proceedings of the IEEE International Workshop on Multimedia Signal Processing, Victoria, BC, Canada, 2006.

[19]

D. Varodayan, A. Mavlankar, M. Flierl, B. Girod, Distributed grayscale stereo image coding with unsupervised learning of disparity, in: Proceedings of the IEEE Data Compression Conference, Snowbird, UT, 2007.

[20]

Wyner, A.D. and Ziv, J., The rate-distortion function for source coding with side information at the decoder. IEEE Trans. Inf. Theory. v22 i1. 1-10.

Cited By

Mohammad Taheri YAhmad MSwamy M(2019)Successive refinement of side information frames in distributed video codingMultimedia Tools and Applications10.1007/s11042-019-7249-578:15(20697-20722)Online publication date: 1-Aug-2019
https://dl.acm.org/doi/10.1007/s11042-019-7249-5
Dash BRup SMohapatra AMajhi BSwamy M(2018)Multi-resolution extreme learning machine-based side information estimation in distributed video codingMultimedia Tools and Applications10.1007/s11042-018-5921-977:20(27301-27335)Online publication date: 1-Oct-2018
https://dl.acm.org/doi/10.1007/s11042-018-5921-9
Mohammad Taheri YAhmad MSwamy M(2018)A joint correlation noise estimation and decoding algorithm for distributed video codingMultimedia Tools and Applications10.1007/s11042-017-4635-877:6(7327-7355)Online publication date: 1-Mar-2018
https://dl.acm.org/doi/10.1007/s11042-017-4635-8
Show More Cited By

Index Terms

Wyner-Ziv coding of video with unsupervised motion vector learning
1. Computing methodologies
2. Information systems
  1. Data management systems
    1. Data structures
      1. Data layout
        Data compression

Recommendations

Backward channel aware Wyner-Ziv video coding: A study of complexity, rate, and distortion tradeoff

Wyner-Ziv video coding is a coding method that exploits source statistics at the decoder. Many Wyner-Ziv video coding schemes encode a video sequence into two types of frames, key frames and Wyner-Ziv frames, and rely on the frequent use of INTRA frames ...
Rate allocation for transform domain Wyner-Ziv video coding without feedback
MM '08: Proceedings of the 16th ACM international conference on Multimedia

In this paper, we propose a new rate allocation algorithm for transform domain Wyner-Ziv video coding (WZVC) without feedback. In contrast to conventional video coding, Wyner-Ziv video coding aims to design simple intra-frame encoding and complex inter-...
Perception-based adaptive quantization for transform-domain Wyner-Ziv video coding

Distributed video coding (DVC) is desirable for encoding systems with tight power or computational constraints, for which the popular practical solution is transform-domain Wyner-Ziv video coding (TD-WZVC). To achieve the similar coding performance with ...

Comments

Information & Contributors

Information

Published In

cover image Image Communication

Image Communication Volume 23, Issue 5

June, 2008

68 pages

ISSN:0923-5965

Issue’s Table of Contents

Copyright © Elsevier B.V. © 2008.

Publisher

Elsevier Science Inc.

United States

Publication History

Published: 01 June 2008

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

15
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Mohammad Taheri YAhmad MSwamy M(2019)Successive refinement of side information frames in distributed video codingMultimedia Tools and Applications10.1007/s11042-019-7249-578:15(20697-20722)Online publication date: 1-Aug-2019
https://dl.acm.org/doi/10.1007/s11042-019-7249-5
Dash BRup SMohapatra AMajhi BSwamy M(2018)Multi-resolution extreme learning machine-based side information estimation in distributed video codingMultimedia Tools and Applications10.1007/s11042-018-5921-977:20(27301-27335)Online publication date: 1-Oct-2018
https://dl.acm.org/doi/10.1007/s11042-018-5921-9
Mohammad Taheri YAhmad MSwamy M(2018)A joint correlation noise estimation and decoding algorithm for distributed video codingMultimedia Tools and Applications10.1007/s11042-017-4635-877:6(7327-7355)Online publication date: 1-Mar-2018
https://dl.acm.org/doi/10.1007/s11042-017-4635-8
Bhagath PMukherjee JMukopadhayay SBatra DBrown MNatarajan V(2016)Low complexity encoder for feedback-channel-free distributed video coding using deep convolutional neural networks at the decoderProceedings of the Tenth Indian Conference on Computer Vision, Graphics and Image Processing10.1145/3009977.3009986(1-7)Online publication date: 18-Dec-2016
https://dl.acm.org/doi/10.1145/3009977.3009986
Dai YFang YYang LJeon G(2016)Graphics processing unit-accelerated joint-bitplane belief propagation algorithm in DSCThe Journal of Supercomputing10.1007/s11227-016-1736-572:6(2351-2375)Online publication date: 1-Jun-2016
https://dl.acm.org/doi/10.1007/s11227-016-1736-5
Masoodhu Banu NSujatha S(2015)Improved Tampering Detection for Image Authentication Based on Image PartitioningWireless Personal Communications: An International Journal10.1007/s11277-015-2594-984:1(69-85)Online publication date: 1-Sep-2015
https://dl.acm.org/doi/10.1007/s11277-015-2594-9
Jia YWang YSong RLi J(2015)Decoder side information generation techniques in Wyner-Ziv video codingMultimedia Tools and Applications10.1007/s11042-013-1718-z74:6(1777-1803)Online publication date: 1-Mar-2015
https://dl.acm.org/doi/10.1007/s11042-013-1718-z
Nickaein IRahmati MHamzei N(2014)Support vector regression for rate prediction in distributed video codingIntelligent Data Analysis10.5555/2639304.263931318:3(465-477)Online publication date: 1-May-2014
https://dl.acm.org/doi/10.5555/2639304.2639313
Mallick KMukherjee JRamakrishnan AMalik JEfros AJawahar CVarma M(2014)Distributed Video Coding using Local Rank TransformProceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing10.1145/2683483.2683501(1-8)Online publication date: 14-Dec-2014
https://dl.acm.org/doi/10.1145/2683483.2683501
Deligiannis NVerbist FSlowack JWalle RSchelkens PMunteanu A(2014)Progressively refined wyner-ziv video coding for visual sensorsACM Transactions on Sensor Networks10.1145/253027910:2(1-34)Online publication date: 31-Jan-2014
https://dl.acm.org/doi/10.1145/2530279
Show More Cited By

View Options

View options

Figures

Tables

Media

View Issue’s Table of Contents