Distributed Source Coding For Video and Image Applications
Antonio ORTEGA
Signal and Image Processing Institute
Department of Electrical Engineering
University of Southern California
http://sipi.usc.edu/~ortega
[Figure: multiview capture (Camera 2: View 2)]
• Publications (‘07-’08):
– J. H. Kim, P. Lai, J. Lopez, A. Ortega, Y. Su, P. Yin, and C. Gomila, “New Coding Tools for Illumination and Focus Mismatch Compensation in Multiview Video Coding,” IEEE Trans. Circuits and Systems for Video Technology, Vol. 17, No. 11, pp. 1519-1535, Nov. 2007.
– P. Lai, A. Ortega, P. Pandit, P. Yin, and C. Gomila, “Focus Mismatches in Multiview Systems and Efficient Adaptive Reference Filtering for Multiview Video Coding,” in Proc. SPIE Visual Communications and Image Processing (VCIP) 2008, Jan. 2008.
Error tolerant multimedia compression
• Funding: NSF
• PIs: Melvin Breuer, Keith Chugg, Sandeep Gupta, Antonio Ortega
• Students: Hye-Yeon Cheong, In-Suk Chong, Zhaoliang Pan, Shideh Shahid, On Wa Yeung
[Plot: Pe (%) vs. Se (average extra bits per wrong-match MB), showing acceptance curves for full-search ME and EPZS ME at several acceptance parameter values (1, 7/8, 3/4, 5/8, 1/2, 3/8)]
• Publications (‘07-’08):
– G. Shen and A. Ortega, “Optimized Distributed 2D Transforms for Irregularly Sampled Sensor Network Grids Using Wavelet Lifting,” accepted for publication in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP ’08), 2008.
– G. Shen and A. Ortega, “Joint Routing and 2D Transform Optimization for Irregular Sensor Network Grids Using Wavelet Lifting,” accepted for publication in IPSN ’08: Proc. of the Fifth Intl. Conf. on Information Processing in Sensor Networks, 2008.
MADCAT – Maltraffic Analysis and Detection in Challenging and Aggregate Traffic
• Network Measurement System Modeling
– Modeling network measurement systems using signal processing modules
– Developing methods to mitigate the effect of measurement errors on signal analysis
• Publications:
– U. Mitra, A. Ortega, J. Heidemann and C. Papadopoulos, “Detecting and Identifying Malware: A New Signal Processing Goal,” IEEE Signal Processing Magazine, 2006.
– S. McPherson and A. Ortega, “Modeling the effects of interrupt moderation on network measurements,” submitted, Global Internet Symposium, 2008.
• Group Members
– PIs: John Heidemann (CS), Urbashi Mitra, Antonio Ortega, Christos Papadopoulos (CS)
– Students: Genevieve Bartlett, Xue Cai, Sean McPherson, Gautam Thatte
• Website – http://isi.usc.edu/ant
• Support – MADCAT is supported by the NSF’s NeTS program, grant number CNS-0626696
• Students: Kun-Han Lee and Yen-Ting Lin
• Publications (‘07-’08):
– K.-H. Lee, A. Ortega, I. Ershaghi, “A Method for Characterization of Flow Units Between Injection-Production Wells Using Performance Data,” 2008 SPE Western Regional and Pacific Section AAPG Joint Meeting, March 2008.
• Tools:
– Wavelet-based methods
– Analysis and simulation
[Figure: 6 injectors / 3 producers line-drive scenario (Producers 1-3, Injectors 1-6)]
Genome Copy Number Alteration Detection
• Results comparable to the state of the art (e.g., segmentation-based techniques), with a 10x speed-up
Antonio ORTEGA
Signal and Image Processing Institute
Department of Electrical Engineering
University of Southern California
http://sipi.usc.edu/~ortega
Acknowledgements
• Collaborators at USC:
– Dr Ngai-Man Cheung
– Dr Huisheng Wang (now at Google)
– Caimu Tang
– Ivy Tseng
• Outside Collaborators:
– Sam Dolinar (NASA-JPL)
– Aaron Kiely (NASA-JPL)
– Kannan Ramchandran (UCB)
• Funding:
– NASA-JPL
– NSF
[Figure: mystery graph?]
Distributed Source Coding (DSC)
Outline
• Introduction
• Applications/Case studies
Distributed Source Coding: Fig. 1
[Block diagram (i): X → Encoder → rate R ≥ H(X|Y) → Decoder → X, with side information Y at the decoder]
(ii) DSC:
[Block diagram (ii): X → Encoder → Decoder → X; the encoder has only the joint statistics P(X,Y), while the decoder has access to Y]
– The encoder does not have access to the exact value of Y
– It knows only the distribution of Y given X
– This is not immediately useful by itself, since we want to compress X
[Illustration: conditional density fX|Y(x|y), shown for X = 4]
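For concreteness, here is a minimal numerical sketch of the R ≥ H(X|Y) bound (my own illustration, not from the slides), for a binary source whose side information differs from X with probability 0.1:

    import math

    def binary_entropy(p):
        """Entropy in bits of a Bernoulli(p) variable."""
        return 0.0 if p in (0.0, 1.0) else -p * math.log2(p) - (1 - p) * math.log2(1 - p)

    # Assumed toy model: X ~ Bernoulli(0.5), Y = X xor N with Pr(N = 1) = 0.1.
    # Coding X alone costs H(X) = 1 bit/sample; with Y available at the decoder,
    # the Slepian-Wolf bound is H(X|Y) = H(N) ≈ 0.469 bits/sample.
    print(binary_entropy(0.5), binary_entropy(0.1))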
Distributed Source Coding: Fig. 1
[Block diagrams: in both cases X is coded at rate R ≥ H(X|Y) and decoded to X; in the second case the encoder has only the joint statistics P(X,Y), while the decoder has Y]
Efficient encoding is possible even when the encoder does not have precise knowledge of Y.
[Illustration: the codeword space is partitioned into cosets (bins); symbols sharing the same label (A, B, C, …) belong to the same coset]
• The encoder sends only the coset (bin) index:
– Scalar case: X mod m = coset index
– Binary case: the syndrome X·H^T = S, giving 2^(n-k) cosets
• The decoder uses Y to disambiguate the information: it picks the element of the signaled coset closest to Y
[Illustration: among the candidates C in the signaled coset, the decoder selects the one nearest to Y]
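As a toy illustration of the coset idea (a sketch added here, not code from the talk), a scalar encoder can send X mod m as the coset index and let the decoder pick the coset member closest to Y; decoding is correct whenever |X - Y| < m/2:

    def dsc_encode(x, m):
        """Send only the coset (bin) index of x."""
        return x % m

    def dsc_decode(coset_index, y, m, search=16):
        """Return the member of the signaled coset closest to the side information y."""
        candidates = [c for c in range(y - search * m, y + search * m + 1)
                      if c % m == coset_index]
        return min(candidates, key=lambda c: abs(c - y))

    x, y, m = 137, 135, 8            # |X - Y| = 2 < m/2 = 4, so decoding succeeds
    assert dsc_decode(dsc_encode(x, m), y, m) == x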
Application scenarios - 1
[Block diagram: X → Encoder → Decoder → X; the encoder has Y and the noise statistics P(N), while the decoder only has the corrupted reference Y+N]
Application Scenarios - 2
Flexible playback:
• Exploit correlation between the input and multiple candidate side informations
• Decoder can use Y1 OR Y2 OR … etc.
[Block diagram: X → Encoder → Decoder → X; the encoder knows the candidate side informations Y1, Y2, …, while the decoder uses whichever Yi is available]
Application Scenarios - 3
[Block diagram: X → Encoder → Decoder → X, with P(X,Y) at the encoder and Y at the decoder]
[Block diagram: X → Quantizer → Q → Slepian-Wolf Encoder → Slepian-Wolf Decoder → Q → Minimum-distortion Reconstruction → X̂; P(X,Y) is used at the encoder, Y at the decoder]
• Lossy quantization
– In many cases following a transform
• Lossless compression of quantization index Q using SW encoder
– Convert to binary data (e.g., bitplanes):
• Transmit LSB
• Code selected bitplanes using syndrome codes
• At decoder, side information Y is used in:
– Slepian-Wolf decoding
– Reconstructing X within the quantization bin specified by Q: the reconstruction is the expected value of X given Y, conditioned on bin Q (see the sketch below)
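As an illustrative sketch of this last step (the Gaussian correlation model and the numerical integration are my assumptions, not details from the talk), the minimum-distortion reconstruction is the conditional mean E[X | Y = y, X ∈ bin Q]:

    import numpy as np

    def reconstruct(y, bin_lo, bin_hi, sigma_z, num=2001):
        """MMSE reconstruction E[X | Y = y, X in [bin_lo, bin_hi)],
        assuming X = y + Z with Z ~ N(0, sigma_z^2)."""
        x = np.linspace(bin_lo, bin_hi, num)
        w = np.exp(-0.5 * ((x - y) / sigma_z) ** 2)   # unnormalized conditional density
        return float(np.sum(x * w) / np.sum(w))

    # Example: quantization bin [8, 16), side information y = 9.2
    print(reconstruct(9.2, 8.0, 16.0, sigma_z=2.0))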
Practical Lossy DSC: Key Problems
[Block diagram: X → Quantizer → Q → Slepian-Wolf Encoder → Slepian-Wolf Decoder → Q → Minimum-distortion Reconstruction → X̂; P(X,Y) at the encoder, Y at the decoder]
• Defining the application:
– Loss in RD performance due to DSC (even in theory)
– What is the other metric that we want to optimize? (complexity, memory,
speed)
• Formulating the correlation model estimation problem:
– Lower modeling accuracy leads to a coding penalty
– Better modeling may reduce the benefits in terms of the other metric (e.g., complexity)
• For a given design, how to optimize RD performance:
– When to use DSC and when not to (e.g., use Intra coding instead)?
– Optimize the trade-off in terms of RD and/or complexity, etc.
• This Talk:
– Case studies to illustrate the benefits of focusing on these issues.
[Figure: mystery graph?]
[Figure: total citations per 5-year period]
Outline
• Introduction
• Applications/Case studies
DSC Application to Scalable Video Coding
Multiple MCP Loops Approach:
Non-trivial Complexity
In the multiple-loop approach, the complexity of replicating all the EL reconstructions can be non-trivial when coding multiple layers.
[Diagram: each enhancement layer requires additional memory buffers and repeats the inverse transform and inverse quantization (Input, BL, EL)]
Cast scalable coding as a Wyner-Ziv problem
[Diagram: the EL predictor acts as side information (SI) in joint decoding, and the BL reconstruction is used to compute the residue (EL1, EL2, …, ELk)]
To estimate the difference between the input and the base layer (u_k), we use the difference between the best predictor and the base layer (v_k), modeled through the correlation p(u_k | v_k); see the sketch below.
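A minimal encoder-side sketch of this correlation model (my own illustration; the Laplacian assumption is not from the talk): treat u_k as v_k plus additive noise and fit the noise scale from the data.

    import numpy as np

    def fit_correlation_model(x, base_recon, predictor):
        """Fit p(u_k | v_k) assuming u_k = v_k + z with z ~ Laplacian(0, b) (assumed model)."""
        u = np.asarray(x, float) - np.asarray(base_recon, float)          # input minus base layer
        v = np.asarray(predictor, float) - np.asarray(base_recon, float)  # predictor minus base layer
        z = u - v
        b = float(np.mean(np.abs(z)))   # ML estimate of the Laplacian scale (zero mean assumed)
        return b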
Correlation Estimation at Encoder
• Solution
– Approximate optimal predictor
– Using an approximation of EL reconstruction
Results
[Plots: PSNR-Y (dB) vs. bitrate (kbps) for Akiyo CIF and Container QCIF, comparing single-layer coding, multiple-loop coding (MCLP), FGS, and the DSC-based approach (WZS)]
• Application definition:
– Key problem in video scalability: multiple open-loop layers (e.g., FGS) lead to poor performance (unexploited temporal redundancy), while closed-loop layers require multiple reconstructions at the encoder
– Savings in terms of memory/complexity: temporal redundancy at higher layers is exploited via DSC
• Model Estimation:
– Temporal correlation is estimated using the quantized original frame (not the reconstructed one, so that EL reconstructions are not needed)
• RD Optimization:
– For each frame, select encoding based on the base layer (FGS) or temporal enhancement (WZS)
DSC Application to Hyperspectral Image Coding
• Large volume
– Hundreds of image bands
– More than 100 MB per hyperspectral image
• Resource constraints
– Encoding in satellite
– Decoding in ground station
– Memory, power
• High correlation between image bands
Inter-band Correlation
• Initial DSC-based approach [Tang, Cheung, Ortega, Raghavendra; DCC 05]
DSC Based Hyperspectral Image Compression –
System Overview
[Diagram: parallel encoding of spectral bands on microprocessor units (MPU 1-4), with correlation models P(X1,X0), P(X2,X1), P(X3,X2), P(X4,X3) estimated between neighboring bands; per-band coding is based on a DWT and SPIHT]
• On-board correlation estimation
– Small amount of data exchange
• Inter-processor communications are slow
– Should not hinder parallel operation
– Low complexity
DSC Based Hyperspectral Image Compression –
System Overview (Cont’d)
[Block diagram: for each band i, a wavelet transform is followed by bit-plane extraction; sign/refinement bit-planes go to a Slepian-Wolf encoder, significance-information bit-planes go to a zerotree encoder, and the outputs are multiplexed]
Coding Summary
1. Model Estimation
• Estimate fXY(x,y)
– X: transform coefficients of the current spectral band
– Y: transform coefficients of the neighboring spectral band
• Assume Y = X + Z
– Z is correlation noise, independent of X
2. Derive bit-plane level statistics: use the estimated p.d.f. to derive the crossover probabilities analytically
Note: all of these decisions can be made for each band independently, after a small number of pixels have been exchanged
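A minimal sketch of step 1 (the sampling pattern and zero-mean assumption are mine, added only for illustration): estimate the spread of the correlation noise Z = Y - X from a small set of co-located pixels exchanged between processors.

    import numpy as np

    def estimate_correlation_noise(x_samples, y_samples):
        """Assume Y = X + Z, Z independent of X; estimate sigma_Z from co-located samples."""
        z = np.asarray(y_samples, float) - np.asarray(x_samples, float)
        return float(np.std(z))

    # e.g., exchange every 64th pixel of the neighboring band:
    # sigma_z = estimate_correlation_noise(band_i.ravel()[::64], band_prev.ravel()[::64])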
2. Derive Bit-plane Crossover Probability

    p̂_l = Σ_i ∫∫_{A_i} f_XY(x, y) dx dy

• In practice, only need to sum over a few regions A_i
[Example: X = 8 compared bit-plane by bit-plane against candidate values of Y]

        X=8 | Y: 0  1  2  3  4  5  6  7  8  9 10 11
  msb     1 |    0  0  0  0  0  0  0  0  1  1  1  1
          0 |    0  0  0  0  1  1  1  1  0  0  0  0
          0 |    0  0  1  1  0  0  1  1  0  0  1  1
  lsb     0 |    0  1  0  1  0  1  0  1  0  1  0  1
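The sketch below (my own; it replaces the analytical integration with Monte Carlo and assumes a uniform source with Gaussian correlation noise) shows how a bit-plane crossover probability can be computed from the model:

    import numpy as np

    def bitplane_crossover_prob(sigma_z, l, num_bits=8, n=200_000, seed=0):
        """Pr[bit l of Y != bit l of X], assuming X ~ Uniform{0, ..., 2^num_bits - 1}
        and Y = clip(round(X + Z)) with Z ~ N(0, sigma_z^2)."""
        rng = np.random.default_rng(seed)
        x = rng.integers(0, 2 ** num_bits, size=n)
        z = rng.normal(0.0, sigma_z, size=n)
        y = np.clip(np.rint(x + z), 0, 2 ** num_bits - 1).astype(int)
        return float(np.mean(((x >> l) & 1) != ((y >> l) & 1)))

    # Crossover probabilities shrink toward the most significant bit-planes:
    print([round(bitplane_crossover_prob(2.0, l), 3) for l in range(4)])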
3. Coding Strategy
[Block diagram repeated: wavelet transform → bit-plane extraction; sign/refinement bit-planes → Slepian-Wolf encoder, significance information → zerotree encoder; outputs multiplexed]
Modeling Results
[Plots: number of coded bits vs. significance level for band 4 and band 14, comparing reference-based DSC ("Ref (DSC)"), raw DSC ("Raw (DSC)"), and zerotree coding of the significance map ("Signif. map (ZTC)")]
Results
[Plot: MPSNR (dB) vs. bitrate (bits/pixel/band), comparing DSC (adaptive), DSC (non-adaptive), 3D Wavelet (JPL), latest 3D ICER (Apr 06), FY04 3D ICER, and SPIHT]
Hyperspectral Image Coding
• Modeling:
– Cross-band correlation in the bit-plane domain, estimated from a pixel-domain correlation model (approximated as i.i.d.) using exchanged pixels
• RD Optimization:
– Choice between DSC coding of refinement bits only or of raw bits (a small decision sketch follows below)
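As an illustrative sketch of that choice (my own simplified rule, not the criterion used in the actual system; the fixed overhead term is an arbitrary placeholder): compare the estimated cost of DSC coding a refinement bit-plane, roughly H(p) bits per bit for crossover probability p, against sending the bits raw.

    import math

    def binary_entropy(p):
        return 0.0 if p in (0.0, 1.0) else -p * math.log2(p) - (1 - p) * math.log2(1 - p)

    def choose_refinement_coding(crossover_p, num_bits, overhead_bits=64):
        """Illustrative decision: DSC coding costs roughly H(p) bits per bit plus a fixed
        code/signaling overhead; sending the bit-plane raw costs 1 bit per bit."""
        dsc_cost = binary_entropy(crossover_p) * num_bits + overhead_bits
        raw_cost = 1.0 * num_bits
        return ("DSC", dsc_cost) if dsc_cost < raw_cost else ("raw", raw_cost)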
Free viewpoint switching poses challenges to
multiview compression
When users can choose among different decoding paths, it is not clear which previously reconstructed frame will be available for use in decoding.
[Diagram: view/time grid with candidate reference frames Y0, Y1, Y2 and the current frame X]
Problem Formulation
[Diagram: encode X so that it can be decoded using whichever of Y0, Y1, Y2 is available at the decoder, yielding the reconstruction X̂]
Other Flexible Decoding Examples –
Forward/Backward Frame-by-frame Video Playback
[Diagrams: forward/backward playback, where X may be decoded with either the previous or the next frame (Y0 or Y1) as reference depending on the decoding path; and an error-resilience case, where the available reference may be error free or error corrupted]
Address Viewpoint Switching/Flexible Decoding Within
Closed-Loop Predictive (CLP) Coding Framework
[Diagram, CLP: the encoder computes residues Z0 = X - Y0, Z1 = X - Y1, Z2 = X - Y2 and transmits coded versions of all of them; the decoder adds the appropriate decoded residue to whichever Yi it has (e.g., X̂ = Y2 + Ẑ2)]
– Overhead increases with the number of predictor candidates
– X̂ may not be identical when different Yi are used as the predictor (drifting)
[Diagram, CLP vs. DSC: CLP encodes the residue Z = X - Y and the decoder adds it back to Y; DSC transmits parity information, which the decoder combines with the side information Y to obtain X̂]
Viewpoint Switching (Flexible Decoding):
CLP vs. DSC
[Diagram: CLP vs. DSC rates for flexible decoding with candidate references Y0, Y1, Y2. CLP must transmit all coded residues Z0, Z1, Z2, so its rate is on the order of H(Z0) + H(Z1) + H(Z2); DSC transmits a single description whose rate is governed by the worst-case residue, on the order of max{H(Z0), H(Z1), H(Z2)}]
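To make the rate comparison concrete, here is a small illustrative sketch (mine, not from the talk) that estimates the empirical entropy of each quantized residual Z_i = X - Y_i and compares the CLP cost (sum over all candidate predictors) with the DSC cost (worst case):

    import numpy as np

    def empirical_entropy(symbols):
        """Empirical entropy in bits/symbol of a discrete sequence."""
        _, counts = np.unique(np.asarray(symbols).ravel(), return_counts=True)
        p = counts / counts.sum()
        return float(-(p * np.log2(p)).sum())

    def clp_vs_dsc_rate(x, candidates):
        """x: quantized current frame; candidates: list of quantized references Y_i."""
        h = [empirical_entropy(np.asarray(x, int) - np.asarray(y, int)) for y in candidates]
        return sum(h), max(h)   # (approx. CLP rate, approx. DSC rate), in bits/sample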
Encoding Algorithm –
Motion Estimation and Macroblock Classification
[Flowchart: motion estimation followed by macroblock classification; non-skip macroblocks are DSC coded]
Encoding Algorithm – DSC Coded MB
[Block diagram: for each DSC-coded macroblock M, the K lowest-frequency coefficients after DCT and quantization (W) have their bit-planes b(l) coded either by direct coding or by Slepian-Wolf coding (producing parity information), along with significance coding (s); each candidate predictor fi is DCT-transformed to Yi and passed through model estimation to obtain the correlation parameters αi used by the Slepian-Wolf coder]
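As a small illustrative sketch of the model estimation step (my own; the bit-plane representation and the entropy-based rate rule are assumptions, consistent with the "worst-case noise" modeling summarized later): for each bit-plane of the quantized coefficients, the Slepian-Wolf rate can be driven by the worst crossover probability over all candidate predictors.

    import numpy as np

    def binary_entropy(p):
        return 0.0 if p in (0.0, 1.0) else float(-p * np.log2(p) - (1 - p) * np.log2(1 - p))

    def sw_rate_for_bitplane(w_bits, candidate_bitplanes):
        """Worst-case crossover probability of this bit-plane over all candidate
        predictors Y_i, and the corresponding Slepian-Wolf rate lower bound H(p)."""
        p = max(float(np.mean(np.asarray(w_bits) != np.asarray(y))) for y in candidate_bitplanes)
        return p, binary_entropy(p)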
Experimental Results - Multiview Video Coding
[Diagram: view/time grid with candidate references Y0, Y1, Y2 and current frame X. Plot: PSNR (dB) vs. bit-rate (kbps), comparing Intra, Inter (CLP), and the proposed DSC approach]
Drifting Experiments
[Plots: PSNR vs. view index (0-30), with and without switching, for CLP (left) and DSC (right)]
Scaling Experiments
Experimental Results –
Forward/backward video playback
[Plot: PSNR-Y (dB) vs. bitrate (kbps) for Coastguard CIF, comparing H.263 Intra, H.263 Inter, H.263 with forward and backward predicted residues, and the proposed DSC approach]
Note: inter-frame coding with a single prediction residue cannot support flexible decoding.
Flexible Video Coding
• Modeling:
– Worst case “noise” between data to be sent and all candidate predictors.
• RD Optimization:
– Mode selection tools
Conclusions
• Application definition:
– Careful definition/quantification of the expected gains in terms of another metric of interest (lower memory, parallelism, flexibility, etc.)
• Modeling:
– Probability models are never "given". A good model explains the data while remaining easy to estimate, without hurting RD performance or the other metrics.
• RD Optimization:
– Alternative metrics are useful, but it is RD performance that will "sell" a coding system