Encoding of Waveforms To Compress Information
Encoding of Waveforms To Compress Information
Image Encoding
Makes use of suitable transforms Uses special techniques
Transmits only the difference between image frames
5-1
Amplitude of a train of pulses is modulated: Pulse Amplitude Signal Amplitude Width of a train of pulses is modulated: Pulse Width Signal Amplitude Position of a train of pulses is modulated: Pulse Position Signal Amplitude
t
PWM
PPM
t
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-2
t y(t)
1 0 0 1 1 1 1 0 1 0 11 PCM
PCM = Any Analog to Digital conversion where the result is a serial bit stream. Several methods of converting and transmitting PCM exist.
5-3
x
RANDOM NOISE GAIN
SPEECH
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-4
LOWPASS FILTER
CODED
BANDPASS FILTER 16
RECTIFIER
LOWPASS FILTER
MUX OUTPUT
PITCH DETECTOR
Pitch detector estimates the frequency and aids in distinguishing voiced and unvoiced segments Outputs are multiplexed to produce coded speech signal
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-5
+
DEMUX
DAC SPEECH
PULSE SOURCE
Pitch information switches between Voiced - Pulse Source and Unvoiced - Random Noise sounds Pitch produces correct frequency for voiced sounds DSP is the ideal medium for implementing vocoders
Filters may be implemented efficiently Speech spectrum can be analyzed easily Vocal tract can be modeled easily
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-6
Image Coding
Bandwidth required for current TV Image Resolution
NTSC: 484 x 427 pixels, 29.94 Hz frame rate
PAL: 580 x 425 pixels, 25 Hz frame rate
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-7
Bandwidth for TV
White Pixel
For black and white picture, bandwidth required is approximately 3 MHz Each pixel represents one sample so the required bandwidth is 6 MHz for a horizontal resolution of 3 MHz For color pictures, basic rate is about 150 MBits per second
Black Pixel
3 MHz A
MHz
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-8
Transform Coding
Transform coding of images reduces bandwidth requirements
Most of the information in a picture is at low frequencies
Transform coders preserve information at low frequencies Ignoring transformed signals with small coefficients
Reduces bandwidth required Does not significantly degrade picture quality
Huffman coding may be used on transformed signals to further reduce the bandwidth requirements
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-9
Video Compression
PRESENT FRAME VIDEO IN
DCT
HUFFMAN CODER
COEFFICIENT VALUES
MOTION DETECTOR
IMAGE REGENERATION
DISPLACEMENT VECTORS
5-10
Video Decompression
COEFFICIENT VALUES IDCT
DISPLACEMENT VECTORS
FRAME STORE
+
DECODED PICTURE
H.261 and more recent H.320 standards are computationally intensive DSPs provide the best implementation platform
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-11
Next stage codes and orders coefficients Finally, coefficients are Huffman encoded to reduce amount of data
ENCODED HUFFMAN DECODER COEFFICIENT DECODER INVERSE QUANTIZER DECODED IDCT PICTURE
DATA
5-12
QUANTIZE
Each frame is split into small blocks Blocks are transform-coded by DCT Coefficients are coded with one of the following:
Forward or Backward predictive coding or a combination of both
This scheme makes use of the similarity between the present frame and either the previous or the next frame Finally, blocks are quantized for transmission
LECTURE 5
Copyright 1998, Texas Instruments Incorporated All Rights Reserved
5-13
Summary
Variants of pulse coded modulation (PCM) are widely used in waveform encoding Speech coding makes use of its special properties such as:
A single black and white TV frame needs approximately a quarter of a million bits Color frames need even more
FFT is not a suitable coder for images Discrete cosine transform (DCT) is used widely
For moving images, coding systems exploit the similarity between frames
Only changes to the previous frame are transmitted MPEG uses similarity to next as well as previous frame
5-14