GSM Codecs
The GSM standard supports four different but similar compression technologies for
analysing and compressing speech: full-rate, enhanced full-rate (EFR), adaptive
multi-rate (AMR), and half-rate. Although all are lossy (i.e. some data is lost
during compression), these codecs have been optimised to regenerate speech
accurately at the output of a wireless link.
After coding, the bits are re-arranged, convolutionally encoded, interleaved, and
built into bursts for transmission over the air interface. Under extreme error
conditions a frame erasure occurs and the data is lost; otherwise the original data
is re-assembled, potentially with some errors in the less significant bits. The bits
are arranged back into their parametric representation and fed into the decoder,
which uses the data to synthesise the original speech information.
The vocoder model consists of a tone generator (which models the vocal cords) and
a filter that modifies the tone (which models the shape of the mouth and nasal
cavity) (Figure 1). The short-term analysis and filtering determine the filter
coefficients and an error measurement, while the long-term analysis quantifies the
harmonics of the speech.
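As a concrete illustration of the short-term analysis, the sketch below derives the
filter coefficients for one 20 ms frame using the familiar autocorrelation and
Levinson-Durbin approach. It is only an approximation of the real codec: the GSM
full-rate specification uses a fixed-point Schur recursion and transmits the result
as coded log-area ratios, and the function and variable names here are purely
illustrative.

```python
import numpy as np

def short_term_analysis(frame, order=8):
    """Sketch of short-term (LPC) analysis for one 160-sample (20 ms) frame
    using autocorrelation plus the Levinson-Durbin recursion.  The GSM
    full-rate codec itself uses a Schur recursion and log-area ratios."""
    n = len(frame)
    # Autocorrelation for lags 0..order
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0] + 1e-12                      # guard against an all-zero frame
    for i in range(1, order + 1):
        # Reflection coefficient for this stage
        k = -(r[i] + np.dot(a[1:i], r[i - 1:0:-1])) / err
        a[1:i + 1] = a[1:i + 1] + k * a[i - 1::-1][:i]
        err *= (1.0 - k * k)
    return a, err                           # A(z) coefficients and residual energy

# The short-term residual (the input to the long-term analysis) is the frame
# filtered through A(z) = 1 + a[1]z^-1 + ... + a[order]z^-order.
```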
The residual signal from the short-term filtering is segmented into four sub-frames
of 40 samples each. The long-term prediction (LTP) filter models the fine harmonics
of the speech using a combination of current and previous sub-frames. The gain
and lag (delay) parameters for the LTP filter are determined by cross-correlating
the current sub-frame with previous residual sub-frames.
The peak of the cross-correlation determines the signal lag, and the gain is
calculated by normalising the peak cross-correlation against the energy of the
matching past segment. The parameters are
applied to the long-term filter, and a prediction of the current short-term residual is
made. The error between the estimate and the real short-term residual signal—the
long-term residual signal—is applied to the RPE analysis, which performs the data
compression.
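The search can be sketched as follows, assuming a 40-sample sub-frame and the
40-to-120-sample lag range of the full-rate codec; the function name, variable
names, and the way the residual history is held are illustrative rather than taken
from the specification.

```python
import numpy as np

def ltp_parameters(subframe, residual_history, lag_min=40, lag_max=120):
    """Sketch of long-term prediction (LTP) analysis: cross-correlate the
    current 40-sample short-term residual sub-frame with past reconstructed
    residual samples, take the lag at the correlation peak, and derive the
    gain by normalising that peak against the energy of the matching segment."""
    n = len(residual_history)
    best_lag, best_corr = lag_min, -np.inf
    for lag in range(lag_min, lag_max + 1):
        past = residual_history[n - lag:n - lag + len(subframe)]
        corr = np.dot(subframe, past)
        if corr > best_corr:
            best_corr, best_lag = corr, lag
    past = residual_history[n - best_lag:n - best_lag + len(subframe)]
    gain = best_corr / (np.dot(past, past) + 1e-12)   # normalised correlation
    ltp_residual = subframe - gain * past             # error signal passed to RPE
    return best_lag, gain, ltp_residual
```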
The Regular Pulse Excitation (RPE) stage reduces the 40 long-term residual
samples down to four candidate sub-sequences of 13 samples each through a
combination of interleaving and sub-sampling. The optimum sub-sequence is
determined as the one having the least error, and is coded using APCM (adaptive
PCM) into 45 bits.
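A minimal sketch of this grid selection is given below. Selecting the sub-sequence
with the highest energy is equivalent to discarding the samples that carry the
least error; the APCM step is shown only as a crude uniform approximation, and the
names are illustrative.

```python
import numpy as np

def rpe_grid_select(ltp_residual):
    """Sketch of RPE grid selection: split the 40-sample long-term residual
    into four candidate sub-sequences of 13 samples (offsets 0..3, every
    third sample) and keep the one with the highest energy."""
    candidates = [ltp_residual[offset::3][:13] for offset in range(4)]
    energies = [np.dot(c, c) for c in candidates]
    grid = int(np.argmax(energies))                 # 2-bit grid position
    best = candidates[grid]
    # APCM: the full-rate codec quantizes the block maximum to 6 bits and each
    # of the 13 samples relative to it to 3 bits; shown here only as a crude
    # uniform approximation.
    xmax = np.max(np.abs(best)) + 1e-12
    levels = np.clip(np.round(best / xmax * 4), -4, 3).astype(int)
    return grid, xmax, levels
```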
The resulting signal is fed back through an RPE decoder and added to the short-
term residual estimate in order to update the long-term analysis filter memory for
the next frame, thereby completing the feedback loop (Table 2).
The EFR codec is an algebraic code excitation linear prediction (ACELP) codec,
which uses a set of similar principles to the RPE-LTP codec, but also has some
significant differences. The EFR codec uses a 10th-order linear-predictive (short-
term) filter and a long-term filter implemented using a combination of adaptive and
fixed codebooks (sets of excitation vectors).
Figure 2: Diagram of the EFR vocoder model
The pre-processing stage for EFR consists of an 80 Hz high-pass filter, and some
downscaling to reduce implementation complexity. Short-term analysis, on the
other hand, occurs twice per frame and consists of autocorrelation with two
different asymmetric windows, each 30 ms long and concentrated around different
sub-frames. The results are converted to short-term filter coefficients, then to line
spectral pairs (for better transmission efficiency), and quantized to 38 bits.
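The sketch below illustrates these two stages under some stated assumptions: a
generic second-order Butterworth filter and an arbitrary scale factor stand in for
the specification's fixed-point pre-processing, and the two Hanning-based
asymmetric windows are stand-ins for the exact window shapes defined for EFR.

```python
import numpy as np
from scipy.signal import butter, lfilter

def efr_preprocess(speech_8khz):
    """Sketch of the EFR pre-processing stage: 80 Hz high-pass filtering plus
    downscaling.  The filter design and scale factor are illustrative."""
    b, a = butter(2, 80.0 / (8000.0 / 2.0), btype='highpass')
    return 0.5 * lfilter(b, a, speech_8khz)

def dual_window_autocorrelation(history_240, order=10):
    """Twice-per-frame short-term analysis: two asymmetric 30 ms (240-sample)
    windows weighted towards different parts of the frame, each followed by
    autocorrelation.  The window shapes are stand-ins, not the EFR curves."""
    n = 240
    # Asymmetric windows: slow rise over most of the buffer, fast decay at the end
    w_late = np.concatenate([np.hanning(2 * 200)[:200], np.hanning(2 * 40)[40:]])
    w_mid = np.concatenate([np.hanning(2 * 160)[:160], np.hanning(2 * 80)[80:]])
    results = []
    for w in (w_mid, w_late):
        x = history_240 * w
        r = np.array([np.dot(x[:n - k], x[k:]) for k in range(order + 1)])
        results.append(r)
    return results   # each set is converted to LPC, then LSPs, and quantized
```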
In the EFR codec, the adaptive codebook contains excitation vectors that model the
long-term speech structure. Open-loop pitch analysis is performed once per half-
frame, giving two estimates of the pitch lag (delay) for each frame.
The open-loop result is used to seed a closed-loop search, reducing the
computational requirements. Candidate pitch lags are applied to a synthesiser and
the results compared against the original (non-synthesised) input, an approach
known as analysis-by-synthesis, to find the lag with the minimum perceptually
weighted error. The results are coded into 34 bits.
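A stripped-down sketch of this closed-loop search is shown below. It assumes the
open-loop lag is at least one sub-frame long, uses the plain synthesis filter in
place of the perceptual weighting filter, and ignores the fractional lags used by
the real codec; the names are illustrative.

```python
import numpy as np
from scipy.signal import lfilter

def closed_loop_pitch(target, excitation_history, lpc_a, open_loop_lag, subframe=40):
    """Analysis-by-synthesis pitch search: candidate lags around the open-loop
    estimate are synthesised through 1/A(z) and compared with the target,
    keeping the lag and gain with the smallest squared error.  Assumes
    open_loop_lag >= subframe + 3 (shorter lags would need vector repetition)."""
    n = len(excitation_history)
    best_lag, best_gain, best_err = open_loop_lag, 0.0, np.inf
    for lag in range(open_loop_lag - 3, open_loop_lag + 4):
        start = n - lag
        v = excitation_history[start:start + subframe]     # adaptive codebook vector
        y = lfilter([1.0], lpc_a, v)                       # synthesised contribution
        gain = np.dot(target, y) / (np.dot(y, y) + 1e-12)  # best gain for this lag
        err = np.sum((target - gain * y) ** 2)
        if err < best_err:
            best_lag, best_gain, best_err = lag, gain, err
    return best_lag, best_gain    # lag and gain are then quantized for transmission
```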
The residual signal remaining after the quantized adaptive codebook contribution
has been removed is modelled by the algebraic (fixed) codebook, again using an
analysis-by-synthesis approach. The resulting codebook index (pulse positions and
signs) is coded as 35 bits per sub-frame, and the gain as 5 bits per sub-frame.
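The sketch below only illustrates the structure of such a codebook (ten ±1 pulses
on five interleaved position tracks, whose positions and signs make up the 35-bit
index). It places the pulses greedily against the unfiltered target, whereas the
real search is a joint analysis-by-synthesis optimisation; the function name is
illustrative.

```python
import numpy as np

def algebraic_codebook_sketch(residual_target, pulses=10, tracks=5, subframe=40):
    """Greatly simplified fixed-codebook sketch: place +/-1 pulses on
    interleaved position tracks, chosen greedily to match the target."""
    code = np.zeros(subframe)
    for p in range(pulses):
        track = p % tracks                       # two pulses per track
        positions = np.arange(track, subframe, tracks)
        remaining = residual_target - code       # what is still unmodelled
        idx = positions[np.argmax(np.abs(remaining[positions]))]
        code[idx] += np.sign(remaining[idx])     # +/-1 pulse at the best slot
    return code   # the pulse positions and signs form the codebook index
```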
The final stage for the encoder is to update the appropriate memory ready for the
next frame.
Going Adaptive
The principle of the AMR codec is to use a family of closely related codecs, built
from very similar computations, to produce outputs at different rates. In GSM, the
quality of the received air-interface signal is monitored and the speech coding
rate can be modified accordingly. In this way, more protection is applied in areas
of poorer signal by reducing the coding rate and increasing the redundancy, while
in areas of good signal quality the quality of the speech is improved.
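A toy version of this link adaptation is sketched below. The mapping from channel
quality to codec mode and the hysteresis value are invented for illustration only;
real networks use operator-tuned, signalled thresholds.

```python
# Illustrative AMR link adaptation: map a channel-quality measurement (e.g.
# carrier-to-interference ratio) onto one of the eight codec modes.  The
# thresholds below are hypothetical switch points, not standardised values.
AMR_MODES_KBPS = [4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.2, 12.2]
THRESHOLDS_DB = [3.5, 5.0, 6.5, 8.0, 9.5, 11.0, 13.0]

def select_amr_mode(c_over_i_db, current_index, hysteresis_db=1.0):
    """Step down quickly when quality falls below a threshold; step up only
    when quality clears the threshold plus a hysteresis margin."""
    target = 0
    for i, thr in enumerate(THRESHOLDS_DB):
        if c_over_i_db >= thr:
            target = i + 1
    if target > current_index and c_over_i_db < THRESHOLDS_DB[target - 1] + hysteresis_db:
        target = current_index        # not enough margin to step up yet
    return target, AMR_MODES_KBPS[target]
```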
In terms of implementation, an ACELP coder is used. In fact, the 12.2 kbit/s AMR
codec is computationally the same as the EFR codec. For rates lower than 12.2
kbit/s, the short-term analysis is performed only once per frame. For 5.15 kbit/s
and lower, the open-loop pitch lag is estimated only once per frame. The result is
that at lower output bit rates, there are a smaller number of parameters to
transmit, and fewer bits are used to represent them.
The half-rate codec is a vector sum excited linear prediction (VSELP) codec that
uses an analysis-by-synthesis approach similar to the EFR and AMR codecs. The
resulting output is 5.7 kbit/s, which includes 100 bit/s of mode indicator bits
specifying whether the frames are thought to contain voice or not. The mode
indicator allows the codec to operate slightly differently to obtain the best quality.
Half-rate speech coding was first introduced in the mid-1990s, but the public
perception of its speech quality was so poor that it is not generally used today.
However, due to its variable bit-rate output, AMR lends itself nicely to transmission
over a half-rate channel. By limiting the output to the lowest six coding rates
(4.75 to 7.95 kbit/s), the user can still experience the quality benefits of adaptive
speech coding, and the network operator benefits from increased capacity. It is
thought that with the introduction of AMR, use of the half-rate air channel will
become much more widespread.
Computational Complexity
Table 3 shows the time taken to encode and decode a random stream of speech-
like data, and the speed of the operations relative to the GSM full-rate codec.
The process of building the air transmission bursts involves adding redundancy to
the data by convolutional coding. During this process, the most important bits
(Class 1a) are protected most, while the least important bits (Class 2) have no
protection applied.
This frame building process ensures that many errors occurring on the air interface
will be either correctable (using the redundancy), or will have only a small impact
on the speech quality.
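The sketch below shows this unequal error protection for one full-rate frame. The
class sizes (50 class 1a, 132 class 1b and 78 class 2 bits per 260-bit frame)
follow the full-rate channel-coding scheme, but the parity check and the rate-1/2
convolutional encoder are simplified stand-ins rather than the exact GSM
polynomials.

```python
# Sketch of the full-rate channel-coding split for one 20 ms frame of 260
# speech bits: class 1a bits get a parity check plus convolutional coding,
# class 1b bits get convolutional coding only, class 2 bits go unprotected.
CLASS_1A, CLASS_1B, CLASS_2 = 50, 132, 78          # 260 speech bits in total

def parity_bits(bits):
    """Toy 3-bit check over the class 1a bits (stand-in for the GSM CRC)."""
    return [sum(bits[i::3]) % 2 for i in range(3)]

def convolutional_encode(bits):
    """Toy rate-1/2 convolutional encoder (illustrative generators, not the
    polynomials defined for GSM)."""
    state = [0, 0, 0, 0]
    out = []
    for b in bits:
        out.append((b + state[0] + state[3]) % 2)
        out.append((b + state[0] + state[1] + state[3]) % 2)
        state = [b] + state[:3]
    return out

def build_coded_frame(speech_bits):
    """Apply unequal error protection to one frame (a list of 260 ints,
    ordered most important first)."""
    assert len(speech_bits) == CLASS_1A + CLASS_1B + CLASS_2
    c1a = speech_bits[:CLASS_1A]
    c1b = speech_bits[CLASS_1A:CLASS_1A + CLASS_1B]
    c2 = speech_bits[CLASS_1A + CLASS_1B:]
    check = parity_bits(c1a)                        # used to detect frame erasures
    protected = convolutional_encode(c1a + check + c1b + [0, 0, 0, 0])  # + tail bits
    return protected + c2                           # 378 coded + 78 unprotected = 456 bits
```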
Future Outlook
The current focus for speech codecs is to produce a result that has a perceptually
high quality at very low data rates by attempting to mathematically simulate the
mechanics of human voice generation. With the introduction of 2.5G and 3G
systems, it is likely that two different applications of speech coding will be
developed.
The first will be comparatively low bandwidth speech coding, most likely based on
the current generation of CELP codecs. Wideband AMR codecs have already been
standardised for use with 2G and 2.5G technologies and these will utilise the
capacity gains from EDGE deployment.
The second will make more use of the wider bandwidth, employing a range of
different techniques probably based on current psychoacoustic coding, a technique
in widespread use today for MP3 audio compression.
There is no doubt that speech quality over mobile networks will improve, but it may
be some time before wideband codecs are standardised and integrated with fixed
wire-line networks, potentially leading to CD-quality speech communications
worldwide.