3GPP TS 46.062: Comfort noise aspects for Enhanced Full Rate speech traffic channels

Specification: 46062

🟢Approvedv800

Rel-8

Summary

This document specifies the requirements for the correct operation of background acoustic noise evaluation, noise parameter encoding/decoding and comfort noise generation in Mobile Stations (MSs) and Base Station Systems (BSSs) during Discontinuous Transmission (DTX) on Enhanced Full Rate speech traffic channels.

Specification Intelligence

This is a Technical Document in the Unknown Series series, focusing on Technical Document. The document is currently in approved by tsg and under change control and is under formal change control.

Classification

Type: Technical Document

Subject: Unknown Series

Series: 46.xxx

Target: Technical Implementers

Specifics

Status: Change Control

Version

800.0.0

Release 800

0 technical • 0 editorial

Full Document v800

3GPP TS 46.062 v. 8.0.0

3GPP TS 46.062 V8.0.0 (2008-12)

Technical Specification

3rd Generation Partnership Project;

Technical Specification Group Services and System Aspects;

Comfort noise aspects for Enhanced Full Rate (EFR)

speech traffic channels

(Release 8)

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â

The present document has been developed within the 3^rd Generation Partnership Project (3GPP^TM) and may be further elaborated for the purposes of 3GPP.Â Â Â Â
The present document has not been subject to any approval process by the 3GPPOrganizational Partners and shall not be implemented.Â
This Specification is provided for future development work within 3GPPonly. The Organizational Partners accept no liability for any use of this Specification.
Specifications and reports for implementation of the 3GPP^TM system should be obtained via the 3GPP Organizational Partners' Publications Offices.

Keywords

GSM, speech, codec

3GPP

Postal address

3GPP support office address

650 Route des Lucioles - Sophia Antipolis

Valbonne - FRANCE

Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16

Internet

http://www.3gpp.org

Copyright Notification

No part may be reproduced except as authorized by written permission.
The copyright and the foregoing restriction extend to reproduction in all media.

UMTSâ¢ is a Trade Mark of ETSI registered for the benefit of its members

3GPPâ¢ is a Trade Mark of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners
LTEâ¢ is a Trade Mark of ETSI currently being registered for the benefit of its Members and of the 3GPP Organizational Partners

GSMÂ® and the GSM logo are registered and owned by the GSM Association

Contents

Foreword................................................................................................................................................ 4

1....... Scope........................................................................................................................................... 5

2....... References.................................................................................................................................... 5

3....... Definitions, symbols and abbreviations........................................................................................... 5

3.1......... Definitions............................................................................................................................................................................ 5

3.2......... Symbols................................................................................................................................................................................ 6

3.3......... Abbreviations....................................................................................................................................................................... 6

4....... General......................................................................................................................................... 7

5....... Functions on the transmit (TX) side................................................................................................ 7

5.1......... Background acoustic noise evaluation............................................................................................................................ 8

5.2......... Modification of the speech encoding algorithm during SID frame generation...................................................... 11

5.3......... SID-frame encoding......................................................................................................................................................... 11

6....... Functions on the receive (RX) side............................................................................................... 12

6.1......... Averaging and decoding of the LP and fixed codebook gain parameters............................................................... 12

6.2......... Comfort noise generation and updating........................................................................................................................ 14

7....... Computational details.................................................................................................................. 15

Annex A (informative):....... Change history....................................................................................... 16

Foreword

This Technical Specification has been produced by the 3^rd Generation Partnership Project (3GPP).

The present document defines operation of the background acoustic noise evaluation, noise parameter encoding/decoding and comfort noise generation in Mobile Stations (MSs) and Base Station Systems (BSSs) during Discontinuous Transmission (DTX) on Enhanced Full Rate speech traffic channels within the digital cellular telecommunications system.

The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows:

Version x.y.z

where:

xÂ Â Â the first digit:

1Â Â Â presented to TSG for information;

2Â Â Â presented to TSG for approval;

3Â Â Â or greater indicates TSG approved document under change control.

yÂ Â Â the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc.

zÂ Â Â the third digit is incremented when editorial only changes have been incorporated in the document.

1Â Â Â Â Â Â Â Â Scope

The present document gives the detailed requirements for the correct operation of the background acoustic noise evaluation, noise parameter encoding/decoding and comfort noise generation in Mobile Stations (MSs) and Base Station Systems (BSSs) during Discontinuous Transmission (DTX) on Enhanced Full Rate speech traffic channels.

The requirements described in the present document are mandatory for implementation in all GSM MSs capable of supporting the Enhanced Full Rate speech traffic channel.

The receiver requirements are mandatory for implementation in all GSM BSSs capable of supporting the Enhanced Full Rate speech traffic channel, the transmitter requirements only for those where downlink DTX will be used.

In case of discrepancy between the requirements described in the present document and the fixed point computational description of these requirements contained in GSM 06.53 [2], the description in GSM 06.53 [2] will prevail.

2Â Â Â Â Â Â Â Â References

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

Â· References are either specific (identified by date of publication, edition number, version number, etc.) or nonâspecific.

Â· For a specific reference, subsequent revisions do not apply.

Â· For a non-specific reference, the latest version applies.Â In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.

[1]Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â GSM 01.04: "Digital cellular telecommunications system (Phase 2+); Abbreviations and acronyms".

[2]Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â GSM 06.53: "Digital cellular telecommunications system (Phase 2+); ANSI-C code for the GSM Enhanced Full Rate (EFR) speech codec".

[3]Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â GSM 06.60: "Digital cellular telecommunications system (Phase 2+); Enhanced Full Rate (EFR) speech transcoding".

[4]Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â GSM 06.61: "Digital cellular telecommunications system (Phase 2+); Substitution and muting of lost frame for Enhanced Full Rate (EFR) speech traffic channels".

[5]Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â GSM 06.81: "Digital cellular telecommunications system (Phase 2+); Discontinuous transmission (DTX) for Enhanced Full Rate (EFR) speech traffic channels".

3Â Â Â Â Â Â Â Â Definitions, symbols and abbreviations

3.1Â Â Â Â Â Â Â Definitions

For the purposes of the present document, the following terms and definitions apply:

frame: time interval of 20 ms corresponding to the time segmentation of the Enhanced Full Rate speech transcoder, also used as a short term traffic frame.

SID frame: frame characterized by the SID (SIlence Descriptor) codeword. It conveys information on the acoustic background noise.

SID codeword: fixed bit pattern for labelling a traffic frame as a SID frame.

SID field: bit positions of the SID codeword within a SID frame.

speech frame: traffic frame that cannot be classified as a SID frame.

VAD flag: Voice Activity Detection flag.

SP flag: SPeech flag.

Other definitions of terms used in the present document can be found in GSM 06.60 [3] and GSM 06.81 [5]. The overall operation of DTX is described in GSM 06.81 [5].

3.2Â Â Â Â Â Â Â Symbols

For the purposes of the present document, the following symbols apply. Boldface symbols are used for vector variables.

Â Â Â Unquantized LSF vector

Â Â Â Quantized LSF vector

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â mth unquantized LSF vector of the frame

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â mth quantized LSF vector of the frame

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Reference LSF parameter vector

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Averaged LSF parameter vector

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Unquantized fixed codebook gain

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Quantized fixed codebook gain

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Reference fixed codebook gain

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Averaged fixed codebook gain

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Linear prediction residual signal

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Computed LSF parameter prediction residual

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Quantized LSF parameter prediction residual

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Computed fixed codebook gain correction factor

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Quantized fixed codebook gain correction factor

Â Â Â Â Â Â Â Â Â Â Â Â

3.3Â Â Â Â Â Â Â Abbreviations

For the purposes of the present document, the following abbreviations apply:

BSSÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Base Station Subsystem

DTXÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Discontinuous Transmission

LPÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Linear Prediction

LSFÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Line Spectral Frequency

LSPÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Line Spectral Pair

MSÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Mobile Station

RXÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Receive

SIDÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â SIlence Descriptor

TXÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Transmit

VADÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Voice Activity Detector

For abbreviations not given in this clause, see GSM 01.04 [1].

4Â Â Â Â Â Â Â Â General

A basic problem when using DTX is that the background acoustic noise, which is transmitted together with the speech, would disappear when the radio transmission is cut, resulting in discontinuities of the background noise. Since the DTX switching can take place rapidly, it has been found that this effect can be very annoying for the listener - especially in a car environment with high background noise levels. In bad cases, the speech may be hardly intelligible.

The present document specifies the way to overcome this problem by generating on the receive (RX) side synthetic noise similar to the transmit (TX) side background noise. The comfort noise parameters are estimated on the TX side and transmitted to the RX side before the radio transmission is switched off and at a regular low rate afterwards. This allows the comfort noise to adapt to the changes of the noise on the TX side.

5Â Â Â Â Â Â Â Â Functions on the transmit (TX) side

The comfort noise evaluation algorithm uses the following parameters of the GSM Enhanced Full Rate speech encoder, defined in GSM 06.60 [3]:

-Â Â Â Â the unquantized and quantized Linear Prediction (LP) parameters, using the Line Spectral Pair (LSP) representation, where the unquantized Line Spectral Frequency (LSF) vector is given by , the quantized LSF vector is given by , and the two sets of unquantized and quantized LSF vectors (one for each half of a frame) are given by , , Â and , respectively;

-Â Â Â Â the quantized fixed-codebook gain .

The algorithm also computes the following parameters to assist in comfort noise generation:

-Â Â Â Â the reference LSF parameter vector Â (average of the quantized LSF parameters of the hangover period);

-Â Â Â Â the averaged LSF parameter vector Â (average of the LSF parameters of the eight most recent frames);

-Â Â Â Â the reference fixed codebook gain Â (average of the quantized fixed codebook gain values of the hangover period);

-Â Â Â Â the averaged fixed codebook gain Â (average of the fixed codebook gain values of the eight most recent frames);

-Â Â Â Â the unquantized fixed codebook gain .

These parameters give information on the level (, , , ) and the spectrum (, , , , , ) of the background noise.

Two of the evaluated comfort noise parameters (Â and ) are encoded into a special frame, called a Silence Descriptor (SID) frame, for transmission to the RX side. Since the reference LSF parameter vector Â and the reference fixed codebook gain Â can be evaluated in the same way in the encoder and decoder, as given in clause 5.1, no transmission of these parameters is necessary.

The averaged LSF parameter and fixed codebook gain values, Â and , are computed in the encoder using both quantized and unquantized parameter values if the period of the eight most recent frames (the SID averaging period) is overlapping with the hangover period (the parameters from the frames overlapping with the hangover period have quantized values, while the parameters of the more recent frames of the SID averaging period have unquantized values). If the period of the eight most recent frames is non-overlapping with the hangover period, the averaged LSF parameter and fixed codebook gain values are computed using only unquantized parameter values.

The SID frame also serves to initiate the comfort noise generation on the receive side, as a SID frame is always sent at the end of a speech burst, i.e., before the radio transmission is terminated.

The scheduling of SID or speech frames on the radio path is described in GSM 06.81 [5].

5.1Â Â Â Â Â Â Â Background acoustic noise evaluation

The comfort noise parameters to be encoded into a SID frame are calculated over N = 8 consecutive frames marked with VAD = 0, as follows:

The averaged LSF parameter vector Â of the frame i shall be computed according to the equation:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (1)

where:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the mth (unquantized) LSF parameter vector of the current frame i (n = 0);

Â Â Â Â Â Â Â is the mth (quantized or unquantized) LSF parameter vector of one of the last frames (n = 1,..., 7);

Â Â Â Â Â nÂ Â Â Â Â Â Â Â is the averaging period index (n = 0, 1,..., 7);

Â Â Â Â Â mÂ Â Â Â Â Â Â Â Â is the LSF parameter vector index within a frame (1 or 2);

Â Â Â Â Â iÂ Â Â Â Â Â Â Â Â Â is the frame index.

NOTE:Â Â Â Â Â When the averaging is performed at the end of the hangover period (first SID update), all of the LSF parameter vectors Â of the 7 previous frames (the hangover period) have quantized values, while the LSF parameter vectors Â of the current frame i have unquantized values. In the subsequent SID updates, the LSF parameter vectors of the SID averaging period in the frames overlapping with the hangover period have quantized values, while the parameter vectors of the more recent frames of the SID averaging period have unquantized values.

The averaged LSF parameter vector Â of the frame i is encoded using the same encoding tables that are also used by the GSM Enhanced Full Rate speech codec for the encoding of the non-averaged LSF parameter vectors in ordinary speech encoding mode, but the quantization algorithm is modified in order to support the quantization of comfort noise. The LSF parameter prediction residual to be quantized is obtained according to the following equation:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (2)

where:

Â Â Â Â Â is the averaged LSF parameter vector at the current frame i

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the reference LSF parameter vector

Â Â Â Â Â Â Â Â Â Â Â Â Â is the computed LSF parameter prediction residual at the current frame i

Â Â Â Â Â iÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the frame index;

NOTE:Â Â Â Â Â This prediction residual is used for both halves of the frame in the quantization algorithm. The computation of the reference LSF parameter vector Â is made on the basis of the quantized LSF parameters, Â and , by averaging the parameters over the hangover period of 7 frames, according to the following equation:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (3)

where:

Â Â Â Â Â Â Â is the mth quantized LSF parameter vector of one of the frames of the hangover period
Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (n = 1,..., 7);

nÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the hangover period frame index (n = 1,..., 7);

mÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the LSF parameter index within a frame (1 or 2);

kÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the frame index.

For each comfort noise insertion period, the computation of the reference LSF parameter vector Â is done only once at the end of the hangover period and for the rest of the comfort noise insertion period Â will be frozen. The reference LSF parameter vector Â is evaluated in the decoder in the same way as in the encoder, because during the hangover period the same LSF parameter vectors Â are available at the encoder and decoder. An exception to this are the cases when transmission errors are severe enough to cause the parameters to become unusable, and the frame substitution procedure is activated (see GSM 06.61 [4]). In these cases, the modified parameters obtained from the frame substitution procedure are used instead of the received parameters.

The fixed codebook gain values shall be averaged and updated in every subframe according to the equation:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (4)

where:

Â Â Â Â Â Â Â Â Â Â is the (unquantized) fixed codebook gain in the current subframe (n=0);

Â Â Â Â Â Â is the (quantized or unquantized) fixed codebook gain in one of the past

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â subframes (n = 1,..., 28);

Â Â Â Â Â nÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the averaging period index (n = 0, 1,..., 28);

NOTE:Â Â Â Â Â When the averaging is started at the end of the hangover period (first SID update), all of the fixed codebook gains Â of the 28 (n=1,..., 28) previous subframes (the hangover period) have quantized values, while the fixed codebook gains Â of the current subframe has an unquantized value. In the subsequent SID updates, the fixed codebook gain values of the SID averaging period in the subframes overlapping with the hangover period have quantized values, while the parameter vectors of the more recent subframes of the SID averaging period have unquantized values.

Since most parts of the subframe processing clause in the encoder are switched off when the SP flag = "0" (to minimize the average complexity of the speech encoder algorithm), the unquantized fixed codebook gain is not directly available for gain averaging. Due to this, the unquantized fixed codebook gain is separately computed, based on the energy of the LP residual signal in each subframe, according to the following equation:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (5)

where:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the unquantized fixed codebook gain of the current subframe j;

Â Â Â Â Â Â Â Â Â is the lth sample of the LP residual in the current subframe j;

Â Â Â Â Â jÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the subframe index (j = 1,..., 4);

Â Â Â Â Â lÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the sample index (l = 1,..., 40).

NOTE:Â Â Â Â Â The computed energy of the LP residual signal is divided by the value of 10 to yield the energy for one excitation pulse, since during comfort noise generation, the subframe excitation signal (pseudo noise) has 10 non-zero samples, whose amplitudes can take values of +1 or â1.

The averaged fixed codebook gain value Â of the current subframe is encoded using the nonâaveraged fixed codebook gain values in ordinary speech encoding mode, but the quantization algorithm is modified in order to support comfort noise quantization. The fixed codebook gain correction factor Â to be quantized is obtained according to the following equation:

where:

Â Â Â Â Â Â Â Â Â Â Â is the averaged fixed codebook gain value in the current subframe;

Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the reference fixed codebook gain;

The computation of the reference fixed codebook gain Â is made on the basis of the quantized fixed codebook gain parameters , by averaging the parameter values over the hangover period of 7 frames according to the following equation:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (7)

where:

Â Â Â Â Â is the quantized fixed codebook gain parameter value in subframe j of one of the frames of the hangover period (n = 1,..., 7);

Â Â Â Â Â nÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the hangover period frame index (n = 1, 2,..., 7);

Â Â Â Â Â kÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the frame index;

Â Â Â Â Â jÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the subframe index (j = 1,..., 4);

For each comfort noise insertion period, the computation of the reference fixed codebook gain Â is done only once at the end of the hangover period and for the rest of the comfort noise insertion period Â will be frozen. The reference fixed codebook gain Â can be evaluated in the decoder in the same way as in the encoder, because during the hangover period the same quantized fixed codebook gain values Â are available at the encoder and decoder. An exception to this are the cases when transmission errors are severe enough to cause the parameters to become unusable, and the frame substitution procedure is activated (see GSM 06.61 [4]). In these cases, the modified parameters obtained from the frame substitution procedure are used instead of the received parameters.

The hangover period is defined in GSM 06.81 [5]. It is a period added at the end of a speech burst in which no voice activity is detected (VAD flag = "0"), but the speech encoder stays for the processing of 7 speech frames in speech encoding mode (SP flag = "1"). This hangover period and the first SID frame are used for averaging the comfort noise parameters contained in the first SID frame.

5.2Â Â Â Â Â Â Â Modification of the speech encoding algorithm during SID frame generation

When the SP flag is equal to "0" the speech encoding algorithm is modified in the following way:

-Â Â Â Â The non-averaged LP parameters which are used to derive the filter coefficients of the filters Â and Â of the speech encoder are not quantized.

-Â Â Â Â The open loop pitch lag search is performed, but the closed loop pitch lag search is inactivated. The adaptive codebook gain is set to zero.

-Â Â Â Â No fixed codebook search is made. In each subframe the pulse positions and signs of the fixed codebook excitation are locally generated using uniformly distributed pseudo random numbers. The excitation pulses take values of +1 and â1 when comfort noise is generated. The fixed codebook comfort noise excitation generation algorithm is defined in clause 6.2.

-Â Â Â Â The memory of weighting filter Â is set to zero, i.e., the memory of Â is not updated.

-Â Â Â Â The ordinary LP parameter quantization algorithm is inactive. At the end of the hangover period the reference LSF parameter vector Â is calculated as defined in clause 5.1. For the rest of the comfort noise insertion period Â is frozen. The averaged LSF parameter vector Â is calculated each time a new SID frame is to be sent to the Radio Subsystem. This parameter vector is encoded into the SID frame as defined in clause 5.1.

-Â Â Â Â The ordinary fixed codebook gain quantization algorithm is inactive. At the end of the hangover period the reference fixed codebook gain Â is calculated as defined in clause 5.1. For the rest of the comfort noise insertion period Â is frozen. The averaged fixed codebook gain value Â is calculated each time a new SID frame is to be sent to the Radio Subsystem. This gain value is encoded into the SID frame as defined in clause 5.1.

-Â Â Â Â The predictor memories of the ordinary LP parameter quantization and fixed codebook gain quantization algorithms are reset when SP flag = "0", so that the quantizers start from their initial states when the speech activity begins again.

-Â Â Â Â The computation of the unquantized fixed codebook gain is performed based on the energy of the LP residual signal.

5.3Â Â Â Â Â Â Â SID-frame encoding

The SID-frame encoding algorithm exploits the fact that only some of the 244 bits in a frame are needed to code the comfort noise parameters. The other bits can then be used to mark the SID-frame by means of a fixed bit pattern, called the SID code word.

The quantization indices of the LP parameters are replaced by the quantization indices derived from the averaged LSF parameter vector . The encoding of the quantization indices is defined in clause 5.1.

The fixed codebook gain quantization indices are replaced by the quantization index derived from the averaged fixed codebook gain value , encoded as defined in clause 5.1, repeated four times inside the frame.

The SID code word consists of 95 bits which are all set to one. The bits of the SID code word are inserted in the SID field as defined in table 1. All of the bits in the SID field are in the error protection Class I.

The remaining bits in the SID frame are set to zero. The use of these bits is for further study.

Table 1: SID codeword

Parameter	Number of bits	Bit positions (b0 = LSB)
LTP LAG 1	2	b0, b1
LTP LAG 2	3	b0, b1, b2
LTP LAG 3	2	b0, b1
LTP LAG 4	4	b0, b1, b2, b3
LTP GAIN 1	3	b0, b1, b2
LTP GAIN 2	3	b0, b1, b2
LTP GAIN 3	4	b0, b1, b2, b3
LTP GAIN 4	4	b0, b1, b2, b3
PULSE 1 of 1st subfr.	4	b0, b1, b2, b3
PULSE 2 of 1st subfr.	4	b0, b1, b2, b3
PULSE 3 of 1st subfr.	4	b0, b1, b2, b3
PULSE 4 of 1st subfr.	4	b0, b1, b2, b3
PULSE 5 of 1st subfr.	2	b2,b3
PULSE 1 of 2nd subfr.	4	b0, b1, b2, b3
PULSE 2 of 2nd subfr.	4	b0, b1, b2, b3
PULSE 3 of 2nd subfr.	4	b0, b1, b2, b3
PULSE 4 of 2nd subfr.	4	b0, b1, b2, b3
PULSE 5 of 2nd subfr.	2	b2,b3
PULSE 1 of 3rd subfr.	4	b0, b1, b2, b3
PULSE 2 of 3rd subfr.	4	b0, b1, b2, b3
PULSE 3 of 3rd subfr.	4	b0, b1, b2, b3
PULSE 4 of 3rd subfr.	4	b0, b1, b2, b3
PULSE 5 of 3rd subfr.	2	b2,b3
PULSE 1 of 4th subfr.	4	b0, b1, b2, b3
PULSE 2 of 4th subfr.	2	b2, b3
PULSE 3 of 4th subfr.	4	b0, b1, b2, b3
PULSE 4 of 4th subfr.	4	b0, b1, b2, b3
PULSE 5 of 4th subfr.	2	b2,b3

The parameters in table 1 are defined in GSM 06.60 [3].

6Â Â Â Â Â Â Â Â Functions on the receive (RX) side

The situations in which comfort noise shall be generated on the receive side are defined in GSM 06.81 [5]. Generally speaking, the comfort noise generation is started or updated whenever a valid SID frame is received.

6.1Â Â Â Â Â Â Â Averaging and decoding of the LP and fixed codebook gain parameters

When speech frames are received by the decoder the LP and the fixed codebook gain parameters of the last seven speech frames shall be kept in memory. The decoder counts the number of frames elapsed since the last SID frame was updated and passed to the RSS by the encoder. Based on this count, the decoder determines whether or not there is a hangover period at the end of the speech burst (if at least 31 frames have elapsed since the last SID update when the first SID frame after a speech burst arrives, the hangover period has existed at the end of the speech burst).

As soon as a SID frame is received, and the hangover period is detected at the end of the speech burst, the stored LP and fixed codebook gain parameters shall be averaged to obtain the reference LSF parameter vector Â and the reference fixed gain codebook value . The reference LSF parameter vector and the reference fixed codebook gain value shall be frozen and used for the actual comfort noise insertion period.

The averaging procedure for obtaining the reference parameters is as follows:

-Â Â Â Â when a speech frame is received, the LSF and fixed codebook gain parameters are decoded and stored in memory;

-Â Â Â Â when the first SID frame is received, and the hangover period is detected at the end of the speech burst, the stored LSF and fixed codebook gain parameters are averaged in the same way as in the speech encoder as follows (see also clause 5.1):

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (8)

where:

Â Â Â Â Â Â Â is the mth quantized LSF parameter vector of one of the frames of the hangover period
Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (n = 1,..., 7);

Â Â Â Â Â nÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the hangover period frame index (n = 1, 2,..., 7);

Â Â Â Â Â mÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the LSF parameter index within a frame (1 or 2);

Â Â Â Â Â kÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the frame index.

Â Â Â Â Â and

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (9)

where:

Â Â Â Â Â is the quantized fixed codebook gain parameter value in subframe j of one of the Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â frames of the hangover period (n = 1,..., 7);

Â Â Â Â Â nÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the hangover period frame index (n = 1,..., 7);

Â Â Â Â Â kÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the frame index;

Â Â Â Â Â jÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the subframe index (j = 1,..., 4).

Once the reference LSF parameter vector has been computed, the averaged LSF parameter vector Â of the frame i (encoded into the SID frame) can be reproduced at the decoder each time a SID update frame is received, according to the equation:

where:

Â Â Â Â Â is the quantized, averaged LSF parameter vector at the current frame i to be used for comfort noise Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â generation;

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the reference LSF parameter vector;

Â Â Â Â Â Â Â Â Â Â Â Â Â is the received quantized LSF parameter prediction residual at the current frame i;

Â Â Â Â Â iÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the frame index.

The averaged fixed codebook gain Â of the frame i (encoded into the SID frame) can be similarly reproduced at the decoder each time a SID update frame is received, according to the following equation:

Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â (11)

where:

Â Â Â Â Â is the averaged fixed codebook gain value in the current frame i to be used for comfort noise Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â generation;

Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the reference fixed codebook gain;

Â Â Â Â Â Â Â Â Â Â Â is the received quantized fixed codebook gain correction factor in the current frame i;

Â Â Â Â Â iÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â is the frame index.

6.2Â Â Â Â Â Â Â Comfort noise generation and updating

The comfort noise generation procedure uses the GSM Enhanced Full Rate speech decoder algorithm defined in GSM 06.60 [3].

When comfort noise is to be generated, the various encoded parameters are set as follows:

In each subframe, the pulse positions and signs of the fixed codebook excitation are locally generated using uniformly distributed pseudo random numbers. The excitation pulses take values of +1 and â1 when comfort noise is generated. The fixed codebook comfort noise excitation generation algorithm works as follows:

Â Â Â Â Â for (i = 0; i < 40; i++)Â Â Â Â Â code[i] = 0;

Â Â Â Â Â for (i = 0; i < 10; i++) {

Â Â Â Â Â j = random(4);

Â Â Â Â Â idx = j * 10 + I;

Â Â Â Â Â if (random(2) == 1)Â Â code[idx] = 1;

Â Â Â Â Â elseÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â code[idx] = â1;

Â Â Â Â Â }

where:

Â Â Â Â Â code[0..39]Â fixed codebook excitation buffer;

Â Â Â Â Â random(4) Â Â Â Â Â generates a random integer value, uniformly distributed between 0 and 3;

Â Â Â Â Â random(2) Â Â Â Â Â generates a random integer value, uniformly distributed between 0 and 1.

The fixed codebook gain values are those received in the SID frame.

The adaptive codebook gain values in each subframe are set to 0.

The pitch delay values in each subframe are set to 40.

The LP filter parameters used are those received in the SID frame.

The predictor memories of the ordinary LP parameter and fixed codebook gain quantization algorithms are reset when SP flag = "0", so that the quantizers start from their initial states when the speech activity begins again.

With these parameters, the speech decoder now performs the standard operations described in GSM 06.60 [3] and synthesizes comfort noise.

Updating of the comfort noise parameters (fixed codebook gain and LP filter parameters) occurs each time a valid SID frame is received, as described in GSM 06.81 [5].

When updating the comfort noise, the parameters above should preferably be interpolated over the SID update period to obtain smooth transitions.

7Â Â Â Â Â Â Â Â Computational details

A low level description has been prepared in form of an ANSI C source code which is part of GSM 06.53 [2].

Annex A (informative):
Change history

Change history
SMG No.	TDoc. No.	CR. No.	Clause affected	New version	Subject/Comments
SMG#22				4.0.1	ETSI Publication
SMG#20				5.1.2	Release 1996 version
SMG#27				6.0.0	Release 1997 version
SMG#29				7.0.0	Release 1998 version
				7.0.1	Version update to 7.0.1 for Publication
SMG#31				8.0.0	Release 1999 version
				8.0.1	Update to Version 8.0.1 for Publication

Change history
Date	TSG #	TSG Doc.	CR	Rev	Subject/Comment	Old	New
03-2001	11				Version for Release 4		4.0.0
06-2002	16				Version for Release 5	4.0.0	5.0.0
12-2004	26				Version for Release 6	5.0.0	6.0.0
06-2007	36				Version for Release 7	6.0.0	7.0.0
12-2008	42				Version for Release 8	7.0.0	8.0.0

Version Control

Toto je jediná verze této specifikace.

v800

Download & Access

Download ZIP Browse Directory

46062-800

Technical Details

AI Classification

Category: 7. Testování a interoperabilita

Subcategory: 7.1 Conformance Testing

Function: Test specification

Relevance: 7/10

Version Information

Release: Rel-8

Version: 800

Series: 46_series

Published: 2008-12

Document Info

Type: Technical Specification

TSG: Services and

Keywords & Refs

Keywords:

GSM

Partners

Contributors:

ETSITTCCCSA+3

File Info

File: 46062-800

Processed: 2025-06-22

3GPP Spec Explorer - Enhanced specification intelligence