Description
Algebraic Code-Excited Linear Prediction (ACELP) is a sophisticated speech coding algorithm that combines linear prediction with codebook-based excitation. The algorithm works by analyzing speech signals in frames, typically 20-30 milliseconds, and separating them into two main components: the spectral envelope (representing the vocal tract filter) and the excitation signal (representing the glottal pulses). The linear prediction component models the vocal tract using a set of linear prediction coefficients that are transformed into Line Spectral Pairs (LSPs) for more efficient quantization and transmission.
The excitation signal in ACELP is generated using algebraic codebooks, which contain predefined pulse sequences that can be efficiently represented using algebraic structures. These codebooks use sparse algebraic vectors with only a few non-zero pulses, allowing for fast search algorithms during encoding. The encoder searches through the codebook to find the excitation sequence that minimizes the perceptual error between the original and synthesized speech, using a weighted error criterion that accounts for human auditory perception.
Key components of the ACELP algorithm include the linear prediction analysis filter, the adaptive codebook (for pitch periodicity), the fixed algebraic codebook (for innovation), and the perceptual weighting filter. The adaptive codebook captures the long-term periodicity of voiced speech segments, while the algebraic codebook represents the remaining stochastic components. These elements work together in a closed-loop analysis-by-synthesis approach where the encoder simulates the decoder's operation to optimize the coding parameters.
In 3GPP networks, ACELP forms the core of several speech codecs including AMR (Adaptive Multi-Rate), AMR-WB (Wideband), and EVS (Enhanced Voice Services). The algorithm operates at various bit rates that can be dynamically adjusted based on network conditions, with typical rates ranging from 4.75 to 12.2 kbps for narrowband and 6.6 to 23.85 kbps for wideband applications. This flexibility allows network operators to balance speech quality against bandwidth requirements while maintaining robust performance under varying channel conditions.
Purpose & Motivation
ACELP was developed to address the fundamental challenge of transmitting high-quality speech over bandwidth-constrained mobile networks. Before ACELP-based codecs, earlier speech coding techniques like Regular Pulse Excitation (RPE) and Vector Sum Excited Linear Prediction (VSELP) offered limited quality at low bit rates or required excessive computational complexity. The mobile industry needed a solution that could deliver toll-quality speech while efficiently utilizing scarce radio spectrum resources.
The algebraic structure of ACELP's codebooks was specifically designed to reduce computational complexity compared to traditional stochastic codebooks, making real-time implementation feasible on mobile devices with limited processing power. This innovation allowed for more efficient codebook searches without sacrificing speech quality, enabling the widespread adoption of high-quality digital voice services in 2G, 3G, and subsequent mobile generations. The algorithm's flexibility to operate at multiple bit rates also supported adaptive rate control mechanisms that could respond to changing network conditions.
Furthermore, ACELP addressed the need for backward compatibility and interoperability across different network generations and regions. By providing a robust foundation for speech coding that could be enhanced through wider bandwidths and improved processing techniques, ACELP enabled the evolution from narrowband to wideband and super-wideband voice services while maintaining essential characteristics like low delay and graceful degradation under error conditions.
Classification
Evolution Across Releases
Introduced ACELP as the core algorithm for AMR-WB (Adaptive Multi-Rate Wideband) codec, extending speech bandwidth to 50-7000 Hz. This initial implementation supported bit rates from 6.60 to 23.85 kbps with nine discrete modes, providing significantly improved speech quality over narrowband codecs while maintaining backward compatibility through transcoding capabilities.
Explore further
Broader topics and technologies where ACELP plays a role.
Defining Specifications
3GPP specifications that define or reference ACELP, with the latest known release. Sourced from the 3GPP document catalog — see methodology.
| Specification | Title | Release |
|---|---|---|
| TR 21.905 vj00 | 3GPP Technical Terms and Definitions | Rel-19 |
| TS 23.782 vf00 | Interworking between LTE MC and non-LTE MC systems | Rel-15 |
| TS 26.071 vj00 | AMR Speech Codec Introduction | Rel-19 |
| TS 26.090 vj00 | AMR Speech Codec Detailed Mapping Specification | Rel-19 |
| TS 26.110 vj00 | 3G-324M Multimedia Codecs for Circuit Switched Networks | Rel-19 |
| TS 26.171 vj00 | Introduction to AMR-WB Speech Processing | Rel-19 |
| TS 26.190 vj00 | AMR-WB Speech Codec Detailed Mapping | Rel-19 |
| TS 26.253 vj00 | IVAS Codec Algorithmic Description | Rel-19 |
| TS 26.274 vj00 | AMR-WB+ Codec Conformance Testing Specification | Rel-19 |
| TS 26.290 vj00 | AMR-WB+ Audio Codec Specification | Rel-19 |
| TS 26.441 vj00 | EVS Audio Processing Introduction | Rel-19 |
| TS 26.442 vj00 | EVS Codec Fixed Point ANSI-C Code | Rel-19 |
| TS 26.443 vj00 | EVS Codec Floating-Point C Code | Rel-19 |
| TS 26.444 vj00 | EVS Codec Conformance Test Sequences | Rel-19 |
| TS 26.450 vj00 | EVS Codec DTX System Level Aspects | Rel-19 |
| TS 26.451 vj00 | EVS Codec Voice Activity Detector (VAD) Specification | Rel-19 |
| TS 26.452 vj00 | EVS Codec Fixed-Point C Code Implementation | Rel-19 |
| TR 26.952 vj00 | EVS Codec Selection, Verification & Characterization | Rel-19 |
| TS 46.051 vj00 | GSM Enhanced Full Rate Speech Processing Intro | Rel-19 |
| TS 46.060 vj00 | GSM Enhanced Full Rate Speech Codec | Rel-19 |