Description
Algebraic Code-Excited Linear Prediction (ACELP) is a sophisticated speech coding algorithm that combines linear prediction with codebook-based excitation. The algorithm works by analyzing speech signals in frames, typically 20-30 milliseconds, and separating them into two main components: the spectral envelope (representing the vocal tract filter) and the excitation signal (representing the glottal pulses). The linear prediction component models the vocal tract using a set of linear prediction coefficients that are transformed into Line Spectral Pairs (LSPs) for more efficient quantization and transmission.
The excitation signal in ACELP is generated using algebraic codebooks, which contain predefined pulse sequences that can be efficiently represented using algebraic structures. These codebooks use sparse algebraic vectors with only a few non-zero pulses, allowing for fast search algorithms during encoding. The encoder searches through the codebook to find the excitation sequence that minimizes the perceptual error between the original and synthesized speech, using a weighted error criterion that accounts for human auditory perception.
Key components of the ACELP algorithm include the linear prediction analysis filter, the adaptive codebook (for pitch periodicity), the fixed algebraic codebook (for innovation), and the perceptual weighting filter. The adaptive codebook captures the long-term periodicity of voiced speech segments, while the algebraic codebook represents the remaining stochastic components. These elements work together in a closed-loop analysis-by-synthesis approach where the encoder simulates the decoder's operation to optimize the coding parameters.
In 3GPP networks, ACELP forms the core of several speech codecs including AMR (Adaptive Multi-Rate), AMR-WB (Wideband), and EVS (Enhanced Voice Services). The algorithm operates at various bit rates that can be dynamically adjusted based on network conditions, with typical rates ranging from 4.75 to 12.2 kbps for narrowband and 6.6 to 23.85 kbps for wideband applications. This flexibility allows network operators to balance speech quality against bandwidth requirements while maintaining robust performance under varying channel conditions.
Purpose & Motivation
ACELP was developed to address the fundamental challenge of transmitting high-quality speech over bandwidth-constrained mobile networks. Before ACELP-based codecs, earlier speech coding techniques like Regular Pulse Excitation (RPE) and Vector Sum Excited Linear Prediction (VSELP) offered limited quality at low bit rates or required excessive computational complexity. The mobile industry needed a solution that could deliver toll-quality speech while efficiently utilizing scarce radio spectrum resources.
The algebraic structure of ACELP's codebooks was specifically designed to reduce computational complexity compared to traditional stochastic codebooks, making real-time implementation feasible on mobile devices with limited processing power. This innovation allowed for more efficient codebook searches without sacrificing speech quality, enabling the widespread adoption of high-quality digital voice services in 2G, 3G, and subsequent mobile generations. The algorithm's flexibility to operate at multiple bit rates also supported adaptive rate control mechanisms that could respond to changing network conditions.
Furthermore, ACELP addressed the need for backward compatibility and interoperability across different network generations and regions. By providing a robust foundation for speech coding that could be enhanced through wider bandwidths and improved processing techniques, ACELP enabled the evolution from narrowband to wideband and super-wideband voice services while maintaining essential characteristics like low delay and graceful degradation under error conditions.
Key Features
- Algebraic codebook structure with sparse pulse vectors for efficient representation
- Closed-loop analysis-by-synthesis optimization for perceptual quality
- Adaptive multi-rate operation supporting bit rates from 4.75 to 23.85 kbps
- Linear prediction with LSP quantization for spectral envelope coding
- Integrated adaptive codebook for pitch periodicity modeling
- Perceptual weighting filter based on human auditory characteristics
Evolution Across Releases
Introduced ACELP as the core algorithm for AMR-WB (Adaptive Multi-Rate Wideband) codec, extending speech bandwidth to 50-7000 Hz. This initial implementation supported bit rates from 6.60 to 23.85 kbps with nine discrete modes, providing significantly improved speech quality over narrowband codecs while maintaining backward compatibility through transcoding capabilities.
Defining Specifications
| Specification | Title |
|---|---|
| TS 21.905 | 3GPP TS 21.905 |
| TS 23.782 | 3GPP TS 23.782 |
| TS 26.071 | 3GPP TS 26.071 |
| TS 26.090 | 3GPP TS 26.090 |
| TS 26.110 | 3GPP TS 26.110 |
| TS 26.171 | 3GPP TS 26.171 |
| TS 26.190 | 3GPP TS 26.190 |
| TS 26.253 | 3GPP TS 26.253 |
| TS 26.274 | 3GPP TS 26.274 |
| TS 26.290 | 3GPP TS 26.290 |
| TS 26.441 | 3GPP TS 26.441 |
| TS 26.442 | 3GPP TS 26.442 |
| TS 26.443 | 3GPP TS 26.443 |
| TS 26.444 | 3GPP TS 26.444 |
| TS 26.450 | 3GPP TS 26.450 |
| TS 26.451 | 3GPP TS 26.451 |
| TS 26.452 | 3GPP TS 26.452 |
| TS 26.952 | 3GPP TS 26.952 |
| TS 46.051 | 3GPP TR 46.051 |
| TS 46.060 | 3GPP TR 46.060 |