Description
Advanced Audio Coding (AAC) is a perceptual audio coding algorithm standardized by the Moving Picture Experts Group (MPEG) as part of MPEG-2 and MPEG-4, and adopted by 3GPP for use in mobile multimedia services. It operates by exploiting the psychoacoustic properties of human hearing, such as frequency masking and temporal masking, to remove audio data that is imperceptible to listeners. The encoding process involves transforming time-domain audio signals into the frequency domain using a Modified Discrete Cosine Transform (MDCT), which allows for efficient representation of spectral components. Quantization and entropy coding are then applied to the spectral data, with bit allocation optimized based on a perceptual model to maximize audio quality at a given bitrate. The decoder reconstructs the audio signal by inverse transforming the quantized spectral coefficients, resulting in a high-fidelity output that closely approximates the original source.
AAC supports a wide range of sampling rates (from 8 kHz to 96 kHz) and bitrates (typically from 8 kbps to over 320 kbps), making it versatile for various applications from low-bitrate speech to high-quality music. Key profiles include AAC-LC (Low Complexity) for general-purpose audio, HE-AAC (High Efficiency AAC, also known as AAC+) which combines AAC with Spectral Band Replication (SBR) for enhanced efficiency at low bitrates, and HE-AAC v2 which adds Parametric Stereo (PS) for further bitrate reduction in stereo content. These profiles allow service providers to select the optimal trade-off between audio quality, bitrate, and computational complexity for different use cases.
Within the 3GPP architecture, AAC is specified as a mandatory or recommended codec for multiple services across various technical specifications (TS). It is integral to Packet-Switched Streaming Service (PSS), Multimedia Broadcast/Multicast Service (MBMS), and Multimedia Telephony Service for IMS (MTSI). The codec is encapsulated in transport formats such as 3GP and MP4 for file-based delivery or Real-time Transport Protocol (RTP) for streaming. 3GPP specifications define precise conformance points for encoder and decoder implementations, including test sequences and performance requirements to ensure interoperability between devices and networks. AAC's efficiency directly impacts network capacity and user experience by reducing the bandwidth required for audio services while maintaining high perceptual quality.
AAC's design incorporates several advanced techniques to achieve its performance. It uses a filter bank with higher frequency resolution than MP3, allowing for more precise control over quantization noise shaping. The Temporal Noise Shaping (TNS) tool mitigates pre-echo artifacts in transient signals by applying prediction in the frequency domain. Perceptual Noise Substitution (PNS) replaces noise-like signal components with parametric descriptions, saving bits. These tools, combined with sophisticated bitstream multiplexing and error resilience mechanisms, make AAC robust for error-prone mobile channels. The codec's modular structure also facilitates scalability and extensibility, supporting multi-channel audio configurations up to 48 channels and object-based audio in later evolutions.
Purpose & Motivation
AAC was developed to address the growing demand for high-quality digital audio services over bandwidth-constrained networks, particularly in mobile environments. Prior audio codecs like MP3 (MPEG-1 Audio Layer III), while revolutionary, had limitations in compression efficiency and audio quality at lower bitrates. As mobile networks evolved from 2G to 3G and beyond, enabling multimedia applications, there was a need for a codec that could deliver CD-like audio quality at significantly reduced bitrates to conserve scarce radio resources and reduce data costs for users. AAC was designed to be the successor to MP3, offering better sound quality at similar bitrates or equivalent quality at roughly 30% lower bitrates, making it ideal for music streaming, video soundtracks, and voice-enhanced services.
The adoption of AAC within 3GPP standards, starting from Release 8, was driven by the need for a unified, high-performance audio codec for packet-switched services. Earlier 3GPP releases relied on speech-centric codecs like AMR-NB/WB for voice and MP3 or AAC for music, but lacked a comprehensive, optimized solution for a wide range of audio content. AAC filled this gap by providing a single codec family capable of handling music, speech, and mixed content with high efficiency. Its standardization ensured interoperability across devices and networks, fostering the growth of mobile multimedia ecosystems. By reducing the bandwidth required for audio, AAC also alleviated network congestion and enabled service providers to offer higher quality experiences without proportional increases in infrastructure costs.
Furthermore, AAC's scalability and profile system allowed it to adapt to diverse network conditions and device capabilities. For instance, HE-AAC enabled reasonable audio quality at very low bitrates (e.g., 24 kbps for stereo), crucial for early 3G streaming and broadcast services like MBMS. This efficiency was pivotal in making music streaming and mobile TV viable over cellular networks. As mobile data consumption soared, AAC's role expanded to include high-definition audio services and immersive formats, supporting the evolution towards enriched media experiences. Its continued enhancement through 3GPP releases reflects the ongoing pursuit of optimal audio delivery in an era of ever-increasing quality expectations and network demands.
Classification
Detected Changes Across Releases
from 3GPP Change RequestsSpecific changes extracted from the „Change history“ tables of 3GPP specifications (7 CRs across 2 releases). Complements the general historical overview above with the evidence-based evolution of this function.
Studied in Rel-8, normative work from Rel-18.
In Release 18, the Advanced Audio Coding (AAC) function was updated with the introduction of a new xHE-AAC stereo audio operation point. This release also included corrections and additions to audio aspects, specifically addressing stereo mapping for HE-AAC and xHE-AAC to DASH adaptation sets and correcting the xHE-AAC codecs parameter syntax for signaling. Furthermore, amendments were made to the audio encapsulation specifications for ISO BMFF, 3GP file format, CMAF, and DASH.
- Introduction of xHE-AAC stereo audio operation point TS 26.117CR0001
- On MeCAR Audio capabilities TS 26.117CR0005
- Corrections and additions on audio aspects TS 26.998CR0002
- Correction to HE-AAC and xHE-AAC stereo mapping to DASH adaptation set TS 26.117CR0010
- Correction to xHE-AAC codecs parameter, AMR and EVS capability and media type signalling syntax TS 26.143CR0003
- [5GMS3] Correction on Audio encapsulation in ISO BMFF, 3GP file format, CMAF and DASH TS 26.117CR0008
In Release 19, the changes for Advanced Audio Coding (AAC) introduced new video coding capabilities and operating points, as indicated by the Change Request title focusing on video coding. However, the provided grounding context for the specification does not contain any new technical details, procedures, or capabilities specifically for AAC that differ from the previous release.
- [VOPS] On video coding capabilities and operating points TS 26.143CR0004
Explore further
Broader topics and technologies where AAC plays a role.
Defining Specifications
3GPP specifications that define or reference AAC, with the latest known release. Sourced from the 3GPP document catalog — see methodology.
| Specification | Title | Release |
|---|---|---|
| TS 26.117 vj00 | 5G Media Streaming Speech/Audio Capabilities | Rel-19 |
| TS 26.119 vj00 | XR Media Capabilities for AR Devices | Rel-19 |
| TS 26.140 vj00 | MMS Media Formats and Codecs Specification | Rel-19 |
| TS 26.141 vj00 | IMS Messaging & Presence Media Formats | Rel-19 |
| TS 26.143 vj00 | 5G Messaging Media Types and Codecs | Rel-19 |
| TS 26.234 vj00 | 3GPP PSS Protocols and Codecs Specification | Rel-19 |
| TS 26.244 vj00 | 3GPP File Format (3GP) Specification | Rel-19 |
| TS 26.401 vj00 | Enhanced aacPlus Audio Codec Mapping | Rel-19 |
| TS 26.402 vj00 | Enhanced aacPlus Error Concealment & Processing | Rel-19 |
| TS 26.403 vj00 | Enhanced aacPlus AAC Encoder Specification | Rel-19 |
| TS 26.405 vj00 | Parametric Stereo Encoder for Enhanced aacPlus | Rel-19 |
| TS 26.410 vj00 | Enhanced aacPlus Floating-Point ANSI-C Code | Rel-19 |
| TS 26.411 vj00 | Enhanced aacPlus Fixed-Point ANSI-C Code | Rel-19 |
| TR 26.926 vj00 | Traffic Models & Quality Evaluation for Media/XR in 5G | Rel-19 |
| TR 26.955 vj00 | Video Codec Analysis for 5G Services | Rel-19 |
| TR 26.956 vj01 | Beyond 2D Video Formats & Codecs Study | Rel-19 |
| TR 26.998 vj00 | 5G AR/MR Glasses Integration Study | Rel-19 |