AAC

Advanced Audio Coding

Services →
Introduced in Rel-8

AAC is a standardized, lossy audio compression codec defined by 3GPP for high-quality digital audio services, providing superior sound quality at lower bitrates for efficient delivery over mobile networks.

Category
Services
Introduced
Rel-8
Where
Services › Codecs
Specifications
17 specs
AAC Description Purpose Related Classification Detected Changes Specifications

Description

Advanced Audio Coding (AAC) is a perceptual audio coding algorithm standardized by the Moving Picture Experts Group (MPEG) as part of MPEG-2 and MPEG-4, and adopted by 3GPP for use in mobile multimedia services. It operates by exploiting the psychoacoustic properties of human hearing, such as frequency masking and temporal masking, to remove audio data that is imperceptible to listeners. The encoding process involves transforming time-domain audio signals into the frequency domain using a Modified Discrete Cosine Transform (MDCT), which allows for efficient representation of spectral components. Quantization and entropy coding are then applied to the spectral data, with bit allocation optimized based on a perceptual model to maximize audio quality at a given bitrate. The decoder reconstructs the audio signal by inverse transforming the quantized spectral coefficients, resulting in a high-fidelity output that closely approximates the original source.

AAC supports a wide range of sampling rates (from 8 kHz to 96 kHz) and bitrates (typically from 8 kbps to over 320 kbps), making it versatile for various applications from low-bitrate speech to high-quality music. Key profiles include AAC-LC (Low Complexity) for general-purpose audio, HE-AAC (High Efficiency AAC, also known as AAC+) which combines AAC with Spectral Band Replication (SBR) for enhanced efficiency at low bitrates, and HE-AAC v2 which adds Parametric Stereo (PS) for further bitrate reduction in stereo content. These profiles allow service providers to select the optimal trade-off between audio quality, bitrate, and computational complexity for different use cases.

Within the 3GPP architecture, AAC is specified as a mandatory or recommended codec for multiple services across various technical specifications (TS). It is integral to Packet-Switched Streaming Service (PSS), Multimedia Broadcast/Multicast Service (MBMS), and Multimedia Telephony Service for IMS (MTSI). The codec is encapsulated in transport formats such as 3GP and MP4 for file-based delivery or Real-time Transport Protocol (RTP) for streaming. 3GPP specifications define precise conformance points for encoder and decoder implementations, including test sequences and performance requirements to ensure interoperability between devices and networks. AAC's efficiency directly impacts network capacity and user experience by reducing the bandwidth required for audio services while maintaining high perceptual quality.

AAC's design incorporates several advanced techniques to achieve its performance. It uses a filter bank with higher frequency resolution than MP3, allowing for more precise control over quantization noise shaping. The Temporal Noise Shaping (TNS) tool mitigates pre-echo artifacts in transient signals by applying prediction in the frequency domain. Perceptual Noise Substitution (PNS) replaces noise-like signal components with parametric descriptions, saving bits. These tools, combined with sophisticated bitstream multiplexing and error resilience mechanisms, make AAC robust for error-prone mobile channels. The codec's modular structure also facilitates scalability and extensibility, supporting multi-channel audio configurations up to 48 channels and object-based audio in later evolutions.

Purpose & Motivation

AAC was developed to address the growing demand for high-quality digital audio services over bandwidth-constrained networks, particularly in mobile environments. Prior audio codecs like MP3 (MPEG-1 Audio Layer III), while revolutionary, had limitations in compression efficiency and audio quality at lower bitrates. As mobile networks evolved from 2G to 3G and beyond, enabling multimedia applications, there was a need for a codec that could deliver CD-like audio quality at significantly reduced bitrates to conserve scarce radio resources and reduce data costs for users. AAC was designed to be the successor to MP3, offering better sound quality at similar bitrates or equivalent quality at roughly 30% lower bitrates, making it ideal for music streaming, video soundtracks, and voice-enhanced services.

The adoption of AAC within 3GPP standards, starting from Release 8, was driven by the need for a unified, high-performance audio codec for packet-switched services. Earlier 3GPP releases relied on speech-centric codecs like AMR-NB/WB for voice and MP3 or AAC for music, but lacked a comprehensive, optimized solution for a wide range of audio content. AAC filled this gap by providing a single codec family capable of handling music, speech, and mixed content with high efficiency. Its standardization ensured interoperability across devices and networks, fostering the growth of mobile multimedia ecosystems. By reducing the bandwidth required for audio, AAC also alleviated network congestion and enabled service providers to offer higher quality experiences without proportional increases in infrastructure costs.

Furthermore, AAC's scalability and profile system allowed it to adapt to diverse network conditions and device capabilities. For instance, HE-AAC enabled reasonable audio quality at very low bitrates (e.g., 24 kbps for stereo), crucial for early 3G streaming and broadcast services like MBMS. This efficiency was pivotal in making music streaming and mobile TV viable over cellular networks. As mobile data consumption soared, AAC's role expanded to include high-definition audio services and immersive formats, supporting the evolution towards enriched media experiences. Its continued enhancement through 3GPP releases reflects the ongoing pursuit of optimal audio delivery in an era of ever-increasing quality expectations and network demands.

Classification

Part ofPSS
Specific typesAAC-LTP
Related approachesAMR-WBEVS

Detected Changes Across Releases

from 3GPP Change Requests

Specific changes extracted from the „Change history“ tables of 3GPP specifications (7 CRs across 2 releases). Complements the general historical overview above with the evidence-based evolution of this function.

Studied in Rel-8, normative work from Rel-18.

Rel-18 6 changes

In Release 18, the Advanced Audio Coding (AAC) function was updated with the introduction of a new xHE-AAC stereo audio operation point. This release also included corrections and additions to audio aspects, specifically addressing stereo mapping for HE-AAC and xHE-AAC to DASH adaptation sets and correcting the xHE-AAC codecs parameter syntax for signaling. Furthermore, amendments were made to the audio encapsulation specifications for ISO BMFF, 3GP file format, CMAF, and DASH.

  • Introduction of xHE-AAC stereo audio operation point TS 26.117CR0001
  • On MeCAR Audio capabilities TS 26.117CR0005
  • Corrections and additions on audio aspects TS 26.998CR0002
  • Correction to HE-AAC and xHE-AAC stereo mapping to DASH adaptation set TS 26.117CR0010
  • Correction to xHE-AAC codecs parameter, AMR and EVS capability and media type signalling syntax TS 26.143CR0003
  • [5GMS3] Correction on Audio encapsulation in ISO BMFF, 3GP file format, CMAF and DASH TS 26.117CR0008
Rel-19 1 change

In Release 19, the changes for Advanced Audio Coding (AAC) introduced new video coding capabilities and operating points, as indicated by the Change Request title focusing on video coding. However, the provided grounding context for the specification does not contain any new technical details, procedures, or capabilities specifically for AAC that differ from the previous release.

  • [VOPS] On video coding capabilities and operating points TS 26.143CR0004

Explore further

Broader topics and technologies where AAC plays a role.

Defining Specifications

3GPP specifications that define or reference AAC, with the latest known release. Sourced from the 3GPP document catalog — see methodology.

SpecificationTitleRelease
TS 26.117 vj00 5G Media Streaming Speech/Audio Capabilities Rel-19
TS 26.119 vj00 XR Media Capabilities for AR Devices Rel-19
TS 26.140 vj00 MMS Media Formats and Codecs Specification Rel-19
TS 26.141 vj00 IMS Messaging & Presence Media Formats Rel-19
TS 26.143 vj00 5G Messaging Media Types and Codecs Rel-19
TS 26.234 vj00 3GPP PSS Protocols and Codecs Specification Rel-19
TS 26.244 vj00 3GPP File Format (3GP) Specification Rel-19
TS 26.401 vj00 Enhanced aacPlus Audio Codec Mapping Rel-19
TS 26.402 vj00 Enhanced aacPlus Error Concealment & Processing Rel-19
TS 26.403 vj00 Enhanced aacPlus AAC Encoder Specification Rel-19
TS 26.405 vj00 Parametric Stereo Encoder for Enhanced aacPlus Rel-19
TS 26.410 vj00 Enhanced aacPlus Floating-Point ANSI-C Code Rel-19
TS 26.411 vj00 Enhanced aacPlus Fixed-Point ANSI-C Code Rel-19
TR 26.926 vj00 Traffic Models & Quality Evaluation for Media/XR in 5G Rel-19
TR 26.955 vj00 Video Codec Analysis for 5G Services Rel-19
TR 26.956 vj01 Beyond 2D Video Formats & Codecs Study Rel-19
TR 26.998 vj00 5G AR/MR Glasses Integration Study Rel-19