AAC

Advanced Audio Coding

Services
Introduced in Rel-8
Advanced Audio Coding (AAC) is a standardized, lossy audio compression codec defined by 3GPP for high-quality digital audio services. It provides superior sound quality at lower bitrates compared to older codecs like MP3, enabling efficient delivery of music, speech, and multimedia content over mobile networks. AAC is a foundational technology for streaming services, broadcast, and multimedia messaging in 3GPP systems.

Description

Advanced Audio Coding (AAC) is a perceptual audio coding algorithm standardized by the Moving Picture Experts Group (MPEG) as part of MPEG-2 and MPEG-4, and adopted by 3GPP for use in mobile multimedia services. It operates by exploiting the psychoacoustic properties of human hearing, such as frequency masking and temporal masking, to remove audio data that is imperceptible to listeners. The encoding process involves transforming time-domain audio signals into the frequency domain using a Modified Discrete Cosine Transform (MDCT), which allows for efficient representation of spectral components. Quantization and entropy coding are then applied to the spectral data, with bit allocation optimized based on a perceptual model to maximize audio quality at a given bitrate. The decoder reconstructs the audio signal by inverse transforming the quantized spectral coefficients, resulting in a high-fidelity output that closely approximates the original source.

AAC supports a wide range of sampling rates (from 8 kHz to 96 kHz) and bitrates (typically from 8 kbps to over 320 kbps), making it versatile for various applications from low-bitrate speech to high-quality music. Key profiles include AAC-LC (Low Complexity) for general-purpose audio, HE-AAC (High Efficiency AAC, also known as AAC+) which combines AAC with Spectral Band Replication (SBR) for enhanced efficiency at low bitrates, and HE-AAC v2 which adds Parametric Stereo (PS) for further bitrate reduction in stereo content. These profiles allow service providers to select the optimal trade-off between audio quality, bitrate, and computational complexity for different use cases.

Within the 3GPP architecture, AAC is specified as a mandatory or recommended codec for multiple services across various technical specifications (TS). It is integral to Packet-Switched Streaming Service (PSS), Multimedia Broadcast/Multicast Service (MBMS), and Multimedia Telephony Service for IMS (MTSI). The codec is encapsulated in transport formats such as 3GP and MP4 for file-based delivery or Real-time Transport Protocol (RTP) for streaming. 3GPP specifications define precise conformance points for encoder and decoder implementations, including test sequences and performance requirements to ensure interoperability between devices and networks. AAC's efficiency directly impacts network capacity and user experience by reducing the bandwidth required for audio services while maintaining high perceptual quality.

AAC's design incorporates several advanced techniques to achieve its performance. It uses a filter bank with higher frequency resolution than MP3, allowing for more precise control over quantization noise shaping. The Temporal Noise Shaping (TNS) tool mitigates pre-echo artifacts in transient signals by applying prediction in the frequency domain. Perceptual Noise Substitution (PNS) replaces noise-like signal components with parametric descriptions, saving bits. These tools, combined with sophisticated bitstream multiplexing and error resilience mechanisms, make AAC robust for error-prone mobile channels. The codec's modular structure also facilitates scalability and extensibility, supporting multi-channel audio configurations up to 48 channels and object-based audio in later evolutions.

Purpose & Motivation

AAC was developed to address the growing demand for high-quality digital audio services over bandwidth-constrained networks, particularly in mobile environments. Prior audio codecs like MP3 (MPEG-1 Audio Layer III), while revolutionary, had limitations in compression efficiency and audio quality at lower bitrates. As mobile networks evolved from 2G to 3G and beyond, enabling multimedia applications, there was a need for a codec that could deliver CD-like audio quality at significantly reduced bitrates to conserve scarce radio resources and reduce data costs for users. AAC was designed to be the successor to MP3, offering better sound quality at similar bitrates or equivalent quality at roughly 30% lower bitrates, making it ideal for music streaming, video soundtracks, and voice-enhanced services.

The adoption of AAC within 3GPP standards, starting from Release 8, was driven by the need for a unified, high-performance audio codec for packet-switched services. Earlier 3GPP releases relied on speech-centric codecs like AMR-NB/WB for voice and MP3 or AAC for music, but lacked a comprehensive, optimized solution for a wide range of audio content. AAC filled this gap by providing a single codec family capable of handling music, speech, and mixed content with high efficiency. Its standardization ensured interoperability across devices and networks, fostering the growth of mobile multimedia ecosystems. By reducing the bandwidth required for audio, AAC also alleviated network congestion and enabled service providers to offer higher quality experiences without proportional increases in infrastructure costs.

Furthermore, AAC's scalability and profile system allowed it to adapt to diverse network conditions and device capabilities. For instance, HE-AAC enabled reasonable audio quality at very low bitrates (e.g., 24 kbps for stereo), crucial for early 3G streaming and broadcast services like MBMS. This efficiency was pivotal in making music streaming and mobile TV viable over cellular networks. As mobile data consumption soared, AAC's role expanded to include high-definition audio services and immersive formats, supporting the evolution towards enriched media experiences. Its continued enhancement through 3GPP releases reflects the ongoing pursuit of optimal audio delivery in an era of ever-increasing quality expectations and network demands.

Key Features

  • Perceptual audio coding using MDCT for high frequency resolution
  • Support for multiple profiles: AAC-LC, HE-AAC (with SBR), HE-AAC v2 (with SBR and PS)
  • Wide bitrate range (8 kbps to 320+ kbps) and sampling rates up to 96 kHz
  • Advanced tools like Temporal Noise Shaping (TNS) and Perceptual Noise Substitution (PNS)
  • Scalability to multi-channel audio configurations (e.g., 5.1 surround)
  • Error resilience and robust bitstream syntax for mobile transmission

Evolution Across Releases

Rel-8 Initial

Introduced AAC as a standardized audio codec for 3GPP services, primarily for Packet-Switched Streaming (PSS) and Multimedia Broadcast/Multicast Service (MBMS). Specified AAC-LC and HE-AAC profiles to provide high-quality audio at various bitrates. Defined conformance and performance requirements in specs like TS 26.234 and TS 26.346, enabling efficient music streaming and audio delivery over HSPA networks.

Defining Specifications

SpecificationTitle
TS 26.117 3GPP TS 26.117
TS 26.119 3GPP TS 26.119
TS 26.140 3GPP TS 26.140
TS 26.141 3GPP TS 26.141
TS 26.143 3GPP TS 26.143
TS 26.234 3GPP TS 26.234
TS 26.244 3GPP TS 26.244
TS 26.401 3GPP TS 26.401
TS 26.402 3GPP TS 26.402
TS 26.403 3GPP TS 26.403
TS 26.405 3GPP TS 26.405
TS 26.410 3GPP TS 26.410
TS 26.411 3GPP TS 26.411
TS 26.926 3GPP TS 26.926
TS 26.955 3GPP TS 26.955
TS 26.956 3GPP TS 26.956
TS 26.998 3GPP TS 26.998