Description
The Multi-channel Coding Tool (MCT) is a standardized audio codec component defined in 3GPP specifications, designed to handle the compression and transmission of multi-channel audio signals. It operates by encoding multiple discrete audio channels into a single, unified bitstream, which is then transmitted over the network. The architecture of MCT integrates with existing 3GPP media delivery frameworks, such as those defined for the Enhanced Voice Services (EVS) codec or future immersive audio codecs, allowing it to be used in conjunction with other audio processing modules like bandwidth extension and error concealment. Key components include a channel encoder that applies perceptual coding principles to reduce redundancy between channels, a spatial audio parameter extractor that captures directional information, and a bitstream multiplexer that combines all data for efficient packetization. Its role in the network is primarily within the media plane of the User Equipment (UE) and application servers, where it processes audio before transmission or after reception, ensuring that immersive audio applications can function with the high quality and low latency required for real-time interactions. The tool supports various channel configurations, such as 5.1 surround sound or Ambisonics formats, making it versatile for different use cases from entertainment to communication. By standardizing this tool, 3GPP ensures interoperability across devices and networks, facilitating widespread adoption of advanced audio services in 5G and beyond.
Purpose & Motivation
MCT was created to address the growing demand for immersive audio experiences in mobile networks, driven by applications like Extended Reality (XR), virtual meetings, and high-fidelity entertainment. Prior to its introduction, 3GPP audio codecs like EVS were optimized primarily for mono or stereo channels, lacking efficient native support for multi-channel spatial audio, which often required proprietary solutions or inefficient transmission of multiple independent streams. This limitation hindered the quality and scalability of immersive services, as they consumed excessive bandwidth and suffered from interoperability issues. The development of MCT in Rel-18 was motivated by the need to standardize a tool that could encode multiple audio channels with high compression efficiency, reducing network load while maintaining perceptual quality. It solves problems related to bandwidth constraints in 5G networks, where data-intensive applications like XR require optimized media handling to deliver seamless user experiences. Historically, as 3GPP evolved from voice-centric to media-rich services, the addition of MCT reflects a shift towards supporting complex audio environments, enabling new revenue streams for operators and enhanced functionality for end-users in areas such as gaming, telepresence, and augmented reality.
Key Features
- Efficient encoding of multiple audio channels into a single bitstream
- Support for various channel configurations (e.g., 5.1, Ambisonics)
- Integration with 3GPP audio codecs like EVS for backward compatibility
- Perceptual coding techniques to reduce inter-channel redundancy
- Low-latency processing suitable for real-time communication
- Standardized bitstream format ensuring interoperability across devices
Evolution Across Releases
Introduced the Multi-channel Coding Tool with initial architecture supporting basic multi-channel audio encoding for immersive services. It defined the core components, including channel encoding and multiplexing, integrated into 3GPP specifications for audio codecs, targeting applications like XR and enhanced voice calls.
Defining Specifications
| Specification | Title |
|---|---|
| TS 26.253 | 3GPP TS 26.253 |
| TS 26.254 | 3GPP TS 26.254 |
| TS 26.255 | 3GPP TS 26.255 |