What is MCT? Multi-channel Coding Tool

Description

The Multi-channel Coding Tool (MCT) is a standardized audio codec component defined in 3GPP specifications, designed to handle the compression and transmission of multi-channel audio signals. It operates by encoding multiple discrete audio channels into a single, unified bitstream, which is then transmitted over the network. The architecture of MCT integrates with existing 3GPP media delivery frameworks, such as those defined for the Enhanced Voice Services (EVS) codec or future immersive audio codecs, allowing it to be used in conjunction with other audio processing modules like bandwidth extension and error concealment. Key components include a channel encoder that applies perceptual coding principles to reduce redundancy between channels, a spatial audio parameter extractor that captures directional information, and a bitstream multiplexer that combines all data for efficient packetization. Its role in the network is primarily within the media plane of the User Equipment (UE) and application servers, where it processes audio before transmission or after reception, ensuring that immersive audio applications can function with the high quality and low latency required for real-time interactions. The tool supports various channel configurations, such as 5.1 surround sound or Ambisonics formats, making it versatile for different use cases from entertainment to communication. By standardizing this tool, 3GPP ensures interoperability across devices and networks, facilitating widespread adoption of advanced audio services in 5G and beyond.

Purpose & Motivation

MCT was created to address the growing demand for immersive audio experiences in mobile networks, driven by applications like Extended Reality (XR), virtual meetings, and high-fidelity entertainment. Prior to its introduction, 3GPP audio codecs like EVS were optimized primarily for mono or stereo channels, lacking efficient native support for multi-channel spatial audio, which often required proprietary solutions or inefficient transmission of multiple independent streams. This limitation hindered the quality and scalability of immersive services, as they consumed excessive bandwidth and suffered from interoperability issues. The development of MCT in Rel-18 was motivated by the need to standardize a tool that could encode multiple audio channels with high compression efficiency, reducing network load while maintaining perceptual quality. It solves problems related to bandwidth constraints in 5G networks, where data-intensive applications like XR require optimized media handling to deliver seamless user experiences. Historically, as 3GPP evolved from voice-centric to media-rich services, the addition of MCT reflects a shift towards supporting complex audio environments, enabling new revenue streams for operators and enhanced functionality for end-users in areas such as gaming, telepresence, and augmented reality.

Classification

Related approaches

Evolution Across Releases

Rel-18 Initial

Introduced the Multi-channel Coding Tool with initial architecture supporting basic multi-channel audio encoding for immersive services. It defined the core components, including channel encoding and multiplexing, integrated into 3GPP specifications for audio codecs, targeting applications like XR and enhanced voice calls.

TS 26.253 TS 26.254 TS 26.255

Explore further

Broader topics and technologies where MCT plays a role.

Topics

SON (Self-Organizing Networks)Mission Critical (MCPTT)Lawful Intercept Services & Applications Radio Access Network

Technologies

Defining Specifications

3GPP specifications that define or reference MCT, with the latest known release. Sourced from the 3GPP document catalog — see methodology.

Specification	Title	Release
TS 26.253 vj00	IVAS Codec Algorithmic Description	Rel-19
TS 26.254 vj00	IVAS Rendering Functions Specification	Rel-19
TS 26.255 vj00	IVAS Frame Loss Concealment Procedure	Rel-19