What is DCT? Discrete Cosine Transformation

Description

The Discrete Cosine Transformation (DCT) is a lossy compression technique central to many multimedia codecs standardized by 3GPP. It operates by taking a block of pixel data (for video) or a window of audio samples and transforming this data from the spatial or time domain into the frequency domain. This transformation results in a set of coefficients representing different frequency components. The human visual and auditory systems are less sensitive to high-frequency details, allowing these higher-frequency coefficients to be quantized more coarsely or even set to zero with minimal perceived quality loss. This selective discarding of information is the primary mechanism for achieving high compression ratios.

In the context of 3GPP specifications like TS 26.110 (Codec for circuit-switched multimedia telephony service) and TS 26.234 (Transparent end-to-end packet-switched streaming service), DCT forms the core of codecs such as H.263, MPEG-4 Part 2, and aspects of the AMR-WB+ audio codec. The process typically involves dividing an image frame into macroblocks (e.g., 8x8 or 16x16 pixels), applying the DCT to each block, quantizing the resulting coefficients using a quantization matrix, and then encoding the quantized values using entropy coding techniques like Huffman or arithmetic coding. The inverse DCT (IDCT) is applied at the decoder to reconstruct an approximation of the original block.

Key architectural components involving DCT include the encoder's transform and quantization modules and the decoder's inverse quantization and inverse transform modules. Its role in the network is to enable efficient use of scarce radio and transport resources by drastically reducing the size of multimedia content without a proportionate loss in subjective quality. This efficiency is critical for delivering video telephony, mobile TV, and streaming services over bandwidth-constrained cellular networks, directly impacting user experience and network capacity.

Purpose & Motivation

DCT was incorporated into 3GPP standards to address the fundamental challenge of delivering multimedia services over mobile networks with limited and expensive bandwidth. Prior to efficient compression, transmitting raw video or high-fidelity audio was impractical due to the massive data rates required. The purpose of DCT is to perform perceptual coding, exploiting the psychoacoustic and psychovisual properties of human perception to discard information that will be least noticed, thereby creating a much smaller, transmittable bitstream.

The historical context stems from the evolution of digital video and audio compression standards developed in the 1980s and 1990s, such as JPEG and MPEG-1/2, which established DCT as a proven, effective method. 3GPP adopted and specified these techniques to enable multimedia services for 3G (UMTS) and beyond. Without DCT-based compression, services like video calling and mobile TV would have been impossible on early 3G networks, or would have consumed an untenable share of network resources, hindering mass-market adoption. It solved the problem of fitting high-bitrate media into low-bitrate, error-prone wireless channels.

Detected Changes Across Releases

from 3GPP Change Requests

Specific changes extracted from the „Change history“ tables of 3GPP specifications (1 CRs across 1 releases). Complements the general historical overview above with the evidence-based evolution of this function.

Studied in Rel-8, normative work from Rel-18.

Rel-18 1 change

In Release 18, the primary update related to DCT was within the context of enhancing multimedia codec support, specifically through the addition of IVAS codec support. The DCT function itself remains the established method for spatial redundancy reduction by converting input signals from the time domain to the frequency domain within video codecs. This change builds upon the existing framework where codecs like MPEG-4 Visual utilize DCT as part of their compression techniques for mobile communications.

Adding IVAS codec support TS 26.143CR0001

Explore further

Broader topics and technologies where DCT plays a role.

Topics

MEC (Edge Computing)Lawful Intercept UMTS / WCDMA Services & Applications Radio Access Network Core Network

Defining Specifications

3GPP specifications that define or reference DCT, with the latest known release. Sourced from the 3GPP document catalog — see methodology.

Specification	Title	Release
TS 26.110 vj00	3G-324M Multimedia Codecs for Circuit Switched Networks	Rel-19
TS 26.143 vj00	5G Messaging Media Types and Codecs	Rel-19
TS 26.234 vj00	3GPP PSS Protocols and Codecs Specification	Rel-19
TR 26.906 vj00	HEVC Evaluation for 3GPP Services	Rel-19