ILP (Inter-Layer Prediction) — 3GPP Glossary

Inter-Layer Prediction (ILP) is a scalable video coding technique that improves compression efficiency for layered bitstreams. It predicts enhancement layer data from a lower layer, typically the base layer, which reduces redundancy and bitrate while maintaining quality. It is the mechanism behind the scalable extensions of H.264/AVC (SVC) and HEVC (SHVC), and it matters most for video transmission over bandwidth-constrained mobile networks.

Description

Inter-Layer Prediction (ILP) is a core component of scalable video coding. It operates within a layered coding structure in which a video stream is encoded into a base layer and one or more enhancement layers. The base layer provides a basic quality representation at a lower bitrate, while each enhancement layer adds an incremental improvement in resolution, frame rate, or fidelity. ILP's primary function is to exploit the statistical dependencies between these layers to achieve higher compression efficiency.

Technically, ILP works by using the reconstructed information from the lower layer (e.g., base layer) to predict the data for the current enhancement layer being encoded. This prediction can involve various types of data, such as motion vectors, transform coefficients, or residual signals. For instance, motion vectors from the base layer can be upsampled and refined to serve as predictors for the enhancement layer, avoiding the need to transmit entirely new motion information. Similarly, texture prediction uses the upsampled base layer reconstruction as a reference, so the enhancement layer only needs to encode the residual difference. This process significantly reduces the bitrate required for the enhancement layers compared to independent encoding.
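The texture and motion prediction steps described above can be illustrated with a minimal sketch. It assumes 2x spatial scalability and uses nearest-neighbour upsampling purely for readability; real codecs use longer interpolation filters, and the function names here (upsample_2x, texture_residual, scale_motion_vector) are hypothetical, not taken from any specification.

```python
def upsample_2x(plane):
    """Nearest-neighbour 2x upsampling of a 2-D list of samples (illustrative only)."""
    out = []
    for row in plane:
        wide = [s for s in row for _ in range(2)]  # repeat each sample horizontally
        out.append(wide)
        out.append(list(wide))                     # repeat each row vertically
    return out


def texture_residual(enh_plane, base_recon):
    """Residual the enhancement layer codes when predicted from the upsampled base."""
    pred = upsample_2x(base_recon)
    return [[e - p for e, p in zip(e_row, p_row)]
            for e_row, p_row in zip(enh_plane, pred)]


def scale_motion_vector(mv, ratio=2):
    """Scale a base layer motion vector (dx, dy) to enhancement layer resolution."""
    return (mv[0] * ratio, mv[1] * ratio)


# Toy data: a 2x2 reconstructed base layer block and the 4x4 enhancement block it covers.
base_recon = [[10, 20],
              [30, 40]]
enh_plane = [[11, 10, 21, 20],
             [10,  9, 20, 22],
             [30, 31, 40, 41],
             [29, 30, 39, 40]]

residual = texture_residual(enh_plane, base_recon)
mv_pred = scale_motion_vector((3, -1))
```

In this toy case every residual sample has magnitude 2 or less, while the enhancement samples themselves range from 9 to 41. Small residuals of this kind are what make the enhancement layer cheaper to code than an independent encode.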

ILP is built into a scalable video codec's layered coding tools. Key components include the layer prediction modules, upsampling filters for spatial scalability, and mechanisms for signaling prediction modes. The decoder forms the same prediction from the base layer reconstruction and adds the decoded residual to reconstruct the enhancement layer. ILP is particularly useful where network conditions vary, such as video conferencing or streaming. Because all layers are carried in one layered bitstream, quality can be adapted by adding or dropping enhancement layers, without the overhead of multiple independent encodes.
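The decoder side can be sketched in the same spirit. The example below assumes a per-block prediction mode is signaled in the bitstream; the mode names "inter_layer" and "intra_layer", the 8-bit sample range, and the function names are assumptions made for illustration and do not reflect actual bitstream syntax.

```python
def upsample_2x(plane):
    """Nearest-neighbour 2x upsampling (stand-in for a real interpolation filter)."""
    out = []
    for row in plane:
        wide = [s for s in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def clip8(value):
    """Clamp a sample to the 8-bit range assumed in this sketch."""
    return max(0, min(255, value))


def reconstruct_block(mode, residual, base_recon=None, intra_pred=None):
    """Rebuild an enhancement layer block from its signaled mode and decoded residual."""
    if mode == "inter_layer":
        pred = upsample_2x(base_recon)   # predict from the lower layer
    elif mode == "intra_layer":
        pred = intra_pred                # predict from within the same layer
    else:
        raise ValueError(f"unknown prediction mode: {mode}")
    return [[clip8(p + r) for p, r in zip(p_row, r_row)]
            for p_row, r_row in zip(pred, residual)]


base_recon = [[10, 20],
              [30, 40]]
residual = [[1, 0, 1, 0],
            [0, -1, 0, 2],
            [0, 1, 0, 1],
            [-1, 0, -1, 0]]
enh_recon = reconstruct_block("inter_layer", residual, base_recon=base_recon)
```

The key design point is that encoder and decoder must derive an identical prediction, which is why the prediction is formed from the reconstructed base layer rather than from the original source samples.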

Purpose & Motivation

ILP was introduced to address the growing demand for high-quality video services over mobile networks with limited and variable bandwidth. Without it, each layer of a multi-quality stream must be encoded independently, so information already carried by the base layer is sent again in every enhancement layer. This redundancy adds significant bitrate overhead and makes it harder to deliver adaptive video streams efficiently, especially as video resolutions and frame rates increase.

ILP was motivated by the need for a more efficient scalable coding framework that could support robust video transmission in 3GPP networks. By exploiting inter-layer dependencies, ILP reduces the bitrate required for enhancement layers, enabling higher quality video at a given bitrate and more graceful quality degradation under network congestion. This matters for applications such as video calling, streaming, and broadcast services, where bandwidth optimization and quality adaptation directly affect user experience and network resource management.

Key Features

Enables efficient prediction of enhancement layer data from base layer information
Reduces bitrate overhead in scalable video coding by exploiting inter-layer redundancy
Supports various prediction types including motion vector prediction and texture prediction
Addressed in 3GPP TS 26.948 as a scalable video coding tool
Facilitates adaptive bitrate streaming by enabling seamless layer switching
Improves video quality and bandwidth utilization in mobile network environments
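The layer switching mentioned in the list above can be sketched as a simple selection rule. Because each enhancement layer is predicted from the layers below it, layers can only be kept cumulatively, starting from the base. The bitrates and the function name select_layers are hypothetical values chosen for illustration.

```python
def select_layers(layer_bitrates_kbps, available_kbps):
    """Return (number of layers kept, total bitrate), keeping layers base-first.

    Layers are cumulative: an enhancement layer is only usable if every
    layer below it is also delivered, so selection stops at the first
    layer that does not fit in the budget.
    """
    total = 0
    count = 0
    for rate in layer_bitrates_kbps:
        if total + rate > available_kbps:
            break
        total += rate
        count += 1
    return count, total


# Hypothetical stream: base layer plus two enhancement layers.
layers = [500, 300, 700]

good_link = select_layers(layers, 1500)   # all three layers fit
medium_link = select_layers(layers, 1000) # base plus first enhancement layer
poor_link = select_layers(layers, 400)    # not even the base layer fits
```

A returned count of zero signals that the link cannot sustain the base layer, which a sender would have to handle separately, for example by re-encoding the base layer at a lower rate.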

Evolution Across Releases

Rel-13 Initial

Introduced Inter-Layer Prediction for scalable video coding. The initial architecture included basic prediction mechanisms for motion and texture data between base and enhancement layers to improve compression efficiency.

TS 26.948

Defining Specifications

Specification	Title
TS 26.948	3GPP TS 26.948