RASL

Random Access Skipped Leading picture

Other
Introduced in Rel-12
A video coding technique in 3GPP's Enhanced Voice Services (EVS) codec. It improves error resilience for speech and audio transmission over packet-switched networks by handling packet loss in predictive video frames, ensuring smoother media playback and higher quality of experience.

Description

Random Access Skipped Leading picture (RASL) is a specific error resilience mechanism defined within the 3GPP Enhanced Voice Services (EVS) codec specifications, primarily detailed in TS 26.346 for media delivery over MBMS. It operates at the intersection of audio/video coding and packet loss concealment. The technique is designed to handle scenarios where a leading picture in a video stream, which is predictively coded and depends on a previous random access point picture, is lost or arrives out of order due to network packet loss or jitter. In predictive video coding (like H.264/AVC or HEVC used in multimedia telephony), a RASL picture is a type of leading picture that precedes a Clean Random Access (CRA) picture in decoding order but follows it in output order. If the CRA picture (the key recovery point) is missing, these dependent RASL pictures cannot be correctly decoded. The EVS codec's system layer, when configured for video-aware operation, can signal and manage these dependencies. The mechanism works by allowing the decoder or the system to 'skip' or discard these undecodable leading pictures when the necessary reference (the CRA) is lost, preventing the propagation of visual artifacts and allowing for faster recovery at the next valid random access point. This involves careful timestamp management and decoding order synchronization between the audio (EVS) and video streams to maintain lip-sync and presentation continuity. Its role is critical in conversational services and streaming where low latency and robustness are paramount, ensuring that temporary network impairments do not cause prolonged degradation in the audiovisual presentation.

Purpose & Motivation

RASL was introduced to address the challenge of robust multimedia communication over unreliable packet-switched networks like LTE and 5G NR. Prior approaches in video telephony could suffer from severe and persistent visual corruption when packet loss occurred near random access points, as decoders might attempt to decode dependent pictures without valid references, leading to error propagation. The primary problem it solves is the graceful handling of packet loss for predictively coded video frames that are out-of-order relative to their reference frames, a common issue in real-time transport. By formally defining the handling of RASL pictures within the 3GPP media delivery framework, it enables more resilient video services alongside high-quality EVS audio. This was particularly motivated by the evolution towards all-IP networks for voice and video services (VoLTE, ViLTE, VoNR), where traditional circuit-switched error robustness mechanisms were not applicable. It ensures that the Quality of Experience (QoE) for combined speech and video services meets user expectations even under suboptimal radio conditions.

Key Features

  • Defines handling for leading pictures that depend on a missing Clean Random Access (CRA) picture
  • Prevents error propagation in predictive video coding after packet loss
  • Enables faster visual recovery at the next valid synchronization point
  • Integrates with 3GPP EVS codec system layer for synchronized media playback
  • Supports robust multimedia telephony over packet-switched networks (e.g., ViLTE)
  • Specified within MBMS and streaming service architectures in 3GPP

Evolution Across Releases

Rel-12 Initial

Initially introduced within the Enhanced Voice Services (EVS) codec framework, specifically in TS 26.346 for Multimedia Broadcast/Multicast Service (MBMS) applications. The architecture defined the concept for handling skipped leading pictures to improve error resilience in video streams delivered alongside EVS audio, establishing the foundational signaling and decoding rules.

Defining Specifications

SpecificationTitle
TS 26.346 3GPP TS 26.346
TS 26.522 3GPP TS 26.522
TS 26.906 3GPP TS 26.906