Description
An Intra Random Access Picture is a specific type of frame in a video bitstream encoded using codecs like H.264/AVC or H.265/HEVC. Technically, it is a coded picture where all slices are intra-coded slices (I-slices), meaning they are compressed using spatial redundancy within the same picture only, without reference to other pictures. This makes the IRAP self-contained and decodable independently. In the context of adaptive streaming, an IRAP picture, along with all subsequent pictures in decoding order up to the next IRAP picture, forms a "Random Access Point" segment. This segment can be decoded without any reference to pictures that precede it in decoding order, effectively creating a decodable entry point into the continuous stream.
The architecture for IRAP utilization is defined within the 3GPP Packet-switched Streaming Service (PSS) and Dynamic Adaptive Streaming over HTTP (DASH) specifications. Key components include the media encoder, which periodically inserts IRAP pictures according to a defined interval or on scene changes; the segmenter, which packages media data into segments aligned with IRAP boundaries for DASH; and the Media Presentation Description (MPD) file, which signals the availability and timing of these random access points to the client. The client player uses this information to perform bitrate adaptation by cleanly switching to a segment from a different representation (a different bitrate or resolution stream) starting at an IRAP picture, ensuring seamless playback without decoding errors.
How it works in practice involves careful alignment between encoding, packaging, and signaling. During video encoding, the encoder is configured to generate IRAP pictures at regular intervals (e.g., every 2 seconds). These pictures are significantly larger than predicted (P or B) frames but provide crucial access points. For DASH delivery, the media file is fragmented into segments such that each segment begins with an IRAP picture. The MPD indicates the start time of each segment relative to the presentation timeline. When a user seeks to a new time or the client decides to switch to a higher or lower bitrate, it requests the segment from the new representation that contains the IRAP picture nearest to the desired playback time. This ensures the decoder can initialize and start decoding correctly from that point, maintaining continuous playback.
Purpose & Motivation
The IRAP concept exists to solve critical problems in video streaming, particularly adaptive streaming over HTTP (DASH). The primary problems are enabling random access (seeking) within a stream and facilitating seamless switching between different encoded versions of the same content (adaptive bitrate switching). Without self-contained access points, a video player seeking to a new position would have to decode all frames from the start of the stream or the last full keyframe, which is inefficient and slow. Similarly, switching bitrates mid-stream would cause decoding failures if the new stream segment depended on frames from the old stream.
Historically, simple I-frames (Intra-coded frames) provided random access, but the IRAP concept in the context of H.264/AVC and HEVC provides a more formalized and robust definition that includes handling of leading pictures (like Broken Link Access pictures). Its adoption in 3GPP standards (starting Rel-12) was motivated by the industry's shift towards HTTP-based adaptive streaming as the dominant method for delivering video over mobile and fixed networks. 3GPP needed to standardize how to package and signal media for reliable, interoperable streaming services on devices.
IRAP addresses the limitations of non-aligned stream switching and inefficient seeking. By mandating that segments for adaptive streaming start with an IRAP picture, it guarantees that a client can always switch representations cleanly. This is fundamental to providing a high Quality of Experience (QoE), as it enables smooth adaptation to changing network bandwidth without introducing visual artifacts or playback stalls. It also enables efficient trick-play modes (fast-forward, rewind) and error resilience, as transmission errors can be recovered at the next IRAP boundary.
Key Features
- Self-contained coded picture (all intra-coded slices) enabling independent decoding
- Serves as a random access point for seeking and stream splicing
- Mandatory alignment point for DASH media segments to enable bitrate switching
- Defined within video coding standards like H.264/AVC and H.265/HEVC
- Signaled in the Media Presentation Description (MPD) for DASH clients
- Enables error recovery and trick-play functionality in streaming
Evolution Across Releases
Formally introduced IRAP and its role in 3GPP DASH and streaming services. Standardized the requirement for DASH segments to begin with an IRAP picture (or an access unit that is a Random Access Point). Defined signaling of segment start times and random access properties in the MPD to enable reliable client-driven adaptation.
Defining Specifications
| Specification | Title |
|---|---|
| TS 26.114 | 3GPP TS 26.114 |
| TS 26.223 | 3GPP TS 26.223 |
| TS 26.346 | 3GPP TS 26.346 |
| TS 26.522 | 3GPP TS 26.522 |
| TS 26.822 | 3GPP TS 26.822 |
| TS 26.906 | 3GPP TS 26.906 |
| TS 26.948 | 3GPP TS 26.948 |