Description
Region-Wise Packing (RWP) is a sophisticated video processing mechanism defined within 3GPP's Multimedia Broadcast/Multicast Service (MBMS) and Media Streaming frameworks. It operates at the intersection of video coding and network delivery, specifically designed for advanced video formats like Omnidirectional Media (360-degree video). The core architectural principle involves segmenting a video frame, typically a projection of a spherical view, into multiple rectangular regions, each representing a distinct spatial area of the full panorama. These regions are then independently encoded, often with varying quality levels, and 'packed' into a single, standard-compliant video bitstream, such as an HEVC tile-based bitstream. This packing process is governed by metadata that describes the spatial location and arrangement of each region within the overall frame canvas.
From a network perspective, RWP enables highly efficient delivery. Instead of transmitting the entire high-resolution omnidirectional video stream to a user's device, the network can leverage this structure. A client application, aware of the user's current field of view (viewport), can request only the specific packed regions required to render that viewport at high quality, while other regions may be delivered at lower quality or not at all. This is facilitated by protocols like Dynamic Adaptive Streaming over HTTP (DASH), where the Media Presentation Description (MPD) contains the RWP metadata, allowing the client to map requested spatial segments to the corresponding data segments in the bitstream.
The role of RWP in the network is to act as an enabler for bandwidth-efficient immersive media services. It sits within the application layer and influences the operation of the media encoder, the manifest file generator, and the adaptive streaming client. By decoupling the spatial organization of video content from its encoding and transport, RWP provides the foundational structure for viewport-adaptive streaming, which is critical for delivering high-quality VR and 360-degree video over capacity-constrained radio access networks. It transforms a monolithic video asset into a structured, spatially-aware asset that the network and client can interact with intelligently.
Purpose & Motivation
Region-Wise Packing was created to address the significant bandwidth challenge of streaming immersive media, particularly 360-degree and virtual reality video. Traditional video streaming transmits the entire frame for every video segment, which is grossly inefficient for omnidirectional content where a user typically views only a portion (e.g., 90-120 degrees) of the full 360x180-degree sphere at any given time. Transmitting the full, high-resolution sphere would consume excessive network resources and lead to poor user experience due to buffering or reduced quality.
The historical context for RWP's development is the standardization of 5G Multimedia Services in 3GPP Release 15 and beyond, where enhanced Mobile Broadband (eMBB) and media services were key drivers. Previous approaches either streamed the full monoscopic or stereoscopic video (wasteful) or required complex server-side viewport rendering and dedicated unicast streams (not scalable). RWP provides a standardized, intermediate solution. It motivates a shift towards a client-driven, adaptive model by structuring the content at the source. This solves the problem by allowing the network to deliver a single, structured bitstream that can be consumed efficiently by multiple clients with different viewports, balancing bandwidth savings with the flexibility of client-side viewport selection.
Key Features
- Spatial partitioning of video frames into independently accessible regions
- Packing of multiple encoded regions into a single standards-compliant video bitstream (e.g., HEVC tiles)
- Generation of standardized metadata describing region location and packing structure
- Enables viewport-adaptive streaming for omnidirectional media
- Bandwidth efficiency by allowing selective request and delivery of high-quality regions
- Compatibility with MPEG-DASH for dynamic adaptation based on viewport and network conditions
Evolution Across Releases
Initial introduction of Region-Wise Packing for Omnidirectional Media Format (OMAF) in 3GPP. Defined the fundamental framework for packing multiple rectangular region-wise video sub-pictures into an HEVC tile bitstream and the associated Viewport-Dependent Delivery metadata within the MPEG-DASH MPD to enable viewport-adaptive streaming services.
Defining Specifications
| Specification | Title |
|---|---|
| TS 26.114 | 3GPP TS 26.114 |
| TS 26.118 | 3GPP TS 26.118 |
| TS 26.962 | 3GPP TS 26.962 |