VUI

Video Usability Information

Services
Introduced in Rel-13
VUI (Video Usability Information) is a set of metadata parameters embedded within a video bitstream (like H.264/AVC or H.265/HEVC) that describes the characteristics of the video signal. It provides essential information to decoders and displays about color space, transfer characteristics, and matrix coefficients to ensure accurate color reproduction and optimal viewing experience across different devices.

Description

Video Usability Information (VUI) is a standardized syntax and semantics defined within video coding standards, notably H.264/Advanced Video Coding (AVC) and H.265/High Efficiency Video Coding (HEVC), and is crucial for multimedia services in 3GPP. It is not video compression data itself, but rather auxiliary information carried in the Sequence Parameter Set (SPS) of the bitstream. VUI parameters describe the properties of the source video signal that are not inherently deducible from the decoded pixel values alone, enabling the receiving device to interpret and render the video content as intended by the content creator.

Architecturally, VUI is generated at the encoding side—by a content server, media encoder, or during video transcoding—and is encapsulated within the video elementary stream. When a 3GPP device, such as a smartphone, receives a video stream (e.g., via Multimedia Broadcast Multicast Service (MBMS) or Packet-Switched Streaming Service (PSS)), its video decoder extracts the VUI parameters during the parsing of the SPS. These parameters are then passed to the device's display processing pipeline. The key components described by VUI include the video signal's color primaries (the chromaticity coordinates of the red, green, and blue primaries), transfer characteristics (the electro-optical transfer function, such as gamma curve or Perceptual Quantizer for HDR), and matrix coefficients (the conversion matrix from non-linear RGB to luma and chroma components, like YCbCr).

How it works is through precise signaling. For example, the 'colour_primaries' field indicates whether the video uses the standard BT.709 (HDTV) or BT.2020 (Ultra HD) color gamut. The 'transfer_characteristics' field signals if the video is standard dynamic range (SDR) using a gamma curve or high dynamic range (HDR) using SMPTE ST 2084 (PQ) or HLG. The 'matrix_coeffs' tells the decoder how to convert the decoded YCbCr samples back to RGB for display. Without this information, a decoder might assume default values (often BT.709), leading to incorrect color saturation, incorrect brightness, or washed-out images when playing back content mastered with different parameters, such as HDR or wide color gamut content.

Its role in 3GPP networks is to ensure quality of experience (QoE) for video services. As mobile networks evolved to deliver higher resolution (4K, 8K) and HDR content, correct color management became critical. VUI provides the necessary metadata for end-to-end color fidelity. It allows heterogeneous devices with different display capabilities (SDR, HDR10, HLG) to automatically configure themselves to present the video correctly, or to perform appropriate tone mapping if the display cannot fully support the signaled characteristics. This is essential for broadcast services like eMBMS and for adaptive streaming (DASH) where multiple representations of content (with different VUI) may be available.

Purpose & Motivation

VUI was created to solve the problem of inconsistent and inaccurate video playback across the myriad of devices and displays in the market. Early digital video standards often implied certain colorimetric assumptions (like BT.601 for SD, BT.709 for HD). However, as video content creation expanded—with computer graphics, different camera systems, and emerging HDR standards—these implicit assumptions broke down. Without explicit signaling, a decoder had to guess the intent, leading to visible errors like incorrect color tint (e.g., red faces) or mismatched brightness.

The motivation for standardizing VUI within codecs like H.264/AVC was to provide a robust, in-band mechanism to convey this critical display metadata. It addresses the limitation of out-of-band signaling (like container formats) which could be lost during transcoding or transmission. By embedding it directly in the coded video bitstream, VUI remains intact through most processing chains. Its adoption into 3GPP specifications (e.g., for PSS and MBMS) was driven by the need to standardize video delivery over mobile networks, ensuring that advanced video features like HDR and wide color gamut could be deployed reliably.

Historically, its importance grew with the transition to HD and then UHD. For 3GPP, specifically, as Rel-13 and later focused on enhanced mobile broadband (eMBB) and rich media services, incorporating support for proper VUI parsing and handling became essential for competitive video offerings. It solves the problem of 'what you see is what was intended' in a fragmented device ecosystem, enabling content providers and network operators to deliver a high-quality, consistent viewing experience, which is a key differentiator for mobile video services.

Key Features

  • Embedded within video bitstream Sequence Parameter Set (SPS)
  • Signals color primaries defining the color gamut (e.g., BT.709, BT.2020)
  • Signals transfer characteristics defining the EOTF (e.g., gamma, PQ, HLG)
  • Signals matrix coefficients for YUV to RGB conversion
  • Enables automatic and accurate display configuration on playback devices
  • Essential for correct rendering of High Dynamic Range (HDR) and Wide Color Gamut (WCG) content

Evolution Across Releases

Rel-13 Initial

Introduced support for VUI parameters within 3GPP multimedia service specifications, particularly for the Packet-switched Streaming Service (PSS) and Multimedia Broadcast/Multicast Service (MBMS). This included referencing the relevant VUI syntax from H.264/AVC and H.265/HEVC standards to enable signaling of color space and transfer characteristics, forming the basis for consistent color reproduction in mobile video delivery.

Enhanced support for Ultra High Definition (UHD) and High Dynamic Range (HDR) video services. Specifications further detailed the use of VUI parameters for signaling HDR metadata (like SMPTE ST 2086 for mastering display color volume) and for defining profiles and levels that mandate specific VUI settings for advanced video codecs, improving interoperability for premium video content.

Strengthened integration with Dynamic Adaptive Streaming over HTTP (DASH) for 5G media delivery. Specifications ensured VUI parameters are correctly mapped and preserved in Media Presentation Descriptions (MPD) and segments, enabling adaptive streaming clients to select the appropriate video representation based on both codec requirements and display capabilities signaled by VUI.

Refinements for immersive media and extended reality (XR) applications. Work included considerations for VUI in the context of multi-view and 360-degree video, ensuring color consistency across different views and for head-mounted displays, which have specific colorimetric requirements.

Continued evolution aligned with new video coding standards like Versatile Video Coding (VVC). Ensured backward compatibility and forward-looking support for VUI parameters in next-generation codecs, maintaining the framework for accurate color management as video resolutions and dynamic ranges continue to advance in 5G-Advanced systems.

Defining Specifications

SpecificationTitle
TS 26.116 3GPP TS 26.116
TS 26.118 3GPP TS 26.118
TS 26.948 3GPP TS 26.948
TS 26.949 3GPP TS 26.949