Description
In the context of 3GPP specifications, a Coded Video Sequence (CVS) is a self-contained, decodable bitstream that constitutes a video program. Technically, it is a sequence of Network Abstraction Layer (NAL) units that starts with an Instantaneous Decoding Refresh (IDR) access unit or a Broken Link Access (BLA) unit and ends before the next IDR or BLA access unit, or at the end of the bitstream. This structure ensures random access points and clean switching between different video streams or segments, which is essential for adaptive bitrate streaming and channel switching in broadcast services like eMBMS (evolved Multimedia Broadcast Multicast Service).
The CVS architecture is integral to video codecs standardized by 3GPP, such as Advanced Video Coding (AVC/H.264) and High Efficiency Video Coding (HEVC/H.265). It encapsulates a sequence of coded pictures, including I-frames (intra-coded), P-frames (predicted), and B-frames (bi-predicted), organized according to a Group of Pictures (GOP) structure. The sequence begins with a key frame (IDR or BLA) that resets the decoder's state, allowing independent decoding without reference to prior frames. This is crucial for error resilience, as corruption in one CVS does not propagate to subsequent sequences.
Within the 3GPP system, CVS plays a key role in video service delivery protocols. For example, in Multimedia Broadcast Multicast Service (MBMS) and its evolved variant (eMBMS), video content is packetized and transmitted as a series of CVS units over the broadcast/multicast bearer. The 3GPP Packet-switched Streaming (PSS) service and Multimedia Telephony Service for IMS (MTSI) also utilize CVS for unicast video streaming and conversational services, respectively. The specifications (e.g., TS 26.346 for MBMS, TS 26.265 for codec conformance) define how CVS is mapped onto transport protocols like RTP/RTCP and file formats like 3GP, ensuring end-to-end compatibility from content encoding to playback on user equipment.
The role of CVS extends to quality of service (QoS) and network efficiency. By providing clear boundaries for video segments, it enables efficient bandwidth adaptation, error concealment, and seamless handovers during user mobility. Network elements, such as the Broadcast Multicast Service Center (BM-SC) in MBMS, can manipulate CVS units for service announcements, synchronization, and delivery. Furthermore, CVS is a fundamental unit for Digital Rights Management (DRM) and content encryption in 3GPP, allowing selective protection of video sequences without impacting the entire stream.
Purpose & Motivation
The Coded Video Sequence (CVS) was introduced to standardize the carriage and processing of video content within the 3GPP ecosystem, addressing the growing demand for mobile video services. Prior to its formal definition in Release 12, video delivery over cellular networks relied on various proprietary or loosely defined packetization methods, leading to interoperability issues, inefficient bandwidth usage, and poor user experience during network impairments or channel switching. The CVS provides a clear, codec-agnostic structure that ensures video streams can be reliably decoded, manipulated, and transmitted across heterogeneous networks and devices.
A primary motivation for standardizing CVS was to support efficient broadcast and multicast services, specifically MBMS/eMBMS, where a single video stream is delivered to multiple users simultaneously. Without a standardized sequence structure, managing random access, service continuity, and adaptive streaming in broadcast scenarios was challenging. CVS enables the network to insert synchronization markers, announce service boundaries, and apply forward error correction (FEC) at the sequence level, enhancing reliability and reducing latency for live streaming and file delivery.
Furthermore, CVS addresses the limitations of earlier video delivery mechanisms by providing a foundation for advanced features like Dynamic Adaptive Streaming over HTTP (DASH) in mobile environments. By aligning video sequences with DASH segments, CVS facilitates smooth bitrate switching and trick-play operations (e.g., fast-forward, rewind). Its definition also supports the evolution towards higher efficiency video codecs (HEVC) and immersive media (e.g., 360-degree video), ensuring backward compatibility and future-proofing the 3GPP video service architecture. In essence, CVS exists to unify video handling, solve fragmentation issues, and enable scalable, high-quality video delivery across all 3GPP services.
Classification
Detected Changes Across Releases
from 3GPP Change RequestsSpecific changes extracted from the „Change history“ tables of 3GPP specifications (7 CRs across 3 releases). Complements the general historical overview above with the evidence-based evolution of this function.
Studied in Rel-12, normative work from Rel-15.
In Release 15, support for High Dynamic Range (HDR) video was newly introduced for the CVS function within TV video profiles. This was specified through the addition of dedicated HDR Operation Points for Full HD and UHD resolutions, utilizing the HEVC/H.265 Main-10 Profile and defining the use of both the Perceptual Quantizer (PQ) and Hybrid Log-Gamma (HLG) transfer functions from ITU-R BT.2100. These new Operation Points expanded the defined consistent video distribution formats to include enhanced colour and contrast capabilities for TV services over 3GPP systems.
- HDR Support in TV Video Profiles TS 26.116CR0007
In Release 16, the CVS function was enhanced for TV video profiles to support HLG HDR video, adding specific Operation Points for Full HD and UHD HLG content. The release also introduced more precise definitions for the spatial positioning of chroma samples within these TV video profiles. Furthermore, it included miscellaneous fixes and reference updates to solidify the technical specifications for consistent TV service distribution over 3GPP networks.
In Release 17, the key new feature for the CVS function was the introduction of an "8K HEVC Operation Point" for 7680 x 4320 resolution video, aligning it with CMAF (Common Media Application Format). This formally defined an 8K UHD operation point using the H.265/HEVC Main-10 Profile at Level 6.3 within the specification's framework for consistent TV service distribution formats.
- 8K HEVC Operation Point and CMAF Alignment TS 26.116CR0018
Explore further
Broader topics and technologies where CVS plays a role.
Defining Specifications
3GPP specifications that define or reference CVS, with the latest known release. Sourced from the 3GPP document catalog — see methodology.
| Specification | Title | Release |
|---|---|---|
| TS 26.116 vj00 | TV Video Formats for 3GPP Services | Rel-19 |
| TS 26.265 vj10 | Video Operation Points & Capabilities | Rel-19 |
| TS 26.346 vj20 | MBMS User Services Media Codecs & Protocols | Rel-19 |