CMAF

Common Media Application Format

Services
Introduced in Rel-15
CMAF is a standardized media format for adaptive bitrate streaming, enabling efficient delivery of audio and video content over networks. It reduces storage and delivery costs by allowing a single set of media segments to be used across multiple streaming protocols and devices. This is crucial for scalable, high-quality media services in 3GPP networks.

Description

The Common Media Application Format (CMAF) is a media container and delivery format standardized by 3GPP, MPEG, and other bodies, designed to unify adaptive bitrate (ABR) streaming. It specifies a common set of media segment formats (based on the ISO Base Media File Format, ISOBMFF) that can be packaged once and delivered via multiple streaming protocols, such as HTTP Live Streaming (HLS) and Dynamic Adaptive Streaming over HTTP (MPEG-DASH). CMAF segments are structured as fragmented MP4 (fMP4) files, containing media data (e.g., video, audio) and metadata (e.g., timing, encryption information) in a way that supports efficient chunked transfer and playback. The format includes support for advanced codecs like HEVC/H.265 and AV1, as well as common encryption (CENC) for digital rights management (DRM).

Architecturally, CMAF operates within the media delivery chain between content preparation (encoding/packaging) and client playback. Content is encoded into multiple bitrate representations, each segmented into CMAF-compliant chunks. These chunks are stored on origin servers or content delivery networks (CDNs). During streaming, a client uses a manifest (e.g., MPD for DASH or m3u8 for HLS) to request segments adaptively based on network conditions. CMAF's key innovation is the 'CMAF track,' which defines a self-contained media stream (e.g., video or audio) with specific codec and encryption parameters, allowing interoperability. For example, a single CMAF track for a video representation can be served as a DASH segment or an HLS segment without repackaging, reducing storage overhead and latency.

CMAF's role in 3GPP networks is integral to media services like Multimedia Broadcast Multicast Service (MBMS) and 5G Media Streaming. It enables efficient broadcast and unicast delivery of live and on-demand content, supporting features like low-lency streaming and network-based media processing. In 5G systems, CMAF aligns with edge computing by allowing media segments to be cached and processed at the network edge (e.g., in Multi-access Edge Computing, MEC), improving quality of experience (QoE). The format also includes mechanisms for timed metadata (e.g., ad insertion cues) and chunked transfer encoding, which allows streaming servers to send partial segments as they become available, crucial for live events. Overall, CMAF provides a foundation for scalable, interoperable media delivery across heterogeneous devices and networks.

Purpose & Motivation

CMAF was created to address the fragmentation in adaptive streaming ecosystems, where different protocols (e.g., HLS, DASH) required separate media segments, leading to increased storage, bandwidth, and complexity. Before CMAF, content providers had to maintain multiple versions of the same content for different streaming formats, which was inefficient and costly. This fragmentation also hindered innovation, as new codecs or features needed implementation across multiple formats. CMAF solves this by providing a common intermediate format that decouples content preparation from delivery, allowing a single set of segments to be reused across protocols.

The motivation for CMAF within 3GPP stems from the growing demand for high-quality media services over mobile networks, such as 4G LTE and 5G. As video traffic dominates network usage, efficient delivery becomes critical for network performance and user experience. CMAF reduces latency and bandwidth consumption by enabling chunked transfer and reducing duplicate storage. It also supports advanced media capabilities like 4K/8K streaming, high dynamic range (HDR), and immersive audio, which are essential for next-generation services. Historically, 3GPP integrated CMAF into its specifications (starting in Release 15) to align with industry standards and enhance media delivery in MBMS and 5G systems, ensuring interoperability with global streaming practices.

By addressing these limitations, CMAF enables cost-effective scaling of media services, simplifies content workflows, and improves adaptability to network conditions. It is particularly important for 5G, where low-latency applications (e.g., live sports, virtual reality) benefit from CMAF's chunked streaming and edge integration. The format also future-proofs media delivery by supporting emerging codecs and encryption schemes, ensuring long-term viability in evolving telecom and media landscapes.

Key Features

  • Single set of media segments reusable across HLS and MPEG-DASH protocols
  • Based on fragmented MP4 (fMP4) format for efficient chunked transfer and playback
  • Supports advanced codecs including HEVC/H.265, AV1, and VVC for high-efficiency compression
  • Includes Common Encryption (CENC) for interoperable DRM and content protection
  • Enables low-latency streaming through chunked transfer encoding and CMAF Low Latency mode
  • Provides timed metadata tracks for dynamic ad insertion and event signaling

Evolution Across Releases

Rel-15 Initial

Introduced CMAF as a standardized media format in 3GPP specifications, focusing on adaptive streaming for MBMS and unicast services. Initial capabilities included support for common codecs like AVC and HEVC, fragmented MP4 segment structure, and integration with 5G media delivery. This release established the foundation for interoperable media segments across streaming protocols, reducing storage and bandwidth requirements.

Enhanced CMAF with support for 5G Media Streaming, including improved low-latency modes and chunked transfer for live content. Added features for edge computing integration, such as MEC-based media processing, and expanded codec support to include immersive audio formats. These changes optimized media delivery for 5G networks, enabling faster startup times and better QoE.

Extended CMAF capabilities for advanced media services, including support for enhanced reality (XR) streaming and dynamic adaptive streaming over QUIC. Introduced refinements in timed metadata for interactive applications and improved encryption mechanisms for security. This release focused on scalability and flexibility for diverse use cases like broadcast and on-demand streaming.

Further evolved CMAF to support network slicing for media delivery, allowing optimized resource allocation for different service types. Added enhancements for AI-based media optimization and expanded codec support for AV1 and future standards. These updates aimed at improving efficiency and personalization in 5G-Advanced networks.

Continued enhancements for CMAF in next-generation networks, focusing on integration with 6G study items and advanced media applications. Included improvements for energy-efficient streaming and support for novel media types like holographic content. This release ensured CMAF's relevance in evolving telecom ecosystems with higher data rates and lower latency demands.

Defining Specifications

SpecificationTitle
TS 26.117 3GPP TS 26.117
TS 26.247 3GPP TS 26.247
TS 26.265 3GPP TS 26.265
TS 26.511 3GPP TS 26.511
TS 26.512 3GPP TS 26.512
TS 26.517 3GPP TS 26.517
TS 26.802 3GPP TS 26.802
TS 26.804 3GPP TS 26.804
TS 26.841 3GPP TS 26.841
TS 26.857 3GPP TS 26.857
TS 26.891 3GPP TS 26.891
TS 26.918 3GPP TS 26.918
TS 26.955 3GPP TS 26.955
TS 26.956 3GPP TS 26.956
TS 26.998 3GPP TS 26.998