MVC

Multi-view Video Coding

Services
Introduced in Rel-2
Multi-view Video Coding (MVC) is an extension of the H.264/AVC standard for efficiently compressing video from multiple cameras. It is foundational for delivering 3D video services over mobile networks, enabling stereoscopic 3D playback and interactive viewpoint selection by exploiting similarities between different views of the same scene.

Description

Multi-view Video Coding (MVC) is a video compression standard developed as an amendment to H.264/Advanced Video Coding (AVC). Its primary function is to encode video sequences captured simultaneously from multiple cameras arranged to cover the same scene from different angles. The fundamental technical approach is hierarchical B-picture prediction extended across both time (within a single view) and view (across different camera perspectives). This creates a prediction structure that maximizes compression by using already encoded pictures from other views as references, in addition to past and future pictures in the same view.

The MVC codec architecture designates one view as the base view, which is encoded compliant with the H.264/AVC High Profile, ensuring that any AVC decoder can decode this view for 2D playback. Additional auxiliary views are encoded using inter-view prediction. Key coding tools include motion-compensated prediction for temporal redundancy and disparity-compensated prediction for spatial redundancy between views. The encoder generates a single bitstream containing all views, interleaved according to the prediction hierarchy. The bitstream includes Network Abstraction Layer (NAL) units with new types to identify view components and manage the decoding order.

Within 3GPP ecosystems, MVC has been specified for use in Packet-switched Streaming Service (PSS) and Multimedia Broadcast/Multicast Service (MBMS) for delivering 3D video content. A network service delivers the MVC bitstream, and a compatible client (e.g., a mobile phone with a 3D display) decodes at least two views to render a stereoscopic image. The 3GPP specifications define profiles and levels for MVC, its carriage in MPEG-2 Transport Streams or ISO Base Media File Format, and signaling in session description or media presentation descriptions. Its role was to provide a standardized, efficient method for 3D video delivery, forming the basis for early mobile 3D TV services.

Purpose & Motivation

MVC was created to enable practical 3D video services, which require at least two views (for left and right eye), at a time when transmitting two independent AVC streams would have doubled the bandwidth requirement. The primary problem it solved was the high bitrate cost of multi-view video, which was a major barrier to deploying 3DTV and other immersive services over bandwidth-constrained channels like mobile networks.

The historical context is the rise of 3D cinema and the initial push for 3D television in the late 2000s. 3GPP began incorporating support for advanced video services around Release 9. MVC, standardized earlier by MPEG, was adopted into 3GPP to provide a technically sound solution. It addressed the limitation of simulcast (independent streams) by providing typically 20-50% bitrate savings for two-view stereo content, making mobile 3D video broadcast and streaming commercially and technically viable. It laid the groundwork for all subsequent multi-view coding work in 3GPP, including the later MV-HEVC.

Key Features

  • Efficient compression for stereoscopic (two-view) and multi-view video sequences
  • Backward compatibility: base view is decodable by standard H.264/AVC decoders
  • Uses hierarchical B-pictures and disparity-compensated prediction across views
  • Defined profiles and levels for interoperability in 3GPP services
  • Support for interactive view switching in free-viewpoint applications
  • Specified carriage in 3GPP media formats for streaming and broadcast

Evolution Across Releases

Rel-2 Initial

Initial introduction of basic video capabilities in 3GPP. MVC was not yet defined; this release focused on fundamental circuit-switched multimedia services. The video codec focus was on H.263 and MPEG-4 Visual Simple Profile, establishing the baseline for media services in the 3GPP framework.

Introduced support for H.264/AVC (MPEG-4 Part 10) as a primary video codec for PSS and MBMS, setting the foundational single-view codec upon which MVC would later be built as an extension.

Formally adopted the Multi-view Video Coding (MVC) standard into 3GPP specifications for the first time. Defined its use for stereoscopic video (two views) within the MBMS framework, enabling mobile 3D TV broadcast services. Specified profiles, levels, and carriage requirements.

Enhanced MVC support by defining its application for Packet-switched Streaming (PSS) in addition to MBMS. Provided more detailed specifications for file format (3GP) and streaming adaptations, solidifying MVC as a key codec for on-demand and broadcast 3D video delivery.

Marked the beginning of a transition towards next-generation codecs. While MVC remained supported, this release introduced the more efficient HEVC codec and its multi-view extension, MV-HEVC, as the advanced successor for future services.

Defining Specifications

SpecificationTitle
TS 22.977 3GPP TS 22.977
TS 26.904 3GPP TS 26.904
TS 26.905 3GPP TS 26.905