DIMS

Dynamic and Interactive Multimedia Scene

Services
Introduced in Rel-8
A 3GPP service enabling synchronized, interactive multimedia presentations across multiple devices. It defines 'scenes' composed of media components and scene commands, allowing for rich, coordinated user experiences like multi-screen sports broadcasts or interactive educational content.

Description

DIMS (Dynamic and Interactive Multimedia Scene) is a 3GPP standardized service (TS 26.140) that provides a framework for creating, delivering, and rendering synchronized, interactive multimedia presentations. A DIMS presentation is called a 'Scene'. A Scene is a temporal composition of multimedia components—such as video, audio, images, text, and graphics—that are spatially arranged and can change dynamically based on user interaction or timed events. The core innovation of DIMS is the 'Scene Command', a set of instructions defined in the DIMS specification that modify the state of a Scene in a deterministic way. These commands are delivered alongside or within the media streams.

Architecturally, a DIMS service involves several key entities: the DIMS Application Server, which authors and packages the Scene; the delivery network (e.g., MBMS, PSS, or streaming servers); and the DIMS Client residing on the user's device. The Scene description, often using a declarative format, defines the initial layout, media components, and their relationships. The dynamic behavior is controlled by Scene Commands. These commands are small, efficient instructions that can trigger actions like 'ShowComponent', 'HideComponent', 'MoveTo', 'ChangeVolume', or 'LoadNewMedia'. They are timestamped and synchronized with the media timeline, ensuring all clients update the Scene state at precisely the same moment.

The DIMS Client is responsible for interpreting the Scene description, receiving and buffering media components, and executing the incoming stream of Scene Commands. It maintains a scene graph representing the current state of all components. Upon receiving a command, the client updates this internal model and renders the changes accordingly. This allows for complex interactivity, such as letting a user select a camera angle during a sports event, which triggers a Scene Command to switch the video source, all while keeping other on-screen graphics synchronized. The service supports unicast, multicast, and broadcast delivery, making it suitable for both personalized and mass-media experiences.

DIMS plays a significant role in enabling next-generation multimedia services. It moves beyond simple streaming to offer a rich, application-like experience within a standardized broadcast or streaming framework. Its synchronization mechanisms are crucial for creating 'social viewing' experiences where multiple users see the same interactive elements simultaneously. Furthermore, by separating the control logic (Scene Commands) from the media assets, it allows for lightweight updates and personalization without needing to retransmit entire video streams, optimizing bandwidth usage.

Purpose & Motivation

DIMS was created to address the limitations of traditional linear broadcast and on-demand streaming services, which offered little room for synchronized interactivity and multi-screen experiences. Before DIMS, creating an interactive TV experience where graphics, alternative audio tracks, or camera angles could be changed in sync across millions of viewers required proprietary, non-interoperable technologies. This fragmented the market and increased costs for content creators and device manufacturers.

The primary problem DIMS solves is the standardized delivery of synchronized, stateful multimedia presentations. It was motivated by the growing demand for enriched TV (e.g., sports stats overlays, interactive advertising, choose-your-own-adventure stories) and the proliferation of second-screen applications. Without DIMS, synchronizing a companion app on a tablet with the main broadcast on a TV was complex and error-prone. DIMS provides a unified model where a single authored Scene can be rendered consistently across different device types, with built-in mechanisms for user input and state changes.

Historically, DIMS (introduced in Release 8) was part of 3GPP's broader push into advanced multimedia services alongside MBMS (Multimedia Broadcast Multicast Service) and PSS (Packet Switched Streaming). It aimed to leverage mobile networks' bidirectional capabilities to add interactivity to broadcast content. The technology was also seen as an enabler for rich media education, interactive advertising, and enhanced public warning systems. By providing a standardized 'scene graph' model and command set, it allowed content creators to write once and deploy everywhere, fostering innovation in interactive media while ensuring reliability and synchronization—key aspects where previous ad-hoc solutions failed.

Key Features

  • Defines 'Scene Commands' as a standardized set of instructions to dynamically modify a multimedia presentation.
  • Supports precise temporal synchronization of commands with media playback across multiple devices.
  • Enables rich interactivity, allowing user inputs to trigger scene state changes.
  • Uses a declarative scene description format to define initial layout and media components.
  • Supports multiple delivery mechanisms including unicast (PSS), broadcast/multicast (MBMS), and file download.
  • Facilitates multi-screen and synchronized second-screen experiences from a single authored scene.

Evolution Across Releases

Rel-8 Initial

Introduced the core DIMS framework in TS 26.140. Defined the fundamental concepts of Scenes, Scene Commands, and the DIMS client architecture. Established support for delivery over PSS and MBMS, enabling basic interactive broadcast services.

Enhanced DIMS with additional media codec support and refined scene description capabilities. Introduced more sophisticated scene command types for improved control over graphics and text components, broadening the range of possible interactive applications.

Added support for 3GPP Adaptive HTTP Streaming (AHS) as a delivery method, aligning DIMS with evolving streaming technologies. Enhanced synchronization mechanisms for better lip-sync and command timing in variable bandwidth conditions.

Introduced support for DIMS in conjunction with Enhanced Multimedia Broadcast Multicast Service (eMBMS), leveraging the improved capacity and efficiency of LTE broadcast for large-scale interactive events.

Further expanded the set of scene commands and interactivity features, supporting more complex user interface elements and integration with device sensors (e.g., accelerometer) for immersive experiences.

Focused on optimizations for power efficiency and memory usage on the DIMS client, crucial for mobile devices. Enhanced support for scalable video coding (SVC) within DIMS presentations.

Explored integration with Virtual Reality (VR) and 360-degree video components within a DIMS scene, preparing the standard for immersive media formats.

Aligned DIMS with 5G system architecture, ensuring compatibility with 5G Media Streaming and 5G Broadcast. Investigated ultra-low latency command delivery for real-time interactivity.

Enhanced support for multi-device synchronization, enabling more robust second-screen and social viewing scenarios. Worked on improved error resilience and recovery mechanisms for scene command streams.

Continued evolution for 5G-Advanced, exploring use of DIMS for interactive public warning systems and augmented reality (AR) overlays delivered via network broadcast.

Investigated AI/ML-based optimizations for scene authoring and adaptive command generation based on user behavior or network conditions.

Ongoing maintenance and feature enhancements to support next-generation interactive media services, ensuring DIMS remains relevant in the era of pervasive XR (Extended Reality) experiences.

Defining Specifications

SpecificationTitle
TS 26.140 3GPP TS 26.140
TS 26.142 3GPP TS 26.142
TS 26.234 3GPP TS 26.234
TS 26.244 3GPP TS 26.244
TS 26.346 3GPP TS 26.346
TS 26.430 3GPP TS 26.430
TS 26.907 3GPP TS 26.907