SMIL

Synchronized Multimedia Integration Language

Services →
Introduced in Rel-2

SMIL is an XML-based markup language used in 3GPP to define multimedia messaging and streaming presentations, enabling precisely timed and synchronized audio, video, text, and graphics on mobile devices.

Category
Services
Introduced
Rel-2
Where
Services › Codecs
Specifications
9 specs
SMIL Description Purpose Related Classification Specifications

Description

The Synchronized Multimedia Integration Language (SMIL) is a World Wide Web Consortium (W3C) standard that provides an XML-based framework for authoring interactive multimedia presentations. It allows content creators to combine various media elements—such as audio clips, video streams, text, and images—into a single, synchronized presentation. SMIL defines a timeline-based model where each media object has specified start and end times, durations, and spatial positions on a display. This enables precise control over when and where media items appear, creating cohesive experiences like slideshows with audio narration, animated graphics with video overlays, or interactive training modules. The language uses tags and attributes to describe the temporal behavior, layout, and hyperlinking of media, making it platform-independent and easily parsed by SMIL players or browsers.

In 3GPP, SMIL is adopted primarily for multimedia messaging services (MMS) and streaming applications in mobile networks. For MMS, SMIL serves as the presentation layer that defines how multimedia messages are structured and rendered on recipient devices. A typical MMS message includes a SMIL document that specifies the sequence of media objects (e.g., an image followed by text and then audio), along with timing information and layout instructions. This ensures that the message is displayed consistently across different handsets, enhancing user experience. 3GPP specifications, such as 23.140 for MMS and 26.234 for packet-switched streaming service, detail the use of SMIL profiles tailored for mobile environments, considering limitations like screen size, bandwidth, and processing power. These profiles define subsets of SMIL features to ensure interoperability and efficient delivery over wireless networks.

The architecture of SMIL-based services in 3GPP involves several components. Content creators use SMIL authoring tools to generate presentations, which are then packaged with media files into a single entity (e.g., an MMS message or streaming playlist). On the network side, multimedia messaging service centers (MMSCs) or streaming servers deliver these packages to mobile devices. The client device includes a SMIL player—often integrated into the messaging or media application—that interprets the SMIL document, retrieves the referenced media files, and renders the presentation according to the specified timing and layout. Key aspects include support for adaptive streaming, where SMIL can describe alternative media sources for different bandwidth conditions, and integration with 3GPP file formats like 3GP. SMIL's role extends to rich communication services (RCS) and other multimedia applications, providing a standardized way to create engaging content that leverages the capabilities of mobile networks.

Purpose & Motivation

SMIL was developed to address the need for a standardized way to create synchronized multimedia presentations on the web, where early approaches relied on proprietary technologies or complex scripting. Before SMIL, integrating multiple media types with precise timing required custom solutions that were often not interoperable across different platforms. The W3C created SMIL as an open, XML-based language to enable authors to easily combine audio, video, and graphics into cohesive presentations without deep programming knowledge. This democratized multimedia content creation and ensured that presentations could be played back consistently on compliant players.

3GPP adopted SMIL to enhance mobile multimedia services, particularly with the introduction of MMS in 2G/3G networks. As mobile devices gained capabilities for images, audio, and video, there was a need for a standard format to structure multimedia messages beyond simple attachments. SMIL provided a lightweight, text-based format that could define the temporal and spatial layout of media, making MMS messages more interactive and visually appealing. It solved the problem of inconsistent rendering across different handset models by providing a common presentation layer. Additionally, SMIL supported streaming services by enabling playlists and synchronized content delivery, which was crucial for mobile TV and video-on-demand applications. Its adoption allowed operators to offer rich media experiences while ensuring interoperability in a multi-vendor ecosystem, driving the success of early mobile data services.

Classification

Part ofXML
Related approachesMMS3GP

Evolution Across Releases

Rel-2 Initial

Introduced SMIL for multimedia messaging services (MMS) in 3GPP, defining a mobile profile to structure MMS content. It enabled synchronized presentation of text, images, and audio in messages, establishing a standard format for rich media delivery over GSM and UMTS networks.

Explore further

Broader topics and technologies where SMIL plays a role.

Defining Specifications

3GPP specifications that define or reference SMIL, with the latest known release. Sourced from the 3GPP document catalog — see methodology.

SpecificationTitleRelease
TS 23.140 v1600 MMS Non-Realtime Service Definition Rel-6
TS 26.142 vj00 3GPP TS 26.142: Dynamic and Interactive Multimedia Scenes (DIMS) Rel-19
TS 26.233 vf00 3GPP Packet-Switched Streaming Service (PSS) Rel-15
TS 26.234 vj00 3GPP PSS Protocols and Codecs Specification Rel-19
TS 26.245 vj00 3GPP Timed Text Format Specification Rel-19
TS 26.246 vj00 3GPP SMIL Language Profile Specification Rel-19
TS 26.247 vj00 3GPP Progressive Download & DASH over HTTP Rel-19
TR 26.907 vj00 HTML5 for 3GPP Services Study Rel-19
TR 26.937 vj00 3GPP PSS Characterization Rel-19