Description
Media Identification (MID) is a framework used within 3GPP to uniquely identify and describe distinct media components within a multimedia session established via protocols like the Session Initiation Protocol (SIP) and the Session Description Protocol (SDP). It is a crucial part of media negotiation and session control, particularly in IP Multimedia Subsystem (IMS) services. A multimedia session, such as a voice call with video or a multi-stream conferencing service, consists of multiple media streams (e.g., primary audio, secondary video for content sharing, text). Each of these logical media components is assigned a unique MID attribute within the SDP offer/answer exchange. The MID attribute, defined in IETF RFC 5888, is included in the 'm=' (media) line of the SDP description. It allows endpoints and network nodes (like the Media Resource Function Controller, MRFC) to correlate media lines across SDP offers and answers, and to refer to specific media streams for manipulation. For instance, during a session modification, an endpoint can use the MID value to indicate which specific media stream it wishes to put on hold, change the codec for, or remove entirely. This is more robust than using the order of media lines, which can change. Within 3GPP, MID is used in conjunction with other SDP and SIP mechanisms for features like Early Media, Interactive Connectivity Establishment (ICE) for NAT traversal, and media authorization. The Policy and Charging Rules Function (PCRF) or the Application Function (AF) can also use media identification to apply specific Quality of Service (QoS) and charging policies to individual media flows within a session, enabling sophisticated service differentiation.
Purpose & Motivation
MID was introduced to solve the limitations of managing complex multimedia sessions where multiple, simultaneous media streams are involved. Early multimedia sessions typically described media based on the order of 'm=' lines in SDP, which was fragile—if the order changed during negotiation, it could lead to misassociation of streams and failed sessions. The need for a stable, unique identifier became acute with the advent of advanced IMS services like Video Telephony, Collaborative Workspaces, and Rich Communication Services (RCS), where sessions may dynamically add or remove media components (e.g., adding a file transfer or a second video stream). MID provides an explicit, protocol-level handle for each media component, enabling robust session modification, hold/resume operations, and content adaptation. It allows network elements to precisely identify which media flow a policy rule should apply to, which is essential for implementing advanced charging models (e.g., charging video separately from voice) and guaranteed bitrate QoS. Its creation was driven by the IETF's work on SDP and adopted by 3GPP to ensure interoperability and reliable service control in the all-IP service architecture of IMS.
Key Features
- Provides a unique textual identifier for each SDP media description ('m=' line)
- Defined in IETF RFC 5888 and adopted by 3GPP
- Enables stable referencing of media streams during SDP offer/answer exchanges
- Facilitates independent manipulation (hold, modify, remove) of individual media components
- Used by network policy functions (PCRF, AF) for granular media-aware policy control
- Essential for complex IMS services like video calls and multimedia conferencing
Evolution Across Releases
Introduced with the initial IMS architecture. Adopted the SDP framework for media negotiation in IMS-based multimedia sessions. MID usage was implicit in early media control mechanisms, laying the groundwork for identifying media flows within the nascent all-IP service layer.
Defining Specifications
| Specification | Title |
|---|---|
| TS 23.048 | 3GPP TS 23.048 |
| TS 24.501 | 3GPP TS 24.501 |
| TS 29.333 | 3GPP TS 29.333 |
| TS 31.115 | 3GPP TR 31.115 |
| TS 33.812 | 3GPP TR 33.812 |