Description
Audio-Visual (AV) in 3GPP standards represents a comprehensive framework for handling multimedia content that includes audio, video, or both synchronized components. The architecture for AV services spans multiple network domains including the User Equipment (UE), Radio Access Network (RAN), and Core Network (CN), with specific protocols and codecs defined for efficient transmission. AV content is processed through media codecs (like AMR for audio and H.264/AVC or H.265/HEVC for video), packetized according to RTP/RTCP protocols, and transported over IP-based bearers with appropriate QoS treatment.
At the technical level, AV services involve several key components: media encoding/decoding functions in the UE and application servers, media gateways for format conversion, media resource functions for processing, and policy control for quality management. The 3GPP specifications define detailed procedures for AV session establishment, modification, and termination through SIP or HTTP-based signaling. Media delivery can occur through unicast (point-to-point), multicast, or broadcast mechanisms depending on the service type, with specific adaptations for different radio conditions and network capabilities.
AV services integrate with multiple 3GPP subsystems including IMS (IP Multimedia Subsystem) for conversational services, MBMS (Multimedia Broadcast Multicast Service) for broadcast, and EPS (Evolved Packet System) bearers for streaming. Security mechanisms including encryption (via SRTP) and integrity protection are applied to AV media streams, with key management handled through protocols like MIKEY or DTLS-SRTP. The system supports dynamic adaptation to network conditions through techniques like adaptive bitrate streaming, where video quality adjusts based on available bandwidth and device capabilities.
Quality of Service for AV is managed through dedicated bearers with appropriate QoS Class Identifiers (QCIs) that ensure low latency for conversational video and sufficient bandwidth for streaming. The Packet Delay Budget (PDB), Packet Error Loss Rate (PELR), and priority levels are specifically configured for different AV service types. Network functions including the PCRF (Policy and Charging Rules Function) and PCEF (Policy and Charging Enforcement Function) work together to enforce these policies and ensure consistent AV quality across the mobile network.
Purpose & Motivation
AV capabilities were introduced to enable rich multimedia services over mobile networks, addressing the growing demand for video communication and content consumption. Before standardized AV handling, mobile networks primarily supported voice and basic data services, with proprietary solutions for multimedia that lacked interoperability. The formal definition of AV within 3GPP specifications created a unified framework that allowed consistent implementation across devices and networks, enabling global multimedia services.
The creation of AV standards solved several critical problems: it provided efficient compression techniques suitable for bandwidth-constrained mobile networks, defined end-to-end quality of service mechanisms for time-sensitive media, and established security protocols for protecting multimedia content. This addressed limitations of earlier approaches where video services were either unavailable or implemented through non-standardized methods that suffered from poor quality, limited device compatibility, and security vulnerabilities.
Historically, the introduction of AV in Release 6 coincided with the deployment of 3G networks that offered sufficient bandwidth for video services. This enabled new revenue-generating services for operators including video calling, mobile TV, and video streaming. The standardization ensured that AV services could scale across different network generations while maintaining backward compatibility, supporting the evolution from circuit-switched video telephony to packet-based multimedia services in later releases.
Key Features
- Support for synchronized audio and video streams with lip-sync requirements
- Multiple codec support including adaptive bitrate streaming capabilities
- End-to-end QoS management with dedicated bearers for media traffic
- Integrated security through SRTP encryption and key management protocols
- Support for multiple delivery methods: unicast, multicast, and broadcast
- Dynamic adaptation to network conditions and device capabilities
Evolution Across Releases
Initial AV framework introduced with support for conversational video services over IMS, including basic video telephony. Defined fundamental codecs (H.263, MPEG-4 Visual), transport protocols (RTP/RTCP over IP), and QoS mechanisms. Established security baseline with 3GPP authentication and basic media protection for circuit-switched and packet-switched video services.
Defining Specifications
| Specification | Title |
|---|---|
| TS 22.827 | 3GPP TS 22.827 |
| TS 24.109 | 3GPP TS 24.109 |
| TS 26.922 | 3GPP TS 26.922 |
| TS 26.923 | 3GPP TS 26.923 |
| TS 29.866 | 3GPP TS 29.866 |
| TS 33.102 | 3GPP TR 33.102 |
| TS 33.203 | 3GPP TR 33.203 |
| TS 33.401 | 3GPP TR 33.401 |
| TS 33.501 | 3GPP TR 33.501 |
| TS 33.503 | 3GPP TR 33.503 |
| TS 33.514 | 3GPP TR 33.514 |
| TS 33.812 | 3GPP TR 33.812 |
| TS 33.835 | 3GPP TR 33.835 |
| TS 33.859 | 3GPP TR 33.859 |
| TS 33.924 | 3GPP TR 33.924 |
| TS 35.234 | 3GPP TR 35.234 |
| TS 35.937 | 3GPP TR 35.937 |