Description
Immersive Voice and Audio Services (IVAS) is a comprehensive 3GPP standard introduced in Release 16 that defines a new media codec and an associated service framework designed to deliver high-quality, immersive audio experiences over 5G and evolved packet core networks. At its core is the IVAS codec, a highly efficient and flexible audio codec capable of encoding not just traditional stereo or mono signals, but also complex spatial audio scenes comprising multiple audio channels (e.g., 5.1, 7.1.4) and discrete audio objects with associated metadata (like position, size, and gain). This allows for the rendering of sound in a three-dimensional space around the listener.
Architecturally, IVAS integrates into the 3GPP Multimedia Telephony Service for IMS (MTSI) and other media streaming frameworks. It operates within the media plane of the IP Multimedia Subsystem (IMS). Key components include the IVAS encoder, which compresses the immersive audio scene; the IVAS decoder, which reconstructs it; and the IVAS renderer, which uses head-related transfer functions (HRTF) and playback system information to correctly spatialize the audio for the listener's specific setup (headphones, speaker arrays). The service framework, detailed in specs like 26.114 and 26.119, defines session negotiation procedures using Session Description Protocol (SDP) to establish IVAS-capable media sessions, including support for dynamic switching between codec modes based on network conditions.
How it works: During a call or streaming session, endpoints negotiate IVAS support. The capturing device (e.g., a 360-degree microphone array or an XR headset) captures a spatial audio scene. The IVAS encoder compresses this scene, efficiently representing ambient channels and moving audio objects. This bitstream is packetized and transmitted over the 5G network, benefiting from ultra-reliable low-latency communication (URLLC) for real-time applications. The receiving device's IVAS decoder reconstructs the scene, and the renderer adapts it in real-time based on the listener's head orientation (using head-tracking data) to maintain a fixed sound field, creating a compelling sense of presence. Its role is to be the enabling audio technology for telepresence, social XR, and immersive entertainment.
Purpose & Motivation
IVAS was created to address the limitations of traditional voice and audio codecs (like AMR, EVS) in the emerging era of extended reality (XR), telepresence, and immersive media. Legacy codecs were designed for mono or stereo playback, incapable of conveying the spatial cues necessary for realistic virtual environments or group communication where understanding who is speaking and from where is critical. The motivation was to define a single, efficient standard for all immersive audio use cases, avoiding fragmentation.
The historical context is the evolution of 5G, which promises enhanced mobile broadband (eMBB), massive IoT, and URLLC. While 5G provides the pipe, IVAS provides the next-generation audio content that justifies the need for high bandwidth and low latency. It solves the problem of delivering cinema-quality, object-based audio over wireless networks for applications like multi-player VR gaming, remote collaboration in virtual spaces, and immersive live music streaming. Prior approaches required proprietary codecs or bulky, uncompressed multi-channel audio, which were inefficient and not interoperable.
Furthermore, IVAS enables new service paradigms like 'Augmented Reality Telephony,' where remote participants can be represented as spatial audio objects in the user's environment. It addresses the need for a codec that is both high-quality for music and low-bitrate for conversational speech, with seamless switching between modes. Its creation was motivated by industry convergence from telecom, broadcasting, and consumer electronics to establish a universal immersive audio standard for 5G.
Classification
Detected Changes Across Releases
from 3GPP Change RequestsSpecific changes extracted from the „Change history“ tables of 3GPP specifications (38 CRs across 4 releases). Complements the general historical overview above with the evidence-based evolution of this function.
In Release 15, the IVAS function introduced the capability for the MRFP to perform a sensitivity calculation specifically for immersive audio playback, correcting its methodology. This enhancement is part of the broader immersive audio services framework, which includes functions like audio conferencing and multimedia recording with synchronized streams. The calculation ensures accurate audio level management during playback operations within the MRFP.
- Correction of sensitivity calculation for immersive audio playback TS 26.260CR002
In Release 16, the IVAS function introduced the new "Immersive Voice and Audio Services" codec, detailed in the new specifications TS 26.250 and TS 26.253, and expanded MRFP capabilities to include synchronized multimedia play and record functions, as well as multimedia conferencing with support for audio and video transcoding.
- Removing H.263 and MPEG-4 Visual from MTSI TS 26.114CR0501
In Release 18, the key new developments for Immersive Voice and Audio Services (IVAS) were the formal introduction and integration of the IVAS codec into the 3GPP ecosystem. This included adding support for the codec across specifications, incorporating it into the 3GPP File Format (3GP), and defining objective test methodologies for IVAS-based user equipment. The release also introduced the ISAR track-a split rendering feature and established codec levels, while providing corrections and updated test sequences for the IVAS specifications.
- Adding support for IVAS codec TS 23.333CR0146
- Adding support for IVAS codec TS 23.334CR0185
- Introduction of IVAS codec TS 26.114CR0561
- Introduction of IVAS into the 3GPP File Format (3GP) TS 26.244CR0066
- Updated IVAS test sequences TS 26.252CR0001
- Adding ISAR track-a split rendering feature to TS 26.258 and Corrections to the IVAS C-Code and corresponding specification text TS 26.258CR0002
+ 16 more changes
In Release 19, key additions for IVAS included the standardization of the fixed-point codec implementation in TS 26.251, the introduction of new conformance test methods, and the definition of a procedure for IVAS bandwidth computation. The release also provided corrections and clarifications to the rendering specification, SDP examples, and test sequences to improve interoperability and performance characterization.
- Updates related to availability of TS 26.251 (IVAS fixed-point code) TS 26.114CR0596
- Updates related to availability of TS 26.251 (IVAS fixed-point code) TS 26.244CR0067
- Updates related to availability of TS 26.251 (IVAS fixed-point code) TS 26.250CR0003
- Updates related to IVAS conformance testing of TS 26.258 and TS 26.251 TS 26.252CR0005
- Inclusion of TS 26.251 (IVAS fixed-point code) TS 26.256CR0002
- New test methods for immersive UEs TS 26.260CR0008
+ 8 more changes
Explore further
Broader topics and technologies where IVAS plays a role.
Defining Specifications
3GPP specifications that define or reference IVAS, with the latest known release. Sourced from the 3GPP document catalog — see methodology.
| Specification | Title | Release |
|---|---|---|
| TS 23.333 vj00 | MRFC-MRFP Mp Interface Requirements | Rel-19 |
| TS 23.334 vj00 | IMS-ALG to IMS-AGW Interface (Iq) Stage 2 | Rel-19 |
| TS 26.114 vj10 | IMS Multimedia Telephony Media Handling | Rel-19 |
| TS 26.119 vj00 | XR Media Capabilities for AR Devices | Rel-19 |
| TS 26.244 vj00 | 3GPP File Format (3GP) Specification | Rel-19 |
| TS 26.249 vj00 | Immersive Audio Split Rendering (ISAR) | Rel-19 |
| TS 26.250 vj00 | IVAS Codec Introduction | Rel-19 |
| TS 26.251 vj00 | IVAS Codec Fixed-Point C Code Specification | Rel-19 |
| TS 26.252 vj00 | IVAS Codec Test Sequences Specification | Rel-19 |
| TS 26.254 vj00 | IVAS Rendering Functions Specification | Rel-19 |
| TS 26.255 vj00 | IVAS Frame Loss Concealment Procedure | Rel-19 |
| TS 26.256 vj00 | Jitter Buffer Management for IVAS | Rel-19 |
| TS 26.258 vj10 | IVAS Codec Floating-Point C Code Specification | Rel-19 |
| TS 26.260 vj00 | Immersive Audio Objective Test Methods | Rel-19 |
| TS 26.261 vj00 | Electro-acoustic specs for immersive terminals | Rel-19 |
| TS 26.511 vj00 | 5G Media Streaming Profiles, Codecs & Formats | Rel-19 |
| TR 26.865 vi00 | Technical Report | Rel-18 |
| TR 26.926 vj00 | Traffic Models & Quality Evaluation for Media/XR in 5G | Rel-19 |
| TR 26.928 vj00 | Study on eXtended Reality (XR) in 5G | Rel-19 |
| TR 26.933 vj00 | Study on Diverse Audio Capturing System | Rel-19 |
| TR 26.996 vj00 | ISAR Split Rendering Audio Characterization | Rel-19 |
| TR 26.997 vj00 | IVAS Codec Specification | Rel-19 |
| TR 26.998 vj00 | 5G AR/MR Glasses Integration Study | Rel-19 |
| TS 29.162 vj00 | IMS-IP Network Interworking | Rel-19 |
| TS 29.163 vj00 | Interworking between 3GPP IM CN and CS networks | Rel-19 |
| TS 29.232 vj00 | Mc Interface Protocol Profile | Rel-19 |
| TS 29.238 vj00 | H.248 Profile for IBCF-TrGW Interface | Rel-19 |
| TS 29.292 vj00 | IMS Centralized Services (ICS) Interworking | Rel-19 |
| TS 29.332 vj00 | MGCF-IM-MGW Interface Protocol (Mn) | Rel-19 |
| TS 29.333 vj00 | MRFC-MRFP Mp Interface Protocol | Rel-19 |
| TS 29.334 vj00 | IMS-ALG to IMS-AGW Interface Protocol | Rel-19 |