IVAS

Immersive Voice and Audio Services

Services →
Introduced in Rel-16 Also in: Core Network

IVAS is a 3GPP-standardized media codec and service framework for delivering immersive, multi-channel spatial audio experiences over mobile networks to enable realistic soundscapes for calls, music, and extended reality applications.

Category
Services
Introduced
Rel-16
Where
Services › Codecs
Also touches
1 segments
Specifications
31 specs
IVAS Description Purpose Related Classification Detected Changes Specifications

Description

Immersive Voice and Audio Services (IVAS) is a comprehensive 3GPP standard introduced in Release 16 that defines a new media codec and an associated service framework designed to deliver high-quality, immersive audio experiences over 5G and evolved packet core networks. At its core is the IVAS codec, a highly efficient and flexible audio codec capable of encoding not just traditional stereo or mono signals, but also complex spatial audio scenes comprising multiple audio channels (e.g., 5.1, 7.1.4) and discrete audio objects with associated metadata (like position, size, and gain). This allows for the rendering of sound in a three-dimensional space around the listener.

Architecturally, IVAS integrates into the 3GPP Multimedia Telephony Service for IMS (MTSI) and other media streaming frameworks. It operates within the media plane of the IP Multimedia Subsystem (IMS). Key components include the IVAS encoder, which compresses the immersive audio scene; the IVAS decoder, which reconstructs it; and the IVAS renderer, which uses head-related transfer functions (HRTF) and playback system information to correctly spatialize the audio for the listener's specific setup (headphones, speaker arrays). The service framework, detailed in specs like 26.114 and 26.119, defines session negotiation procedures using Session Description Protocol (SDP) to establish IVAS-capable media sessions, including support for dynamic switching between codec modes based on network conditions.

How it works: During a call or streaming session, endpoints negotiate IVAS support. The capturing device (e.g., a 360-degree microphone array or an XR headset) captures a spatial audio scene. The IVAS encoder compresses this scene, efficiently representing ambient channels and moving audio objects. This bitstream is packetized and transmitted over the 5G network, benefiting from ultra-reliable low-latency communication (URLLC) for real-time applications. The receiving device's IVAS decoder reconstructs the scene, and the renderer adapts it in real-time based on the listener's head orientation (using head-tracking data) to maintain a fixed sound field, creating a compelling sense of presence. Its role is to be the enabling audio technology for telepresence, social XR, and immersive entertainment.

Purpose & Motivation

IVAS was created to address the limitations of traditional voice and audio codecs (like AMR, EVS) in the emerging era of extended reality (XR), telepresence, and immersive media. Legacy codecs were designed for mono or stereo playback, incapable of conveying the spatial cues necessary for realistic virtual environments or group communication where understanding who is speaking and from where is critical. The motivation was to define a single, efficient standard for all immersive audio use cases, avoiding fragmentation.

The historical context is the evolution of 5G, which promises enhanced mobile broadband (eMBB), massive IoT, and URLLC. While 5G provides the pipe, IVAS provides the next-generation audio content that justifies the need for high bandwidth and low latency. It solves the problem of delivering cinema-quality, object-based audio over wireless networks for applications like multi-player VR gaming, remote collaboration in virtual spaces, and immersive live music streaming. Prior approaches required proprietary codecs or bulky, uncompressed multi-channel audio, which were inefficient and not interoperable.

Furthermore, IVAS enables new service paradigms like 'Augmented Reality Telephony,' where remote participants can be represented as spatial audio objects in the user's environment. It addresses the need for a codec that is both high-quality for music and low-bitrate for conversational speech, with seamless switching between modes. Its creation was motivated by industry convergence from telecom, broadcasting, and consumer electronics to establish a universal immersive audio standard for 5G.

Classification

Part ofMTSI
Specific typesEVS

Detected Changes Across Releases

from 3GPP Change Requests

Specific changes extracted from the „Change history“ tables of 3GPP specifications (38 CRs across 4 releases). Complements the general historical overview above with the evidence-based evolution of this function.

Rel-15 1 change

In Release 15, the IVAS function introduced the capability for the MRFP to perform a sensitivity calculation specifically for immersive audio playback, correcting its methodology. This enhancement is part of the broader immersive audio services framework, which includes functions like audio conferencing and multimedia recording with synchronized streams. The calculation ensures accurate audio level management during playback operations within the MRFP.

  • Correction of sensitivity calculation for immersive audio playback TS 26.260CR002
Rel-16 1 change

In Release 16, the IVAS function introduced the new "Immersive Voice and Audio Services" codec, detailed in the new specifications TS 26.250 and TS 26.253, and expanded MRFP capabilities to include synchronized multimedia play and record functions, as well as multimedia conferencing with support for audio and video transcoding.

  • Removing H.263 and MPEG-4 Visual from MTSI TS 26.114CR0501
Rel-18 22 changes

In Release 18, the key new developments for Immersive Voice and Audio Services (IVAS) were the formal introduction and integration of the IVAS codec into the 3GPP ecosystem. This included adding support for the codec across specifications, incorporating it into the 3GPP File Format (3GP), and defining objective test methodologies for IVAS-based user equipment. The release also introduced the ISAR track-a split rendering feature and established codec levels, while providing corrections and updated test sequences for the IVAS specifications.

  • Adding support for IVAS codec TS 23.333CR0146
  • Adding support for IVAS codec TS 23.334CR0185
  • Introduction of IVAS codec TS 26.114CR0561
  • Introduction of IVAS into the 3GPP File Format (3GP) TS 26.244CR0066
  • Updated IVAS test sequences TS 26.252CR0001
  • Adding ISAR track-a split rendering feature to TS 26.258 and Corrections to the IVAS C-Code and corresponding specification text TS 26.258CR0002

+ 16 more changes

Rel-19 14 changes

In Release 19, key additions for IVAS included the standardization of the fixed-point codec implementation in TS 26.251, the introduction of new conformance test methods, and the definition of a procedure for IVAS bandwidth computation. The release also provided corrections and clarifications to the rendering specification, SDP examples, and test sequences to improve interoperability and performance characterization.

  • Updates related to availability of TS 26.251 (IVAS fixed-point code) TS 26.114CR0596
  • Updates related to availability of TS 26.251 (IVAS fixed-point code) TS 26.244CR0067
  • Updates related to availability of TS 26.251 (IVAS fixed-point code) TS 26.250CR0003
  • Updates related to IVAS conformance testing of TS 26.258 and TS 26.251 TS 26.252CR0005
  • Inclusion of TS 26.251 (IVAS fixed-point code) TS 26.256CR0002
  • New test methods for immersive UEs TS 26.260CR0008

+ 8 more changes

Explore further

Broader topics and technologies where IVAS plays a role.

Defining Specifications

3GPP specifications that define or reference IVAS, with the latest known release. Sourced from the 3GPP document catalog — see methodology.

SpecificationTitleRelease
TS 23.333 vj00 MRFC-MRFP Mp Interface Requirements Rel-19
TS 23.334 vj00 IMS-ALG to IMS-AGW Interface (Iq) Stage 2 Rel-19
TS 26.114 vj10 IMS Multimedia Telephony Media Handling Rel-19
TS 26.119 vj00 XR Media Capabilities for AR Devices Rel-19
TS 26.244 vj00 3GPP File Format (3GP) Specification Rel-19
TS 26.249 vj00 Immersive Audio Split Rendering (ISAR) Rel-19
TS 26.250 vj00 IVAS Codec Introduction Rel-19
TS 26.251 vj00 IVAS Codec Fixed-Point C Code Specification Rel-19
TS 26.252 vj00 IVAS Codec Test Sequences Specification Rel-19
TS 26.254 vj00 IVAS Rendering Functions Specification Rel-19
TS 26.255 vj00 IVAS Frame Loss Concealment Procedure Rel-19
TS 26.256 vj00 Jitter Buffer Management for IVAS Rel-19
TS 26.258 vj10 IVAS Codec Floating-Point C Code Specification Rel-19
TS 26.260 vj00 Immersive Audio Objective Test Methods Rel-19
TS 26.261 vj00 Electro-acoustic specs for immersive terminals Rel-19
TS 26.511 vj00 5G Media Streaming Profiles, Codecs & Formats Rel-19
TR 26.865 vi00 Technical Report Rel-18
TR 26.926 vj00 Traffic Models & Quality Evaluation for Media/XR in 5G Rel-19
TR 26.928 vj00 Study on eXtended Reality (XR) in 5G Rel-19
TR 26.933 vj00 Study on Diverse Audio Capturing System Rel-19
TR 26.996 vj00 ISAR Split Rendering Audio Characterization Rel-19
TR 26.997 vj00 IVAS Codec Specification Rel-19
TR 26.998 vj00 5G AR/MR Glasses Integration Study Rel-19
TS 29.162 vj00 IMS-IP Network Interworking Rel-19
TS 29.163 vj00 Interworking between 3GPP IM CN and CS networks Rel-19
TS 29.232 vj00 Mc Interface Protocol Profile Rel-19
TS 29.238 vj00 H.248 Profile for IBCF-TrGW Interface Rel-19
TS 29.292 vj00 IMS Centralized Services (ICS) Interworking Rel-19
TS 29.332 vj00 MGCF-IM-MGW Interface Protocol (Mn) Rel-19
TS 29.333 vj00 MRFC-MRFP Mp Interface Protocol Rel-19
TS 29.334 vj00 IMS-ALG to IMS-AGW Interface Protocol Rel-19