MASA

Metadata-Assisted Spatial Audio

Services →
Introduced in Rel-18 Also in: User Equipment

MASA is a 3GPP codec enhancement that uses metadata to describe spatial audio scenes for efficient compression and high-quality rendering on various devices, crucial for immersive applications like extended reality.

Category
Services
Introduced
Rel-18
Where
Services › Codecs
Also touches
1 segments
Specifications
12 specs
MASA Description Purpose Related Classification Detected Changes Specifications

Description

Metadata-Assisted Spatial Audio (MASA) is a standardized framework within 3GPP for representing, encoding, and rendering immersive audio. Unlike traditional channel-based or object-based audio, MASA employs a hybrid approach where the core audio is encoded using a traditional codec (like EVS or 3GPP-DRC), while a separate, compact metadata stream describes the spatial properties of the audio scene. This metadata includes parameters such as direction, distance, and size of sound sources, as well as environmental acoustic properties. The architecture is designed to be codec-agnostic, allowing the spatial metadata to be associated with various underlying audio bitstreams, providing flexibility for service providers. The key components include a metadata encoder, which analyzes the spatial audio scene, and a metadata decoder/renderer on the receiving device, which uses the metadata to reconstruct the immersive sound field appropriate for the listener's specific playback environment, be it headphones, stereo speakers, or a multi-channel home theater system. Its role in the network is as a service enabler, where the audio and metadata streams are packetized, transmitted over 5G networks (leveraging high bandwidth and low latency), and synchronized at the receiver to deliver a cohesive immersive experience. The specification details the syntax and semantics of the metadata, ensuring interoperability between content creation tools and consumer devices.

Purpose & Motivation

MASA was created to address the growing demand for immersive audio experiences, particularly driven by extended reality (XR), 360-degree video, and next-generation broadcasting. Traditional audio codecs were designed for fixed channel configurations (e.g., stereo 5.1) and struggle with the flexibility required for personalized, device-adaptive rendering. Object-based audio formats existed but could be inefficient for transmission over bandwidth-constrained mobile networks due to the high bitrate needed for numerous discrete audio objects. MASA solves this by decoupling the descriptive spatial metadata (which is very low bitrate) from the core audio payload. This allows for efficient network transmission while enabling the receiver to render an optimal sound field tailored to its specific output capabilities and the listener's orientation (in the case of head-tracking). Its creation was motivated by the need for a standardized, network-friendly immersive audio solution within the 3GPP ecosystem to complement advancements in video and XR services over 5G, ensuring a high-quality of experience without prohibitive bandwidth costs.

Classification

Part ofEVS
Specific typesMASA2MASA1

Detected Changes Across Releases

from 3GPP Change Requests

Specific changes extracted from the „Change history“ tables of 3GPP specifications (1 CRs across 1 releases). Complements the general historical overview above with the evidence-based evolution of this function.

Rel-15 1 change

In Release 15, the Metadata-Assisted Spatial Audio (MASA) function was introduced as a new supported input audio format for the IVAS codec, operating within a bit rate range from 13.2 to 512 kbps. This format supports source-controlled rate operation (DTX) up to the maximum 512 kbps. The release also included a correction to the sensitivity calculation specifically for immersive audio playback.

  • Correction of sensitivity calculation for immersive audio playback TS 26.260CR002

Explore further

Broader topics and technologies where MASA plays a role.

Defining Specifications

3GPP specifications that define or reference MASA, with the latest known release. Sourced from the 3GPP document catalog — see methodology.

SpecificationTitleRelease
TS 26.250 vj00 IVAS Codec Introduction Rel-19
TS 26.251 vj00 IVAS Codec Fixed-Point C Code Specification Rel-19
TS 26.253 vj00 IVAS Codec Algorithmic Description Rel-19
TS 26.254 vj00 IVAS Rendering Functions Specification Rel-19
TS 26.255 vj00 IVAS Frame Loss Concealment Procedure Rel-19
TS 26.258 vj10 IVAS Codec Floating-Point C Code Specification Rel-19
TS 26.260 vj00 Immersive Audio Objective Test Methods Rel-19
TS 26.261 vj00 Electro-acoustic specs for immersive terminals Rel-19
TR 26.865 vi00 Technical Report Rel-18
TR 26.933 vj00 Study on Diverse Audio Capturing System Rel-19
TR 26.996 vj00 ISAR Split Rendering Audio Characterization Rel-19
TR 26.997 vj00 IVAS Codec Specification Rel-19