Description
Object-Based Audio (OBA) is a paradigm shift in audio representation and delivery, standardized by 3GPP for media services. Unlike traditional channel-based audio (e.g., stereo or 5.1 surround), which encodes sound for fixed speaker positions, OBA decomposes an audio scene into discrete 'objects.' Each object is an audio signal (e.g., a dialogue track, a sound effect, or ambient music) accompanied by rich, time-variant metadata. This metadata describes the object's spatial position (coordinates in a 3D space), gain, and other perceptual attributes, allowing for dynamic rendering. The architecture involves a content creation stage where audio objects and metadata are authored, a delivery stage where they are efficiently encoded and transported (often using codecs like MPEG-H 3D Audio), and a client-side rendering stage. The renderer, based on the metadata and the capabilities of the playback device (from headphones to complex speaker arrays), synthesizes the final audio output in real-time. This decoupling of content from the presentation format is fundamental. In the network context, 3GPP specifications define how OBA services are delivered over mobile networks, including signaling, media formats, and quality of service considerations to ensure synchronized delivery of audio objects and their metadata for a seamless experience. Its role is to provide a future-proof audio foundation for immersive media, enabling features that are impossible with fixed-channel audio.
Purpose & Motivation
OBA was created to address the limitations of channel-based audio in the face of evolving media consumption. Traditional audio mixes are 'baked' for a specific speaker configuration, offering no flexibility for different listening environments (e.g., headphones vs. a soundbar), user preferences, or accessibility needs. The rise of virtual reality (VR), augmented reality (AR), and interactive media demanded audio that could adapt dynamically to user head movements and interactivity. OBA solves this by providing a flexible, scene-description-based approach. It allows for personalized audio, such as adjusting dialogue volume independently of background music, or enabling audio description tracks to be seamlessly integrated. From a network and service provider perspective, it also offers efficiency; a single OBA stream can be adapted to many output devices, reducing the need to store and transmit multiple channel-based versions. Its introduction in 3GPP Rel-14 was motivated by the industry's move towards immersive media standards and the need for telecom networks to support next-generation audio services as part of enhanced Multimedia Broadcast/Multicast Service (eMBMS) and streaming offerings.
Detected Changes Across Releases
from 3GPP Change RequestsSpecific changes extracted from the „Change history“ tables of 3GPP specifications (1 CRs across 1 releases). Complements the general historical overview above with the evidence-based evolution of this function.
Studied in Rel-14, normative work from Rel-15.
In Release 15, the Object-Based Audio (OBA) function was introduced as part of the new Immersive Voice and Audio Services (IVAS) codec suite. This included the specification for OBA metadata files, which are comma-separated value (CSV) files containing mandatory azimuth and elevation parameters to describe audio objects. The system also defined combined audio formats, such as OBA with Metadata-Assisted Spatial Audio (MASA), and enabled the renderer to support up to four discrete audio object inputs alongside other spatial audio types.
- Findings and Conclusions from study on 3GPP codecs for VR audio TS 26.918CR0003
Explore further
Broader topics and technologies where OBA plays a role.
Defining Specifications
3GPP specifications that define or reference OBA, with the latest known release. Sourced from the 3GPP document catalog — see methodology.
| Specification | Title | Release |
|---|---|---|
| TS 26.258 vj10 | IVAS Codec Floating-Point C Code Specification | Rel-19 |
| TR 26.918 vj00 | Virtual Reality Relevance Study for 3GPP | Rel-19 |
| TR 26.997 vj00 | IVAS Codec Specification | Rel-19 |