HOA3

Higher-Order Ambisonics (3rd order)

Other
Introduced in Rel-18
A 3D audio format for immersive media services, standardized for delivery over 5G systems. It enables spatial audio experiences by representing a sound field with higher resolution than first-order Ambisonics, crucial for virtual and augmented reality applications. Its inclusion in 3GPP standards facilitates efficient streaming of immersive audio content.

Description

Higher-Order Ambisonics (HOA) is a full-sphere surround sound technique that represents a three-dimensional sound field using spherical harmonics. HOA3 specifically refers to the third-order representation, which provides a higher spatial resolution and more accurate sound source localization compared to first-order Ambisonics (FOA). In the 3GPP context, HOA3 is standardized as an audio codec and format for Media Streaming Services, enabling the delivery of immersive audio experiences over 5G networks, particularly for Extended Reality (XR) applications like Virtual Reality (VR) and Augmented Reality (AR). The technical specifications define the bitstream format, encapsulation, and synchronization mechanisms for streaming HOA3 audio, ensuring interoperability between content creation tools, streaming servers, and client devices.

Architecturally, HOA3 audio is processed and encoded by a media server, often as part of a Multimedia Broadcast/Multicast Service (MBMS) or unicast streaming service. The encoded bitstream is packetized according to 3GPP-defined formats (e.g., in MPEG-4 or similar containers) and transported over the 5G core and radio access network. At the receiver, a client application or media player decodes the HOA3 bitstream. The decoding process involves reconstructing the spherical harmonic coefficients that describe the sound pressure field around a point in space. These coefficients are then rendered for playback through headphones or a speaker array, using binaural rendering techniques for headphones or loudspeaker decoding matrices for multi-channel setups, creating the illusion of sound coming from specific directions in 3D space.

The role of HOA3 in the network is as a service enabler for advanced media. It is a key component of the 5G Media Streaming (5GMS) architecture and the 5G Immersive Media services framework. The network must provide sufficient bandwidth and low latency to stream the HOA3 data, which has a higher bitrate than conventional stereo or first-order spatial audio. Quality of Service (QoS) mechanisms may be applied to prioritize these streams. The specification work ensures that the audio format is agnostic to the underlying transport, allowing it to work with both broadcast/multicast and on-demand streaming models defined by 3GPP.

Purpose & Motivation

The primary purpose of standardizing HOA3 within 3GPP is to support the delivery of high-quality, immersive audio experiences over mobile networks, which is a fundamental requirement for compelling Extended Reality (XR) applications. Prior to its inclusion, XR services often relied on simpler audio formats like stereo or first-order Ambisonics, which could not provide the precise spatial audio cues necessary for true immersion and user presence in virtual environments. The limitations of these earlier approaches included poor externalization of sound (sounds seeming inside the head) and inaccurate localization of sound sources above, below, and behind the listener. HOA3 addresses these limitations by providing a higher-fidelity representation of the sound field.

The motivation for its creation in Rel-18 stems from the industry push towards metaverse and immersive media services as key 5G use cases. Standardization ensures interoperability across different content creators, network providers, and device manufacturers, preventing market fragmentation. By defining HOA3 in specifications like TS 26.260 (Codec for immersive speech and audio services), 3GPP provides a clear, royalty-free (where applicable) technical baseline. This allows the mobile ecosystem to efficiently stream complex 3D audio without relying on proprietary, non-interoperable formats, thereby accelerating the adoption of XR services over 5G networks.

Key Features

  • Third-order spherical harmonic representation for high spatial resolution
  • Standardized bitstream format for interoperable streaming over 5G
  • Support for full 3D audio (periphonic sound), including elevation
  • Efficient compression for network transmission
  • Synchronization mechanisms with video for XR applications
  • Compatibility with 5G Media Streaming (5GMS) and MBMS architectures

Evolution Across Releases

Rel-18 Initial

Initial introduction and standardization of the HOA3 format within the 3GPP ecosystem. Specifications defined the audio codec, bitstream format, and encapsulation for streaming services, establishing it as a component for 5G immersive media. It was integrated into the media delivery framework alongside existing video and audio codecs.

Defining Specifications

SpecificationTitle
TS 26.260 3GPP TS 26.260
TS 26.933 3GPP TS 26.933
TS 26.996 3GPP TS 26.996
TS 26.997 3GPP TS 26.997