MASA2

MASA with 2 TC (stereo-MASA)

Services
Introduced in Rel-18
A MASA profile where the core audio comprises two transport channels (typically forming a stereo pair). The metadata provides additional spatial information that enhances or repositions elements beyond the basic left-right stereo image, enabling more accurate and immersive soundscapes than conventional stereo.

Description

MASA2, known as stereo-MASA, is a profile within the MASA standard that uses two audio transport channels (TCs) as its core audio payload. These channels typically form a standard stereo signal (Left and Right). The accompanying MASA metadata does not replace this stereo information but augments it. The metadata can describe the scene in a way that allows the renderer to understand the original intended positions of sound sources, which may be different from the simple panning implied by the stereo waveform. For example, a sound panned to the center in the stereo mix could be metadata-tagged as originating from behind the listener. The renderer then uses this metadata to process the two-channel signal, applying corrections and enhancements to recreate a more accurate three-dimensional sound field suitable for the listener's playback environment. This process can involve upmixing, cross-talk cancellation, and personalized HRTF processing. The architecture efficiently builds upon familiar stereo delivery while adding a layer of spatial intelligence, making it a practical upgrade path for existing stereo music and broadcast services.

Purpose & Motivation

MASA2 was created to bridge the gap between ubiquitous stereo content and next-generation immersive audio. Billions of existing audio assets are in stereo format, and direct broadcasting/streaming infrastructure is optimized for two-channel audio. The purpose of MASA2 is to enhance these assets with spatial capabilities without requiring a full re-authoring into a complex object-based format. It addresses the limitation of stereo, which confines sound to a one-dimensional left-right panorama, by allowing content creators to embed intended spatial intent via metadata. This enables backward compatibility (the two channels play as normal stereo on non-MASA devices) while offering a premium, immersive experience on MASA-capable receivers. It is particularly motivated by music streaming, live sports broadcasting, and cinematic content where a rich, enveloping soundstage is desired.

Key Features

  • Core audio consists of two transport channels (stereo base)
  • Metadata provides enhancement and repositioning data beyond the stereo image
  • Maintains backward compatibility with legacy stereo playback systems
  • Enables upmixing and 3D rendering from a standard stereo source
  • Ideal for enhancing music, film, and broadcast content
  • Provides a balanced trade-off between audio quality, bitrate efficiency, and immersion

Evolution Across Releases

Rel-18 Initial

Standardized concurrently with MASA1 in Rel-18. The specification defined the profile for two transport channels, outlining how stereo audio signals are associated with spatial metadata and the specific rendering techniques used to create an immersive presentation from a stereo base.

Defining Specifications

SpecificationTitle
TS 26.253 3GPP TS 26.253