Description
Inter Channel Coherence (ICC) is a parameter defined within various 3GPP audio codec specifications, particularly those supporting stereo or multi-channel audio, such as the Extended Adaptive Multi-Rate Wideband (AMR-WB+) and the Enhanced Voice Services (EVS) codec. It quantifies the degree of correlation or similarity between the left and right channels of a stereo audio signal over specific time-frequency intervals. Technically, ICC is often derived as a normalized cross-correlation coefficient, with values ranging typically from 0 (completely uncorrelated or out-of-phase) to 1 (perfectly correlated or identical). In parametric stereo coding techniques, instead of transmitting two full independent channels, the encoder transmits a mono downmix signal along with side information parameters like ICC (and Inter Channel Level Difference, ICLD), which describe the spatial properties.
The parameter works within a perceptual audio coding framework that operates in a frequency domain, such as a Modified Discrete Cosine Transform (MDCT) representation. The encoder analyzes the stereo input signal in time-frequency tiles, calculating the ICC for each tile. This parameter, along with others, is then quantized and sent as part of the encoded bitstream. The decoder uses the received mono signal and the ICC parameter (along with ICLD) to reconstruct a stereo output by applying appropriate upmixing rules. For instance, a high ICC value indicates the channels are similar, so the decoder can generate the stereo image by applying minimal decorrelation, while a low ICC value might trigger specific decorrelation filters to recreate a wider, more diffuse stereo image.
Key components in its operation include the parametric stereo analysis module in the encoder and the synthesis module in the decoder. Its role is fundamental to achieving high compression efficiency for stereo content, such as music, in mobile communication and streaming services. By transmitting only one channel's worth of core audio data plus compact parametric side information, significant bitrate savings are achieved compared to independent channel coding, without a perceptible loss in stereo quality for most listening scenarios. This makes it a cornerstone of bandwidth-efficient high-quality audio delivery in 3GPP systems.
Purpose & Motivation
ICC was developed to address the challenge of transmitting high-fidelity stereo audio over bandwidth-constrained mobile networks. Early mobile audio services, like speech codecs, were mono and inefficient for music. As services evolved to include music streaming and multimedia messaging, there was a need for stereo coding that could operate at low bitrates. Simple joint-stereo techniques like mid-side (M/S) coding provided some gains but were less flexible. Parametric stereo approaches, using parameters like ICC and ICLD, were motivated by the need for greater compression by exploiting the high degree of redundancy and specific spatial relationships between stereo channels.
This technology solves the problem of bitrate inefficiency in naive stereo transmission. Transmitting two independent channels essentially doubles the bitrate requirement. By analyzing and parameterizing the spatial image, parametric stereo can reconstruct a perceptually convincing stereo signal from a mono downmix at a fraction of the bitrate. The creation of standardized ICC parameters within 3GPP codecs like AMR-WB+ and EVS ensured interoperability and consistent high-quality stereo performance across different devices and networks, enabling rich audio services like music telephony and streaming to become viable on 3G, 4G, and 5G networks. It addressed the limitations of previous non-parametric or simpler stereo coding methods which either consumed too much bandwidth or produced inferior stereo quality at very low bitrates.
Key Features
- Quantifies correlation between left and right audio channels in time-frequency domains
- Enables parametric stereo coding for high compression efficiency
- Transmitted as side information alongside a mono downmix core bitstream
- Used in codecs like AMR-WB+ and EVS for music and high-quality voice services
- Allows perceptual reconstruction of stereo image at the decoder
- Values typically range from 0 (uncorrelated) to 1 (fully correlated)
Evolution Across Releases
Introduced as part of the parametric stereo coding framework, likely within the context of the AMR-WB+ codec specification. Initial definition provided the mathematical formulation for calculating ICC and its application in stereo signal reconstruction from a mono downmix at the decoder.
Defining Specifications
| Specification | Title |
|---|---|
| TS 21.111 | 3GPP TS 21.111 |
| TS 21.905 | 3GPP TS 21.905 |
| TS 26.253 | 3GPP TS 26.253 |
| TS 26.405 | 3GPP TS 26.405 |
| TS 31.102 | 3GPP TR 31.102 |
| TS 31.103 | 3GPP TR 31.103 |
| TS 31.121 | 3GPP TR 31.121 |
| TS 31.900 | 3GPP TR 31.900 |