Description
Source Controlled Rate (SCR) operation is a fundamental feature of speech codecs used in mobile communication systems, most notably the Adaptive Multi-Rate (AMR) and AMR-Wideband codecs. It is a codec mode of operation where the encoding rate is dynamically controlled based on the presence or absence of active speech from the source (the speaker). During active speech segments, the codec operates at its full, designated bit rate to maintain high voice quality. However, during periods of silence or background noise—which statistically constitute over 50% of a typical conversation—the codec switches to a significantly lower bit rate mode. In this low-rate mode, the encoder does not transmit full speech frames but instead transmits Silence Descriptor (SID) frames at periodic intervals. These SID frames contain parameters that allow the receiver's decoder to generate comfort noise, which is artificial background noise that mimics the acoustic characteristics of the caller's environment, preventing an unnerving 'dead silence' for the listener.
Architecturally, SCR involves close coordination between the voice activity detector (VAD) and the speech codec within the mobile terminal or network transcoder. The VAD analyzes the input audio signal to classify each frame as containing active speech or not. This classification is passed to the codec, which then selects the appropriate encoding algorithm and frame type. The system also includes a comfort noise generator (CNG) at the receiving end, which uses the parameters in the SID frames to synthesize noise. The operation is managed by in-band signaling within the speech frame structure itself, indicating whether a frame is a speech frame, a SID frame, or a marker for the end of a silence period.
SCR's role in the network is primarily one of efficiency. By drastically reducing the average bit rate of a voice call, it decreases the load on the radio interface. This translates directly into increased system capacity, as more users can be supported on the same radio resources. For the user, it provides the significant benefit of extended battery life in the mobile terminal, as the transmitter can be powered down or operate at a lower power level during silence periods. It is a core component of the traffic adaptation and power saving mechanisms defined in 3GPP specifications, making voice services more spectrally efficient and user-friendly.
Purpose & Motivation
SCR was developed to address the inherent inefficiency of transmitting constant bit rate audio during a voice conversation, where significant portions consist of silence or low-activity noise. Early digital voice systems transmitted a continuous stream of bits regardless of speech activity, wasting precious radio spectrum and draining mobile phone batteries unnecessarily. The motivation for SCR was twofold: to increase the capacity of cellular networks—a critical commercial driver—and to improve the talk time of handsets, a key user experience factor.
The technology solves the problem of resource waste during speech pauses. By introducing a discontinuous transmission (DTX) mechanism controlled by the source signal itself, it allows the network to re-allocate the freed-up radio resources (timeslots, codes, or bandwidth) to other users. This was particularly important for GSM and its evolution, where spectrum was a limited and expensive commodity. SCR, combined with the AMR codec's adaptability, became a cornerstone for providing high-quality voice service while optimizing the use of the entire system. It represented a shift from treating voice as a constant bit-rate stream to treating it as a variable-rate service, adapting to the actual information content.
Classification
Evolution Across Releases
Source Controlled Rate operation was standardized as a core feature of the Adaptive Multi-Rate (AMR) speech codec. The initial architecture defined the Voice Activity Detector (VAD), algorithms for generating Silence Descriptor (SID) frames, and the comfort noise generation procedures, establishing the framework for efficient discontinuous transmission in GSM and UMTS voice services.
Explore further
Broader topics and technologies where SCR plays a role.
Defining Specifications
3GPP specifications that define or reference SCR, with the latest known release. Sourced from the 3GPP document catalog — see methodology.
| Specification | Title | Release |
|---|---|---|
| TS 26.071 vj00 | AMR Speech Codec Introduction | Rel-19 |
| TS 26.091 vj00 | AMR Error Concealment Procedure | Rel-19 |
| TS 26.092 vj00 | AMR Comfort Noise for SCR Operation | Rel-19 |
| TS 26.093 vj00 | SCR operation of AMR codec for UMTS | Rel-19 |
| TS 26.101 vj00 | Generic frame format for AMR and GSM-EFR speech codecs | Rel-19 |
| TS 26.102 vj00 | Mapping of AMR and other codecs to interfaces | Rel-19 |
| TS 26.103 vj00 | 3GPP Codec Lists for OoBTC and TrFO | Rel-19 |
| TS 26.171 vj00 | Introduction to AMR-WB Speech Processing | Rel-19 |
| TS 26.191 vj00 | AMR-WB Error Concealment Procedure | Rel-19 |
| TS 26.192 vj00 | AMR-WB Comfort Noise Requirements | Rel-19 |
| TS 26.193 vj00 | AMR-WB Source Controlled Rate (SCR) Operation | Rel-19 |
| TS 26.201 vj00 | AMR-WB Speech Codec Frame Format | Rel-19 |
| TS 26.202 vj00 | AMR-WB Speech Codec Mapping Specification | Rel-19 |
| TS 26.454 vj00 | EVS Codec Mapping for 3G CS Networks | Rel-19 |
| TS 28.062 vj00 | Tandem Free Operation (TFO) Service Description | Rel-19 |