PCM

Pulse Code Modulation

Services
Introduced in Rel-5
A standard method for digitally representing analog audio signals, specifically speech, using ITU-T G.711 A-law or μ-law coding at 64 kbit/s. It forms the foundational digital format for voice in traditional telephony networks (PSTN, PLMN) and is used as a reference codec and interconnection format in 3GPP systems.

Description

Pulse Code Modulation (PCM) in the 3GPP context specifically refers to the digital representation of speech signals as defined by ITU-T Recommendation G.711. This involves a three-step process: sampling, quantization, and encoding. First, the continuous analog voice signal is sampled at a rate of 8000 samples per second (8 kHz), following the Nyquist theorem for voice bandwidth up to 4 kHz. Each sample is then quantized, meaning its amplitude is mapped to one of a finite number of levels. G.711 defines two companding laws—A-law (used primarily in Europe and international routes) and μ-law (used primarily in North America and Japan)—that non-uniformly quantize the signal, providing more precision for lower amplitude signals (which are more common in speech) and less for higher amplitudes, improving the signal-to-noise ratio.

The quantized sample is then encoded into an 8-bit digital word (or codeword), resulting in a constant bit rate of 64 kbit/s (8000 samples/s * 8 bits/sample). This 64 kbit/s PCM stream is the fundamental building block of digital circuit-switched telephony, known as a DS0 timeslot in T1/E1 systems. Within 3GPP specifications, PCM is not typically used as the over-the-air codec for radio transmission due to its high bit rate inefficiency. Instead, more advanced codecs like AMR (Adaptive Multi-Rate) are used for radio interface optimization. However, PCM plays several crucial architectural roles.

In the core network, especially in the Circuit-Switched (CS) domain, PCM is the standard format for interconnection between network elements like Mobile Switching Centers (MSCs) and between the PLMN and the Public Switched Telephone Network (PSTN). Media Gateways (MGWs) within the network often transcode between various voice codecs (like AMR from the radio) and the standardized PCM format for transport over TDM (Time-Division Multiplexing) backbone networks. Furthermore, PCM serves as a common reference point for voice quality testing and benchmarking. Many 3GPP performance specifications (e.g., TS 26.132 on speech quality) use PCM as an input or output reference when defining testing methodologies for other codecs, ensuring quality comparisons are made against this well-understood standard.

Its role extends to service testing and interworking. For example, when testing voice call continuity or codec negotiation, the PCM format is often used as a baseline. The extensive list of 3GPP specifications referencing PCM, covering areas from vocabulary (21.905) to service requirements (22-series) and codec specifications (26-series), highlights its pervasive nature as the underlying digital voice representation that the entire cellular voice ecosystem was built upon and continues to interwork with, even as networks evolve towards VoLTE and VoNR which use IP-based transport but may still use G.711 for certain legacy interconnections or recording systems.

Purpose & Motivation

PCM, and specifically G.711, exists as the foundational digital voice coding standard that enabled the global transition from analog to digital telephony. Its primary purpose was to provide a high-quality, standardized method for converting analog voice signals into a digital format suitable for transmission and switching over digital networks. The problem it solved was the need for a robust, predictable, and interoperable digital voice format that could become the universal "currency" for voice in core network trunks and at network boundaries.

Historically, before digitalization, telephone networks were entirely analog, suffering from noise accumulation, signal degradation over long distances, and inefficient use of transmission infrastructure. The introduction of PCM with the G.711 standard in 1972 created a universal digital format. This allowed for the development of digital switches (like MSCs) and transmission systems (like T1/E1 lines), which were more reliable, easier to maintain, and enabled time-division multiplexing to carry multiple calls on a single physical line. The choice of 64 kbit/s was a pragmatic balance between quality (excellent for telephony, often considered 'toll quality') and the digital hierarchy standards of the time.

Within the 3GPP ecosystem, the purpose of specifying PCM is not to define a new codec but to ensure seamless interworking with the global telephony infrastructure. When GSM was developed, it used more efficient codecs like Full Rate (FR) for the radio link, but the core network and interconnection to other networks (PSTN, other PLMNs) relied on the ubiquitous 64 kbit/s PCM standard. This allowed cellular networks to plug directly into the existing worldwide telephony grid. Even as 3GPP has evolved to define many more efficient and advanced speech and audio codecs (e.g., AMR, AMR-WB, EVS), PCM remains critical as a fixed reference point for quality testing, a mandated fallback or interconnection format in certain scenarios, and the format understood by virtually all legacy network equipment and recording/legal intercept systems, ensuring backward compatibility and regulatory compliance.

Key Features

  • Standardized by ITU-T G.711, using A-law or μ-law companding.
  • Fixed bit rate of 64 kbit/s (8 kHz sampling, 8 bits/sample).
  • Provides high-quality, 'toll-quality' speech representation.
  • Serves as the primary digital format for circuit-switched core network trunks (TDM).
  • Acts as a universal reference and interconnection format between different networks (e.g., PLMN to PSTN).
  • Used as a baseline for voice quality testing and benchmarking of other codecs in 3GPP.

Evolution Across Releases

Rel-5 Initial

PCM (G.711) was formally referenced in 3GPP specifications from the early releases, but its inclusion in the vocabulary (TS 21.905) and numerous technical specifications around Rel-5 solidified its role as the fundamental digital voice format for interworking and core network transport. The initial architecture assumed PCM as the standard format for the Circuit-Switched (CS) core network, with Media Gateways performing transcoding between over-the-air codecs (like AMR) and the PCM-based TDM network for interconnection and transport.

Defining Specifications

SpecificationTitle
TS 21.905 3GPP TS 21.905
TS 22.401 3GPP TS 22.401
TS 22.944 3GPP TS 22.944
TS 26.071 3GPP TS 26.071
TS 26.102 3GPP TS 26.102
TS 26.114 3GPP TS 26.114
TS 26.115 3GPP TS 26.115
TS 26.118 3GPP TS 26.118
TS 26.131 3GPP TS 26.131
TS 26.132 3GPP TS 26.132
TS 26.171 3GPP TS 26.171
TS 26.202 3GPP TS 26.202
TS 26.226 3GPP TS 26.226
TS 26.230 3GPP TS 26.230
TS 26.231 3GPP TS 26.231
TS 26.267 3GPP TS 26.267
TS 26.268 3GPP TS 26.268
TS 26.269 3GPP TS 26.269
TS 26.448 3GPP TS 26.448
TS 26.806 3GPP TS 26.806
TS 26.818 3GPP TS 26.818
TS 26.854 3GPP TS 26.854
TS 26.969 3GPP TS 26.969
TS 26.975 3GPP TS 26.975
TS 26.978 3GPP TS 26.978
TS 28.062 3GPP TS 28.062
TS 43.050 3GPP TR 43.050
TS 43.901 3GPP TR 43.901
TS 46.002 3GPP TR 46.002
TS 46.008 3GPP TR 46.008
TS 46.051 3GPP TR 46.051
TS 46.055 3GPP TR 46.055
TS 46.085 3GPP TR 46.085
TS 48.103 3GPP TR 48.103