SEG

Speech Expert Group

Other
Introduced in Rel-8
A 3GPP working group responsible for standardizing speech and audio codecs, quality assessment methods, and related end-to-end performance aspects for mobile communication systems. It ensures high-quality voice services across 3GPP technologies by defining codecs like AMR, AMR-WB, and EVS.

Description

The Speech Expert Group (SEG) is a specialized technical working group within the 3GPP organizational structure, operating under the Technical Specification Group Services and System Aspects (TSG SA). Its primary mandate is the standardization of speech and audio codecs, as well as the methodologies for testing and assessing their quality and performance in mobile networks. The group's work is critical for ensuring interoperable, high-quality voice and audio services across all 3GPP-defined systems, from GSM and UMTS to LTE and 5G. The SEG does not develop the core codec algorithms itself but rather selects, profiles, and standardizes codecs developed by external bodies (like ITU-T, ETSI, or 3GPP member companies) for use within the 3GPP ecosystem.

The SEG's activities encompass several key areas. First, it specifies the codec bitrates, frame structures, and operational modes that must be supported by terminals and network equipment for mandatory codecs like the Adaptive Multi-Rate (AMR) codec for narrowband speech and the AMR-Wideband (AMR-WB) codec for HD Voice. Second, it defines the detailed protocol for codec negotiation and mode adaptation during call setup and within a call (in-band signaling), which is essential for robust operation over variable radio conditions. Third, the group develops and maintains test specifications for codec performance, including objective and subjective quality measurement methodologies (e.g., using PESQ or POLQA algorithms) and conformance testing procedures to ensure interoperability between different vendors' implementations.

How the SEG works involves a consensus-based standardization process. The group reviews contributions from member companies, evaluates candidate codecs through rigorous testing campaigns (often involving multiple independent labs), and creates the technical specifications that define the codec's 3GPP profile. These specifications cover the codec's interface to the radio access and core network protocols, its adaptation for circuit-switched and packet-switched domains (e.g., in VoLTE and VoNR), and its integration with services like voice over IP and emergency calls. The SEG also addresses emerging audio needs, such as music delivery, multi-channel audio, and immersive audio experiences, expanding its scope beyond traditional telephony speech. Its outputs are integral specifications that ensure a user in a GSM network can have a high-quality voice call with a user on a 5G VoNR connection, using the most appropriate codec negotiated seamlessly between the devices and networks.

Purpose & Motivation

The SEG was formed to address the fundamental requirement for standardized, high-quality digital voice in mobile networks. In the early days of digital cellular (GSM), proprietary codecs would have led to interoperability failures and poor user experience. The group's purpose is to provide a single, agreed-upon set of speech codec standards that all network operators and device manufacturers implement, guaranteeing that voice calls work reliably and sound good across different networks, countries, and device types. It solves the problem of fragmentation and ensures a consistent baseline for voice quality.

Historically, as networks evolved from circuit-switched to packet-switched voice (VoIP), the SEG's role expanded to adapt speech codecs for IP transport, manage packet loss concealment, and define quality of service requirements for voice packets. The motivation for creating and maintaining the SEG includes the continuous drive for improved voice quality (e.g., moving from narrowband to wideband and fullband audio), increased bandwidth efficiency (better quality at lower bitrates), and support for new services like voice over LTE (VoLTE) and voice over NR (VoNR). The group ensures backward compatibility while enabling forward-looking enhancements, balancing quality, complexity, and interoperability in every new release of the 3GPP standards.

Key Features

  • Standardization and selection of mandatory and optional speech/audio codecs for 3GPP systems
  • Definition of codec negotiation and mode adaptation procedures for circuit and packet-switched domains
  • Development of test specifications for codec performance and conformance
  • Specification of objective and subjective speech quality assessment methodologies
  • Profiling of external codecs (e.g., from ITU-T) for 3GPP network and device implementation
  • Evolution of codec standards to support HD Voice, super-wideband, and fullband audio

Evolution Across Releases

Rel-8 Initial

Formally established the Speech Expert Group (SEG) as a dedicated working group within 3GPP SA4 (Codec). Its initial focus in Rel-8 included the ongoing maintenance of the AMR and AMR-WB codecs for GSM and UMTS, and it began critical work on standardizing the Enhanced Voice Services (EVS) codec for future high-quality voice services over LTE and beyond. It also addressed codec requirements for the emerging IP Multimedia Subsystem (IMS)-based voice services.

Defining Specifications

SpecificationTitle
TS 29.204 3GPP TS 29.204
TS 32.843 3GPP TR 32.843
TS 33.102 3GPP TR 33.102
TS 33.141 3GPP TR 33.141
TS 33.180 3GPP TR 33.180
TS 33.203 3GPP TR 33.203
TS 33.210 3GPP TR 33.210
TS 33.303 3GPP TR 33.303
TS 33.310 3GPP TR 33.310
TS 33.401 3GPP TR 33.401
TS 33.501 3GPP TR 33.501
TS 33.838 3GPP TR 33.838
TS 46.055 3GPP TR 46.055
TS 46.085 3GPP TR 46.085