VOIP

Voice over IP

Services
Introduced in Rel-8
Voice over IP (VoIP) is a technology enabling voice communication and multimedia sessions over Internet Protocol (IP) networks, such as the Internet or private data networks. In 3GPP, it refers to the standardized delivery of voice services over packet-switched networks, forming the foundation for services like VoLTE and VoNR, ensuring carrier-grade quality and interoperability.

Description

Voice over IP (VoIP) within the 3GPP framework is the standardized methodology for transmitting voice communications as data packets over IP-based networks, specifically the Evolved Packet Core (EPC) and 5G Core (5GC). It represents a fundamental shift from traditional circuit-switched telephony (like 2G/3G CS voice) to an all-IP, packet-switched architecture. The technology leverages the IP Multimedia Subsystem (IMS) as the core service delivery platform. IMS provides the necessary control functions for session establishment, management, and teardown using protocols like SIP (Session Initiation Protocol). The actual voice data is encoded using codecs such as AMR (Adaptive Multi-Rate) or EVS (Enhanced Voice Services), packetized into Real-time Transport Protocol (RTP) streams, and transported over the underlying IP bearer network.

Architecturally, VoIP service delivery involves several key network functions. The User Equipment (UE) must support IMS client functionality. The call signaling traverses the P-CSCF (Proxy-Call Session Control Function), S-CSCF (Serving-CSCF), and other IMS nodes for routing and service logic. Media resources, if needed for conferencing or transcoding, are handled by a Media Resource Function (MRF). The core network (EPC/5GC) provides the guaranteed QoS-enabled bearers, typically dedicated EPS bearers or QoS Flows with appropriate QoS Class Identifiers (QCIs/5QIs) for real-time conversational voice, ensuring low latency and packet loss.

VoIP's operation is tightly integrated with policy control. The Policy and Charging Rules Function (PCRF) in EPC or the Policy Control Function (PCF) in 5GC interacts with the IMS via the Rx interface. Upon session initiation, the IMS informs the policy controller of the required media parameters (codec, bandwidth, ports). The policy controller then provisions the corresponding rules in the Packet Gateway (PGW) or User Plane Function (UPF) to establish a dedicated bearer with the correct QoS profile, ensuring the voice packets receive priority treatment across the radio and transport networks. This end-to-end QoS management is critical for achieving carrier-grade voice quality, matching or exceeding that of legacy circuit-switched networks.

Purpose & Motivation

VoIP was introduced to address the limitations and inefficiencies of circuit-switched (CS) voice networks in the era of broadband data. Traditional CS voice dedicated an entire 64 kbps timeslot for the duration of a call, which was resource-inefficient and could not be easily integrated with emerging IP-based data services. The primary motivation was network convergence—creating a single, unified IP infrastructure capable of delivering all services (voice, video, data), thereby simplifying network architecture, reducing operational costs, and enabling seamless service integration.

Historically, 2G (GSM) and 3G (UMTS) networks relied on a separate circuit-switched core network for voice. With the development of high-speed packet access (HSPA) and the move towards LTE (a pure packet-switched radio technology), a new solution for voice was imperative. VoIP over IMS (commmercialized as VoLTE) was standardized to provide a future-proof, high-quality voice service native to the packet network. It solved the problem of the 'voice gap' in LTE and enabled advanced features like simultaneous voice and high-speed data (SVLTE was a workaround, not a solution), HD voice, and rich communication services (RCS) integration.

Furthermore, VoIP standardization ensured interoperability across different operators and device vendors, preventing fragmentation. It established a clear evolution path from CS voice to IP voice, which has been fully realized in 5G as Voice over New Radio (VoNR), where VoIP is the sole native voice solution. This transition supports network simplification, as the legacy CS core can be retired, and enables voice to be treated as just another data application, albeit with stringent QoS requirements, within a fully software-defined and cloud-native core network.

Key Features

  • Packet-switched voice transmission using RTP/UDP/IP protocols
  • Reliance on IMS for session control and service delivery
  • Mandatory support for end-to-end QoS provisioning via policy control (PCRF/PCF)
  • Utilization of efficient voice codecs (e.g., AMR, AMR-WB, EVS) for high quality and bandwidth adaptation
  • Support for emergency calls (eT911/eT112) and lawful interception over IP
  • Enabler for service innovation like HD voice, video calling, and Rich Communication Services (RCS)

Evolution Across Releases

Rel-8 Initial

Initial standardization of VoIP as the foundation for IMS-based voice in LTE (VoLTE). Defined the fundamental architecture integrating IMS with the EPC, specified the Rx interface for QoS policy interaction, and mandated support for the AMR codec. Established the baseline for SRVCC (Single Radio Voice Call Continuity) to handover calls to legacy 2G/3G CS networks.

Enhanced IMS emergency call support for VoIP, crucial for regulatory compliance. Introduced improvements to SRVCC procedures for reduced call setup time and handover interruption. Defined location services support for VoIP emergency calls.

Specified enhancements for robust header compression (ROHC) for VoIP to improve radio efficiency. Introduced TTI bundling for LTE coverage extension to improve VoIP performance at cell edges. Further refined SRVCC and added support for emergency call SRVCC (eSRVCC).

Standardized the Enhanced Voice Services (EVS) codec for superior audio quality and robustness. Introduced IMS Centralized Services (ICS) enhancements for better service consistency. Defined further optimizations for battery life during VoIP calls (e.g., discontinuous reception DRX configurations).

Focused on VoLTE enhancements for commercial deployments, including detailed performance requirements and test specifications. Introduced support for Wi-Fi calling (VoWiFi or eVoLTE) via trusted and untrusted non-3GPP access, seamlessly integrating with IMS.

Enhanced EVS codec with new operational modes and improved efficiency. Specified Voice over LTE via Generic Access (VoLGA) was not pursued; focus remained on native VoLTE and VoWiFi. Introduced further optimizations for mission-critical push-to-talk (MCPTT) over LTE, which uses VoIP as a base.

Continued enhancements for mission-critical services, including group communication. Improved support for high-quality audio with wider bandwidths in EVS. Worked on enabling richer communication experiences as part of the RCS evolution.

Defined the framework for Voice over New Radio (VoNR) in 5G Standalone (SA) architecture. Ensured service continuity between VoNR, VoLTE, and legacy networks. Integrated VoIP/IMS services with the new 5G Core (5GC) and its service-based architecture.

Enhanced 5G voice with support for 5G media streaming, integration with edge computing (MEC) for ultra-low latency services, and improved support for industrial IoT communications that may include voice. Strengthened QoS framework for 5G voice.

Introduced support for extended reality (XR) services which may include conversational voice components. Enhanced audio codec performance for immersive communications. Further optimizations for energy efficiency and coverage for 5G voice services.

Exploring AI/ML enhancements for voice quality optimization and predictive QoS. Further integration of voice services with network slicing to create dedicated voice slices. Work on advanced acoustic features for next-generation voice experiences.

Ongoing evolution expected to focus on convergence of communication services (voice, video, messaging, XR) into unified experiences, deeper integration with AI for network and service management, and enhancements for non-terrestrial network (NTN) support for global voice coverage.

Defining Specifications

SpecificationTitle
TS 26.935 3GPP TS 26.935