Description
Hand-held Speakerphone (HHS) mode is a defined operational state for User Equipment (UE), standardized by 3GPP to ensure consistent audio performance when the device is used as a speakerphone. In this mode, the UE is held in the user's hand (not worn on the head), and audio is played through the device's main loudspeaker while the primary microphone(s) capture the user's voice. This contrasts with the traditional 'handset' mode where the audio transducer is held against the ear, and the 'hands-free' mode which typically implies the use of an external accessory like a Bluetooth headset or a car kit. The HHS mode is essentially an on-device, acoustic hands-free solution.
Technically, enabling HHS mode triggers specific signal processing algorithms within the UE's audio subsystem. The key challenge is managing acoustic echo and improving voice pickup in a potentially noisy environment. Since the loudspeaker and microphone are in close proximity on the same device, the microphone picks up the output from the loudspeaker, creating a loud acoustic echo for the far-end listener. Therefore, robust Acoustic Echo Cancellation (AEC) is a mandatory component. Furthermore, the microphone is not positioned close to the mouth, so beamforming, noise suppression, and gain control algorithms are employed to enhance the near-end speech signal and suppress background noise and the speaker's own output from the loudspeaker.
The 3GPP specifications, particularly TS 26.931, define performance requirements for HHS mode. These include minimum requirements for loudspeaker volume, microphone sensitivity, and the effectiveness of echo cancellation and noise reduction. For example, the standard defines metrics like Terminal Coupling Loss (TCL) which measures how much the echo is attenuated by the echo canceller. It also specifies test scenarios and background noise conditions to ensure devices provide acceptable and interoperable audio quality. The implementation involves close coordination between the modem's voice codec (e.g., AMR, EVS) and the application processor's audio digital signal processing (DSP) chain.
From a network perspective, the UE signals its capabilities related to HHS mode during call setup or via other capability exchange mechanisms. The network itself is largely agnostic to the mode; the voice stream is still encoded and packetized normally. However, the use of HHS mode can influence the user's experience of voice quality, as the acoustic environment plays a larger role. The standardization ensures that regardless of the device manufacturer, when a user activates speakerphone mode, a baseline level of audio clarity and echo-free performance is maintained, which is critical for service interoperability and user satisfaction.
Purpose & Motivation
The HHS feature was standardized to address the growing use of mobile devices as speakerphones, a mode that was becoming increasingly common but was previously implemented in an ad-hoc, vendor-specific manner. Without standardization, the audio quality—particularly the level of echo cancellation and background noise handling—could vary drastically between devices, leading to poor user experiences and complaints during calls. The purpose was to define a minimum performance bar for this fundamental usage scenario.
It solves several practical problems. Firstly, it mitigates the severe acoustic echo that occurs when a device's loudspeaker output is picked up by its own microphone, which can render a call unusable. By mandating effective AEC, it ensures call clarity. Secondly, it improves speech intelligibility in typical speakerphone use cases (e.g., in a room or car) where the microphone is far from the mouth and ambient noise is present. Standardized noise suppression helps with this. Finally, it enables safe usage while driving, as many regions allow hands-free device use, and the built-in speakerphone is the most accessible form of hands-free operation.
Its creation was motivated by the need for service quality consistency. As mobile phones evolved from purely voice devices to multimedia tools, the speakerphone function became a core utility for group conversations, multitasking, and in-vehicle use. 3GPP's standardization ensures that this utility works reliably across the ecosystem, supporting both basic telephony and richer communication services like Voice over LTE (VoLTE) and Voice over NR (VoNR), where high-definition voice codecs are used, making good acoustic performance even more critical.
Key Features
- Defines acoustic and audio processing requirements for on-device speakerphone use
- Mandates robust Acoustic Echo Cancellation (AEC) to prevent far-end echo
- Specifies noise suppression and microphone gain control for distant speech pickup
- Includes performance metrics like Terminal Coupling Loss (TCL) for echo cancellation
- Ensures interoperability and consistent user experience across different UE models
- Supports safe hands-free operation, particularly relevant for in-car use cases
Evolution Across Releases
Introduced the formal definition and performance requirements for the Hand-held Speakerphone (HHS) mode within the 3GPP specifications, particularly in TS 26.931. This established it as a standardized service feature for voice communication, defining test methods and minimum performance criteria for acoustic echo cancellation, noise suppression, and overall audio quality in this specific usage scenario, ensuring consistency for VoLTE and subsequent voice services.
Defining Specifications
| Specification | Title |
|---|---|
| TS 26.931 | 3GPP TS 26.931 |