Description
The Keyword Dictionary Identifier (KD-ID) is a parameter defined in 3GPP TS 23.042, which specifies the compression protocols for Short Message Service (SMS). It is a numerical identifier that points to a specific "Keyword Dictionary" – a standardized look-up table containing a list of predefined words or phrases. This mechanism is part of the SMS compression architecture designed to reduce the number of bits required to transmit a text message, thereby increasing network capacity and enabling longer messages within the constraints of the 140-byte SMS transport layer. Architecturally, the KD-ID is carried within the Protocol Data Unit (PDU) of a compressed SMS, specifically in the Compression Indication Header.
How it works involves both the sender and receiver (either the mobile station or the SMS Service Centre) having access to the same dictionary referenced by the KD-ID. When a user composes a message containing phrases present in the dictionary, the messaging application or the network's compression function can replace the full text string with a short reference code (an index into the dictionary) along with the KD-ID. For example, instead of sending the bytes for "Thank you for your message," the system sends the KD-ID for the "English Common Phrases" dictionary and an index number like 15. The receiving entity uses the KD-ID to select the correct dictionary from its local storage and uses the index to reconstruct the original phrase.
Key components include the Keyword Dictionaries themselves, which are standardized or operator-specific lists, and the compression/decompression engines in the UE and the SMS-SC. The KD-ID's role is critical for interoperability; it ensures that the receiver uses the exact same dictionary as the sender for successful decompression. Dictionaries can be pre-loaded on SIM/USIM cards, embedded in device firmware, or potentially downloaded over-the-air. The system supports multiple dictionaries (e.g., for different languages or specialized domains like banking), each with its unique KD-ID. This functionality is primarily leveraged for common phrases in business-to-consumer messaging (e.g., delivery notifications, one-time passwords) or for overcoming language barriers in roaming scenarios by using a universal dictionary of travel-related terms.
Purpose & Motivation
KD-ID and the associated keyword dictionary compression were developed to overcome the inherent limitations of the original SMS specification, which was based on the 7-bit GSM default alphabet and limited to 160 characters per message. As SMS usage exploded, there was a commercial and technical drive to fit more information into a single message segment to reduce costs, improve user experience for longer messages, and conserve signaling bandwidth on the radio interface. Traditional Huffman-based text compression is less effective for very short messages, but replacing common phrases with short codes offers significant gains.
The creation of this standard was motivated by the need for a smart, semantic-level compression technique that could achieve high compression ratios for repetitive content typical in automated or transactional messaging. It solved the problem of sending identical phrases millions of times across a network, such as "Your balance is" or "Your code is," by turning them into tiny references. This was particularly valuable for mobile network operators and service providers sending high volumes of similar notifications.
Furthermore, it addressed the challenge of international roaming and multi-lingual support. A traveler could receive a compressed message with a KD-ID for a "Universal Travel Dictionary," and their phone could decompress it into their native language if their local dictionary had the same indices mapped to translated phrases. This provided a path for language-independent messaging services. While its practical widespread deployment has been limited by the rise of rich messaging apps, the KD-ID framework remains a standardized, efficient solution for specific machine-to-machine (M2M) and application-to-person (A2P) SMS use cases where bandwidth and cost are critical constraints.
Key Features
- Unique identifier for a specific Keyword Dictionary
- Enables high-ratio compression for predefined text phrases
- Carried within the SMS compression protocol header
- Supports multiple dictionaries for different languages or use cases
- Facilitates language-independent message exchange
- Reduces signaling load and increases effective SMS capacity
Evolution Across Releases
Initially introduced as part of the SMS compression feature in TS 23.042. It defined the basic framework for keyword dictionary compression, including the KD-ID parameter structure, to enable efficient transmission of repetitive text in SMS, enhancing network capacity for the booming text messaging service.
Defining Specifications
| Specification | Title |
|---|---|
| TS 23.042 | 3GPP TS 23.042 |