Description
The Structural Similarity Index Measure (SSIM) is a full-reference image and video quality assessment algorithm. Unlike simple error summation methods like Mean Squared Error (MSE) or Peak Signal-to-Noise Ratio (PSNR), which calculate absolute pixel differences, SSIM aims to model perceived visual quality. It is based on the hypothesis that the human visual system is highly adapted to extract structural information from a scene. Therefore, SSIM measures the perceived change in structural information, luminance, and contrast between a reference (original, undistorted) image and a distorted (e.g., compressed, transmitted) image.
The SSIM index is calculated for local windows (typically 8x8 or 11x11 pixel blocks) of the image. For each window, three comparison functions are computed: luminance comparison, contrast comparison, and structure comparison. Luminance is estimated as the mean pixel intensity, contrast as the standard deviation, and structure as the normalized pixel values (by subtracting the mean and dividing by the standard deviation). These three components are combined multiplicatively to produce a local SSIM index map. The overall SSIM score for the entire image is usually the mean of this local SSIM map, resulting in a value between -1 and 1, where 1 indicates perfect similarity to the reference.
Within 3GPP, SSIM is standardized as an objective quality measurement tool, primarily within the Technical Specification Group Services and System Aspects (SA4) which deals with codecs and media delivery. Specifications such as 3GPP TS 26.812 (Performance metrics for streaming services) and TS 26.955 (QoE metrics for VR services) reference SSIM as a key metric. It is used to evaluate the performance of video codecs (like AVC/H.264, HEVC/H.265, and VVC/H.266), adaptive bitrate streaming algorithms, and error resilience mechanisms under various network conditions (packet loss, jitter, bandwidth constraints).
For network and service optimization, SSIM provides a more accurate correlation with subjective human ratings (Mean Opinion Score - MOS) than PSNR, especially for compression artifacts typical in modern video codecs. In Media Function (e.g., in 5G Media Streaming architecture) or at the client-side video player, SSIM can be calculated (if a reference is available) to monitor Quality of Experience (QoE) in real-time. This data can be fed back to the network or content server to trigger adaptive actions, such as switching to a different bitrate representation or applying error concealment, to maintain a high perceived quality for the end-user. Its adoption reflects the shift from network-centric metrics (throughput, delay) to user-centric perceptual quality metrics in multimedia service delivery.
Purpose & Motivation
SSIM was developed in the early 2000s to address the inadequacy of traditional pixel-based error metrics like PSNR for evaluating visual quality. PSNR often correlates poorly with human subjective opinions, especially for modern compression techniques that introduce structured artifacts (e.g., blocking, blurring, ringing) which are more or less annoying than the random noise assumed by PSNR. There was a clear need for an objective metric that could automatically predict subjective quality without requiring expensive and time-consuming human trials.
3GPP's standardization of SSIM and other perceptual metrics was motivated by the explosive growth of mobile video traffic and the need to efficiently utilize scarce radio resources while ensuring a satisfactory user experience. Operators and service providers needed reliable, automated ways to tune video encoding parameters, design adaptive streaming logic, and benchmark different codecs. Using PSNR alone could lead to suboptimal decisions that either wasted bandwidth on imperceptible quality improvements or, conversely, degraded quality in ways highly noticeable to users.
By incorporating SSIM into its technical specifications, 3GPP provided a standardized, vendor-neutral tool for objective quality assessment. This enables fair comparison of different vendor implementations, aids in conformance testing for codec profiles, and supports the development of QoE-aware network management systems. For services like 5G-enhanced Mobile Broadband (eMBB) and immersive media (VR/AR), where visual fidelity is paramount, SSIM helps ensure that the complex trade-offs between bandwidth, latency, and compression are managed in a way that prioritizes the end-user's perceptual experience.
Classification
Detected Changes Across Releases
from 3GPP Change RequestsSpecific changes extracted from the „Change history“ tables of 3GPP specifications (1 CRs across 1 releases). Complements the general historical overview above with the evidence-based evolution of this function.
Studied in Rel-12, normative work from Rel-18.
In Release 18, the SSIM function was refined through an editorial correction to its specified file structure. This adjustment ensures clarity and consistency in how SSIM, as an image similarity metric for comparing reprojected and rendered frames, is integrated within the broader framework for measuring XR QoE metrics like reprojection errors. The change supports the standardized measurement procedures defined for evaluating immersive media quality.
- Editorial Correction to SSIM file structure TS 31.105CR0003
Explore further
Broader topics and technologies where SSIM plays a role.
Defining Specifications
3GPP specifications that define or reference SSIM, with the latest known release. Sourced from the 3GPP document catalog — see methodology.
| Specification | Title | Release |
|---|---|---|
| TR 26.812 vi10 | Technical Report | Rel-18 |
| TR 26.926 vj00 | Traffic Models & Quality Evaluation for Media/XR in 5G | Rel-19 |
| TR 26.938 vj00 | DASH Deployment Guidelines for 3GPP Networks | Rel-19 |
| TR 26.955 vj00 | Video Codec Analysis for 5G Services | Rel-19 |
| TS 31.105 vj10 | Slice Subscriber Identity Module (SSIM) Application | Rel-19 |