Exploring Human Perception-Aligned Perceptual Hashing

De Geest Jelle; De Smet, Patrick; Bonetto, Lucio; Lambert, Peter; Van Wallendael, Glenn; Mareen, Hannes

doi:10.1109/mce.2025.3551813

Simple item page Full metadata Statistics

cris.virtual.department	#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department	#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department	#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid	0000-0001-9530-3466
cris.virtual.orcid	0000-0002-0660-3190
cris.virtual.orcid	0000-0001-5313-4158
cris.virtualsource.department	932d8640-1cb6-404a-8aad-f92227775c6e
cris.virtualsource.department	817f2cb4-7e19-4453-9161-77eae0680f92
cris.virtualsource.department	d9dcf0ec-40cb-4b14-9140-ec6762bb40e4
cris.virtualsource.orcid	932d8640-1cb6-404a-8aad-f92227775c6e
cris.virtualsource.orcid	817f2cb4-7e19-4453-9161-77eae0680f92
cris.virtualsource.orcid	d9dcf0ec-40cb-4b14-9140-ec6762bb40e4
dc.contributor.author	De Geest Jelle
dc.contributor.author	De Smet, Patrick
dc.contributor.author	Bonetto, Lucio
dc.contributor.author	Lambert, Peter
dc.contributor.author	Van Wallendael, Glenn
dc.contributor.author	Mareen, Hannes
dc.date.accessioned	2026-04-15T10:22:18Z
dc.date.available	2026-04-15T10:22:18Z
dc.date.createdwos	2025-12-16
dc.date.issued	2026
dc.description.abstract	The widespread sharing of images has led to challenges in controlling the spread of harmful content in consumer devices, particularly child sexual abuse material. Perceptual hashing offers a solution by enabling the fast detection of blacklisted images through compact representations of visual content. However, automatically detecting (near-)duplicate images in overwhelming volumes of data is challenging due to the limitations of traditional perceptual hashing methods. For example, existing methods can fail to detect images with minor modifications, specifically spatial modifications. In addition, they were often designed to find images derived from the same original image, and hence are incapable of recognizing visually similar images that originate from a different acquisition origin. This study explores the use of Vision Transformers (ViTs), specifically the contrastive language–image pretraining model, to enhance perceptual hashing, better aligning with human perception. The proposed ViTHash method is compared against traditional perceptual hashing methods, such as pHash, dHash, and PDQHash. Quantitative results show that ViTHash outperforms traditional methods in handling spatial distortions, such as rotation and mirroring, although it is less robust to visual quality distortions, such as blurring and compression. Qualitative analysis reveals that ViTHash aligns more closely with human perception and is capable of identifying visually similar images, even when they are images depicting visually similar content yet originate from different acquisition origins. These findings demonstrate that ViTHash offers significant potential for applications requiring nuanced image similarity assessments, providing a valuable tool to enhance the detection of illicit content in consumer electronics devices and support law enforcement efforts.
dc.description.wosFundingText	This work was supported in part by the Research Foundation-Flanders (FWO), in part by the IDLab (Ghent University-imec), in part by the Flanders Innovation & Entrepreneurship (VLAIO), in part by the Flemish Government, and in part by the NICC's participation, European Union (Belgian Internal Security Fund project) under Grant ISF-084-108. The code is available online at https://github. com/JelleDeGeest/ViTHash.
dc.identifier.doi	10.1109/mce.2025.3551813
dc.identifier.issn	2162-2248
dc.identifier.uri	https://imec-publications.be/handle/20.500.12860/59099
dc.language.iso	eng
dc.provenance.editstepuser	greet.vanhoof@imec.be
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.source.beginpage	41
dc.source.endpage	54
dc.source.issue	1
dc.source.journal	IEEE CONSUMER ELECTRONICS MAGAZINE
dc.source.numberofpages	14
dc.source.volume	15
dc.title	Exploring Human Perception-Aligned Perceptual Hashing
dc.type	Journal article
dspace.entity.type	Publication
imec.internal.crawledAt	2026-04-07
imec.internal.source	crawler
imec.internal.wosCreatedAt	2026-04-07
Files	Original bundle Name: DS996_acc.pdf Size: 14.17 MB Format: Adobe Portable Document Format Description: Accepted Download Name: DS996.pdf Size: 2.76 MB Format: Adobe Portable Document Format Description: Published Download
Publication available in collections:	Articles

Exploring Human Perception-Aligned Perceptual Hashing

Date