Publication:

Exploring Human Perception-Aligned Perceptual Hashing

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0001-9530-3466
cris.virtual.orcid0000-0002-0660-3190
cris.virtual.orcid0000-0001-5313-4158
cris.virtualsource.department932d8640-1cb6-404a-8aad-f92227775c6e
cris.virtualsource.department817f2cb4-7e19-4453-9161-77eae0680f92
cris.virtualsource.departmentd9dcf0ec-40cb-4b14-9140-ec6762bb40e4
cris.virtualsource.orcid932d8640-1cb6-404a-8aad-f92227775c6e
cris.virtualsource.orcid817f2cb4-7e19-4453-9161-77eae0680f92
cris.virtualsource.orcidd9dcf0ec-40cb-4b14-9140-ec6762bb40e4
dc.contributor.authorDe Geest Jelle
dc.contributor.authorDe Smet, Patrick
dc.contributor.authorBonetto, Lucio
dc.contributor.authorLambert, Peter
dc.contributor.authorVan Wallendael, Glenn
dc.contributor.authorMareen, Hannes
dc.date.accessioned2026-04-15T10:22:18Z
dc.date.available2026-04-15T10:22:18Z
dc.date.createdwos2025-12-16
dc.date.issued2026
dc.description.abstractThe widespread sharing of images has led to challenges in controlling the spread of harmful content in consumer devices, particularly child sexual abuse material. Perceptual hashing offers a solution by enabling the fast detection of blacklisted images through compact representations of visual content. However, automatically detecting (near-)duplicate images in overwhelming volumes of data is challenging due to the limitations of traditional perceptual hashing methods. For example, existing methods can fail to detect images with minor modifications, specifically spatial modifications. In addition, they were often designed to find images derived from the same original image, and hence are incapable of recognizing visually similar images that originate from a different acquisition origin. This study explores the use of Vision Transformers (ViTs), specifically the contrastive language–image pretraining model, to enhance perceptual hashing, better aligning with human perception. The proposed ViTHash method is compared against traditional perceptual hashing methods, such as pHash, dHash, and PDQHash. Quantitative results show that ViTHash outperforms traditional methods in handling spatial distortions, such as rotation and mirroring, although it is less robust to visual quality distortions, such as blurring and compression. Qualitative analysis reveals that ViTHash aligns more closely with human perception and is capable of identifying visually similar images, even when they are images depicting visually similar content yet originate from different acquisition origins. These findings demonstrate that ViTHash offers significant potential for applications requiring nuanced image similarity assessments, providing a valuable tool to enhance the detection of illicit content in consumer electronics devices and support law enforcement efforts.
dc.description.wosFundingTextThis work was supported in part by the Research Foundation-Flanders (FWO), in part by the IDLab (Ghent University-imec), in part by the Flanders Innovation & Entrepreneurship (VLAIO), in part by the Flemish Government, and in part by the NICC's participation, European Union (Belgian Internal Security Fund project) under Grant ISF-084-108. The code is available online at https://github. com/JelleDeGeest/ViTHash.
dc.identifier.doi10.1109/mce.2025.3551813
dc.identifier.issn2162-2248
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/59099
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.source.beginpage41
dc.source.endpage54
dc.source.issue1
dc.source.journalIEEE CONSUMER ELECTRONICS MAGAZINE
dc.source.numberofpages14
dc.source.volume15
dc.title

Exploring Human Perception-Aligned Perceptual Hashing

dc.typeJournal article
dspace.entity.typePublication
imec.internal.crawledAt2026-04-07
imec.internal.sourcecrawler
imec.internal.wosCreatedAt2026-04-07
Files

Original bundle

Name:
DS996_acc.pdf
Size:
14.17 MB
Format:
Adobe Portable Document Format
Description:
Accepted
Name:
DS996.pdf
Size:
2.76 MB
Format:
Adobe Portable Document Format
Description:
Published
Publication available in collections: