Publication:
Exploring Human Perception-Aligned Perceptual Hashing
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.orcid | 0000-0001-9530-3466 | |
| cris.virtual.orcid | 0000-0002-0660-3190 | |
| cris.virtual.orcid | 0000-0001-5313-4158 | |
| cris.virtualsource.department | 932d8640-1cb6-404a-8aad-f92227775c6e | |
| cris.virtualsource.department | 817f2cb4-7e19-4453-9161-77eae0680f92 | |
| cris.virtualsource.department | d9dcf0ec-40cb-4b14-9140-ec6762bb40e4 | |
| cris.virtualsource.orcid | 932d8640-1cb6-404a-8aad-f92227775c6e | |
| cris.virtualsource.orcid | 817f2cb4-7e19-4453-9161-77eae0680f92 | |
| cris.virtualsource.orcid | d9dcf0ec-40cb-4b14-9140-ec6762bb40e4 | |
| dc.contributor.author | De Geest Jelle | |
| dc.contributor.author | De Smet, Patrick | |
| dc.contributor.author | Bonetto, Lucio | |
| dc.contributor.author | Lambert, Peter | |
| dc.contributor.author | Van Wallendael, Glenn | |
| dc.contributor.author | Mareen, Hannes | |
| dc.date.accessioned | 2026-04-15T10:22:18Z | |
| dc.date.available | 2026-04-15T10:22:18Z | |
| dc.date.createdwos | 2025-12-16 | |
| dc.date.issued | 2026 | |
| dc.description.abstract | The widespread sharing of images has led to challenges in controlling the spread of harmful content in consumer devices, particularly child sexual abuse material. Perceptual hashing offers a solution by enabling the fast detection of blacklisted images through compact representations of visual content. However, automatically detecting (near-)duplicate images in overwhelming volumes of data is challenging due to the limitations of traditional perceptual hashing methods. For example, existing methods can fail to detect images with minor modifications, specifically spatial modifications. In addition, they were often designed to find images derived from the same original image, and hence are incapable of recognizing visually similar images that originate from a different acquisition origin. This study explores the use of Vision Transformers (ViTs), specifically the contrastive language–image pretraining model, to enhance perceptual hashing, better aligning with human perception. The proposed ViTHash method is compared against traditional perceptual hashing methods, such as pHash, dHash, and PDQHash. Quantitative results show that ViTHash outperforms traditional methods in handling spatial distortions, such as rotation and mirroring, although it is less robust to visual quality distortions, such as blurring and compression. Qualitative analysis reveals that ViTHash aligns more closely with human perception and is capable of identifying visually similar images, even when they are images depicting visually similar content yet originate from different acquisition origins. These findings demonstrate that ViTHash offers significant potential for applications requiring nuanced image similarity assessments, providing a valuable tool to enhance the detection of illicit content in consumer electronics devices and support law enforcement efforts. | |
| dc.description.wosFundingText | This work was supported in part by the Research Foundation-Flanders (FWO), in part by the IDLab (Ghent University-imec), in part by the Flanders Innovation & Entrepreneurship (VLAIO), in part by the Flemish Government, and in part by the NICC's participation, European Union (Belgian Internal Security Fund project) under Grant ISF-084-108. The code is available online at https://github. com/JelleDeGeest/ViTHash. | |
| dc.identifier.doi | 10.1109/mce.2025.3551813 | |
| dc.identifier.issn | 2162-2248 | |
| dc.identifier.uri | https://imec-publications.be/handle/20.500.12860/59099 | |
| dc.language.iso | eng | |
| dc.provenance.editstepuser | greet.vanhoof@imec.be | |
| dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | |
| dc.source.beginpage | 41 | |
| dc.source.endpage | 54 | |
| dc.source.issue | 1 | |
| dc.source.journal | IEEE CONSUMER ELECTRONICS MAGAZINE | |
| dc.source.numberofpages | 14 | |
| dc.source.volume | 15 | |
| dc.title | Exploring Human Perception-Aligned Perceptual Hashing | |
| dc.type | Journal article | |
| dspace.entity.type | Publication | |
| imec.internal.crawledAt | 2026-04-07 | |
| imec.internal.source | crawler | |
| imec.internal.wosCreatedAt | 2026-04-07 | |
| Files | ||
| Publication available in collections: |