Publication:

Margin-mixup: a method for robust speaker verification in multi-speaker audio

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0001-8525-7160
cris.virtual.orcid0000-0001-9131-3309
cris.virtual.orcid0000-0001-5990-722X
cris.virtualsource.department7fbfb997-86a7-41a3-abda-68a0f1234b59
cris.virtualsource.department3933668b-dfa9-4ff8-be41-336de12aa428
cris.virtualsource.department092fe92a-3bc6-46c1-82b1-cb40096e8470
cris.virtualsource.orcid7fbfb997-86a7-41a3-abda-68a0f1234b59
cris.virtualsource.orcid3933668b-dfa9-4ff8-be41-336de12aa428
cris.virtualsource.orcid092fe92a-3bc6-46c1-82b1-cb40096e8470
dc.contributor.authorThienpondt, Jenthe
dc.contributor.authorMadhu, Nilesh
dc.contributor.authorDemuynck, Kris
dc.date.accessioned2026-03-16T13:18:37Z
dc.date.available2026-03-16T13:18:37Z
dc.date.createdwos2026-02-21
dc.date.issued2023
dc.description.abstractThis paper is concerned with the task of speaker verification on audio with multiple overlapping speakers. Most speaker verification systems are designed with the assumption of a single speaker being present in a given audio segment. However, in a real-world setting this assumption does not always hold. In this paper, we demonstrate that current speaker verification systems are not robust against audio with noticeable speaker overlap. To alleviate this issue, we propose margin-mixup, a simple training strategy that can easily be adopted by existing speaker verification pipelines to make the resulting speaker embeddings robust against multi-speaker audio. In contrast to other methods, margin-mixup requires no alterations to regular speaker verification architectures, while attaining better results. On our multi-speaker test set based on VoxCeleb1, the proposed margin-mixup strategy improves the EER on average with 44.4% relative to our state-of-the-art speaker verification baseline systems.
dc.description.wosFundingTextThis work is supported by the Research Foundation - Flanders (FWO) under grant numbers G081420N and S004923N.
dc.identifier.doi10.1109/icassp49357.2023.10095305
dc.identifier.issn1520-6149
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/58837
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherIEEE
dc.source.beginpage1
dc.source.conferenceIEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2023
dc.source.conferencedate2023-06-04
dc.source.conferencelocationRhodes, Greece
dc.source.endpage5
dc.source.journalIEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2023
dc.source.numberofpages5
dc.title

Margin-mixup: a method for robust speaker verification in multi-speaker audio

dc.typeProceedings paper
dspace.entity.typePublication
imec.internal.crawledAt2026-02-23
imec.internal.sourcecrawler
Files

Original bundle

Name:
DS632_acc.pdf
Size:
302.08 KB
Format:
Adobe Portable Document Format
Description:
Accepted
Publication available in collections: