Publication:

Weakly Supervised Phonological Features for Pathological Speech Analysis

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0001-8525-7160
cris.virtual.orcid0000-0001-7193-1863
cris.virtual.orcid0000-0001-5990-722X
cris.virtualsource.department7fbfb997-86a7-41a3-abda-68a0f1234b59
cris.virtualsource.department5e2c5e98-499c-4328-96cb-1de847eaa21b
cris.virtualsource.department092fe92a-3bc6-46c1-82b1-cb40096e8470
cris.virtualsource.orcid7fbfb997-86a7-41a3-abda-68a0f1234b59
cris.virtualsource.orcid5e2c5e98-499c-4328-96cb-1de847eaa21b
cris.virtualsource.orcid092fe92a-3bc6-46c1-82b1-cb40096e8470
dc.contributor.authorThienpondt, Jenthe
dc.contributor.authorVanderreydt, Geoffroy
dc.contributor.authorHammami, Abdessalem
dc.contributor.authorDemuynck, Kris
dc.date.accessioned2026-04-13T10:21:56Z
dc.date.available2026-04-13T10:21:56Z
dc.date.createdwos2026-02-04
dc.date.issued2025
dc.description.abstractParalinguistic properties of speech are essential in analyzing and choosing optimal treatment options for patients with speech disorders. However, automatic modeling of these characteristics is difficult due to the lack of labeled speech datasets describing paralinguistic properties, especially at the frame-level. In this paper, we propose a weakly supervised training method which exploits the known acoustic properties of phonemes by training an ASR model with an interpretable frame-level phonological feature bottleneck layer. Subsequently, we assess the viability of these phonological features in speech pathology analysis by developing corresponding models for intelligibility prediction and speech pathology classification. Models using our proposed phonological features perform similar to other state-of-the-art acoustic features on both tasks with a classification accuracy of 75% and a 8.43 RMSE on speech intelligibility prediction. In contrast to others, our phonological features are text-independent and highly interpretable, providing potentially useful insights for speech therapists.
dc.description.wosFundingTextSupported by Research Foundation Flanders (FWO) grant S004923N and EU Horizon 2020 programme TAPAS under Marie Curie grant 766287.
dc.identifier.doi10.1109/icassp49660.2025.10888038
dc.identifier.isbn979-8-3503-6875-8
dc.identifier.issn1520-6149
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/59059
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherIEEE
dc.source.beginpage1
dc.source.conference2025 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP
dc.source.conferencedate2025-04-06
dc.source.conferencelocationHyderabad, India
dc.source.endpage5
dc.source.journal2025 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP
dc.source.numberofpages5
dc.subject.keywordsINTELLIGIBILITY
dc.title

Weakly Supervised Phonological Features for Pathological Speech Analysis

dc.typeProceedings paper
dspace.entity.typePublication
imec.internal.crawledAt2026-04-07
imec.internal.sourcecrawler
imec.internal.wosCreatedAt2026-04-07
Files

Original bundle

Name:
DS946_acc.pdf
Size:
1.13 MB
Format:
Adobe Portable Document Format
Description:
Accepted
Publication available in collections: