Publication:
Weakly Supervised Phonological Features for Pathological Speech Analysis
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.orcid | 0000-0001-8525-7160 | |
| cris.virtual.orcid | 0000-0001-7193-1863 | |
| cris.virtual.orcid | 0000-0001-5990-722X | |
| cris.virtualsource.department | 7fbfb997-86a7-41a3-abda-68a0f1234b59 | |
| cris.virtualsource.department | 5e2c5e98-499c-4328-96cb-1de847eaa21b | |
| cris.virtualsource.department | 092fe92a-3bc6-46c1-82b1-cb40096e8470 | |
| cris.virtualsource.orcid | 7fbfb997-86a7-41a3-abda-68a0f1234b59 | |
| cris.virtualsource.orcid | 5e2c5e98-499c-4328-96cb-1de847eaa21b | |
| cris.virtualsource.orcid | 092fe92a-3bc6-46c1-82b1-cb40096e8470 | |
| dc.contributor.author | Thienpondt, Jenthe | |
| dc.contributor.author | Vanderreydt, Geoffroy | |
| dc.contributor.author | Hammami, Abdessalem | |
| dc.contributor.author | Demuynck, Kris | |
| dc.date.accessioned | 2026-04-13T10:21:56Z | |
| dc.date.available | 2026-04-13T10:21:56Z | |
| dc.date.createdwos | 2026-02-04 | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Paralinguistic properties of speech are essential in analyzing and choosing optimal treatment options for patients with speech disorders. However, automatic modeling of these characteristics is difficult due to the lack of labeled speech datasets describing paralinguistic properties, especially at the frame-level. In this paper, we propose a weakly supervised training method which exploits the known acoustic properties of phonemes by training an ASR model with an interpretable frame-level phonological feature bottleneck layer. Subsequently, we assess the viability of these phonological features in speech pathology analysis by developing corresponding models for intelligibility prediction and speech pathology classification. Models using our proposed phonological features perform similar to other state-of-the-art acoustic features on both tasks with a classification accuracy of 75% and a 8.43 RMSE on speech intelligibility prediction. In contrast to others, our phonological features are text-independent and highly interpretable, providing potentially useful insights for speech therapists. | |
| dc.description.wosFundingText | Supported by Research Foundation Flanders (FWO) grant S004923N and EU Horizon 2020 programme TAPAS under Marie Curie grant 766287. | |
| dc.identifier.doi | 10.1109/icassp49660.2025.10888038 | |
| dc.identifier.isbn | 979-8-3503-6875-8 | |
| dc.identifier.issn | 1520-6149 | |
| dc.identifier.uri | https://imec-publications.be/handle/20.500.12860/59059 | |
| dc.language.iso | eng | |
| dc.provenance.editstepuser | greet.vanhoof@imec.be | |
| dc.publisher | IEEE | |
| dc.source.beginpage | 1 | |
| dc.source.conference | 2025 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP | |
| dc.source.conferencedate | 2025-04-06 | |
| dc.source.conferencelocation | Hyderabad, India | |
| dc.source.endpage | 5 | |
| dc.source.journal | 2025 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP | |
| dc.source.numberofpages | 5 | |
| dc.subject.keywords | INTELLIGIBILITY | |
| dc.title | Weakly Supervised Phonological Features for Pathological Speech Analysis | |
| dc.type | Proceedings paper | |
| dspace.entity.type | Publication | |
| imec.internal.crawledAt | 2026-04-07 | |
| imec.internal.source | crawler | |
| imec.internal.wosCreatedAt | 2026-04-07 | |
| Files | Original bundle
| |
| Publication available in collections: |