Publication:

Select for better learning: identifying high-quality training data for a multimodal cyclic transformer

 
dc.contributor.authorZhang, Jingwei
dc.contributor.authorLiu, Zhaoyi
dc.contributor.authorChatzichristos, Christos
dc.contributor.authorMichiels, Sam
dc.contributor.authorVan Paesschen, Wim
dc.contributor.authorHughes, Danny
dc.contributor.authorDe Vos, Maarten
dc.date.accessioned2025-04-01T06:43:18Z
dc.date.available2025-04-01T06:43:18Z
dc.date.issued2025
dc.description.abstractObjective. Tonic–clonic seizures (TCSs), which present a significant risk for sudden unexpected death in epilepsy, require accurate detection to enable effective long-term monitoring. Previous studies have demonstrated the advantages of multimodal seizure detection systems in reliably detecting TCSs over extended periods. However, the effectiveness of these data-driven systems depends heavily on the availability of reliable training data. Approach. To address this need, we propose an innovative data selection method designed to identify high-quality training samples. Our approach evaluates sample quality based on learning difficulty, classifying samples with lower learning difficulty as higher quality. We then introduce a confidence-based method to quantify the proportion of high-quality samples within the dataset. Main results. Experimental results show that our method improves the performance of a state-of-the-art TCS detection model by 11%. Significance. Using this data selection method, we develop a training pipeline that enhances the training process of multimodal seizure detection models.
dc.description.wosFundingTextThe authors would like to thank all the patients who participated in this research. This study was funded by the EIT Health Grant: 21263 SeizeIT2 (Discreet Personalized Epileptic Seizure Detection Device). The research also received support from the following Projects: FWO SB Project 'Supporting the Development of Self-Regulation in Infants: A Promising Strategy in Preventive Mental Health Care' (S003524N), FWO Research Project 'Artificial Intelligence (AI) for Data-Driven Personalized Medicine' (G0C9623N), FWO Research Project 'Deep, Personalized Epileptic Seizure Detection' (G0D8321N), and the Bijzonder Onderzoeksfonds KU Leuven (BOF) project 'Prevalentie van epilepsie en slaapstoornissen in de ziekte van Alzheimer' (C24/18/097). Additionally, this research was funded by the Flemish Government's AI Research Program.
dc.identifier.doi10.1088/1741-2552/adbec0
dc.identifier.issn1741-2560
dc.identifier.pmidMEDLINE:40064111
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/45473
dc.publisherIOP Publishing Ltd
dc.source.beginpage026030
dc.source.issue2
dc.source.journalJOURNAL OF NEURAL ENGINEERING
dc.source.numberofpages11
dc.source.volume22
dc.subject.keywordsDEEP
dc.subject.keywordsEEG
dc.title

Select for better learning: identifying high-quality training data for a multimodal cyclic transformer

dc.typeJournal article
dspace.entity.typePublication
Files

Original bundle

Name:
Zhang+et+al_2025_J._Neural_Eng._10.1088_1741-2552_adbec0.pdf
Size:
1.25 MB
Format:
Adobe Portable Document Format
Description:
Accepted
Publication available in collections: