Publication:

Large Language Models on Race Commentary: Towards Granular Data in Cycling Analytics

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0003-1094-2184
cris.virtualsource.departmentea3f2e62-e271-4a8d-8820-58ca83be4023
cris.virtualsource.orcidea3f2e62-e271-4a8d-8820-58ca83be4023
dc.contributor.authorJanssens, Bram
dc.contributor.authorBogaert, Matthias
dc.contributor.authorVerstockt, Steven
dc.contributor.imecauthorVerstockt, Steven
dc.contributor.orcidimecVerstockt, Steven::0000-0003-1094-2184
dc.date.accessioned2025-05-12T09:49:23Z
dc.date.available2025-05-11T05:42:14Z
dc.date.available2025-05-12T09:49:23Z
dc.date.issued2025
dc.description.abstractCurrent cycling analytics studies are limited to data about the eventual race results. This study searches how online commentary can be used to capture information about in-race dynamics by harnessing the power of large language models. The results show that the direct application of these models is already promising but not accurate enough to base end-to-end machine learning applications on the generated data. Our results show the tendency of these models to use information from previous queries in its generation step, which indicates data leakage and might hamper the scientific validation of approaches comparing various techniques. To capture overall rider behavior we suggest using graph representation learning. Our results indicate that this method is capable of identifying similar rider behavior, which to date was not yet feasible.
dc.identifier.doi10.1007/978-3-031-86692-0_2
dc.identifier.eisbn978-3-031-86692-0
dc.identifier.isbn978-3-031-86691-3
dc.identifier.issn1865-0929
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/45621
dc.publisherSPRINGER INTERNATIONAL PUBLISHING AG
dc.source.beginpage14
dc.source.conference11th International Workshop on Machine Learning and Data Mining for Sports Analytics - MLSA
dc.source.conferencedate2024-09-09
dc.source.conferencelocationVilnius
dc.source.endpage25
dc.source.journalMachine Learning and Data Mining for Sports Analytics (MLSA 2024)
dc.source.numberofpages12
dc.title

Large Language Models on Race Commentary: Towards Granular Data in Cycling Analytics

dc.typeProceedings paper
dspace.entity.typePublication
Files

Original bundle

Name:
DS903.pdf
Size:
982.51 KB
Format:
Adobe Portable Document Format
Description:
Accepted
Publication available in collections: