Publication:
Large Language Models on Race Commentary: Towards Granular Data in Cycling Analytics
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.orcid | 0000-0003-1094-2184 | |
| cris.virtualsource.department | ea3f2e62-e271-4a8d-8820-58ca83be4023 | |
| cris.virtualsource.orcid | ea3f2e62-e271-4a8d-8820-58ca83be4023 | |
| dc.contributor.author | Janssens, Bram | |
| dc.contributor.author | Bogaert, Matthias | |
| dc.contributor.author | Verstockt, Steven | |
| dc.contributor.imecauthor | Verstockt, Steven | |
| dc.contributor.orcidimec | Verstockt, Steven::0000-0003-1094-2184 | |
| dc.date.accessioned | 2025-05-12T09:49:23Z | |
| dc.date.available | 2025-05-11T05:42:14Z | |
| dc.date.available | 2025-05-12T09:49:23Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Current cycling analytics studies are limited to data about the eventual race results. This study searches how online commentary can be used to capture information about in-race dynamics by harnessing the power of large language models. The results show that the direct application of these models is already promising but not accurate enough to base end-to-end machine learning applications on the generated data. Our results show the tendency of these models to use information from previous queries in its generation step, which indicates data leakage and might hamper the scientific validation of approaches comparing various techniques. To capture overall rider behavior we suggest using graph representation learning. Our results indicate that this method is capable of identifying similar rider behavior, which to date was not yet feasible. | |
| dc.identifier.doi | 10.1007/978-3-031-86692-0_2 | |
| dc.identifier.eisbn | 978-3-031-86692-0 | |
| dc.identifier.isbn | 978-3-031-86691-3 | |
| dc.identifier.issn | 1865-0929 | |
| dc.identifier.uri | https://imec-publications.be/handle/20.500.12860/45621 | |
| dc.publisher | SPRINGER INTERNATIONAL PUBLISHING AG | |
| dc.source.beginpage | 14 | |
| dc.source.conference | 11th International Workshop on Machine Learning and Data Mining for Sports Analytics - MLSA | |
| dc.source.conferencedate | 2024-09-09 | |
| dc.source.conferencelocation | Vilnius | |
| dc.source.endpage | 25 | |
| dc.source.journal | Machine Learning and Data Mining for Sports Analytics (MLSA 2024) | |
| dc.source.numberofpages | 12 | |
| dc.title | Large Language Models on Race Commentary: Towards Granular Data in Cycling Analytics | |
| dc.type | Proceedings paper | |
| dspace.entity.type | Publication | |
| Files | Original bundle
| |
| Publication available in collections: |