Publication:

A framework for flexibly guiding learning agents

 
dc.contributor.authorElbarbari, Mahmoud
dc.contributor.authorDelgrange, Florent
dc.contributor.authorVervlimmeren, Ivo
dc.contributor.authorEfthymiadis, Kyriakos
dc.contributor.authorVanderborght, Bram
dc.contributor.authorNowe, Ann
dc.contributor.imecauthorElbarbari, Mahmoud
dc.contributor.imecauthorVanderborght, Bram
dc.contributor.orcidimecElbarbari, Mahmoud::0000-0001-9094-4221
dc.contributor.orcidimecVanderborght, Bram::0000-0003-4881-9341
dc.date.accessioned2022-06-15T02:21:34Z
dc.date.available2022-06-15T02:21:34Z
dc.date.issued2025
dc.description.abstractReinforcement Learning (RL) enables artificial agents to learn through direct interaction with the environment. However, it usually does not scale up well to large problems due to its sampling inefficiency. Reward Shaping is a well-established approach that allows for more efficient learning by incorporating domain knowledge in RL agents via supplementary rewards. In this work, we propose a novel methodology that automatically generates reward shaping functions from user-provided Linear Temporal Logic on finite traces ( ) formulas. in our work serves as a rich language that allows the user to communicate domain knowledge to the learning agent. In both single and multi-agent settings, we demonstrate that our approach performs at least as well as the baseline approach while providing essential advantages in terms of flexibility and ease of use. We elaborate on some of these advantages empirically by demonstrating that our approach can handle domain knowledge with different levels of accuracy, and provides the user with the flexibility to express aspects of uncertainty in the provided advice.
dc.description.wosFundingTextThis research was supported by the Flemish Government under the "Onderzoeksprogramma Artificiele Intelligentie (AI) Vlaanderen" programme.
dc.identifier.doi10.1007/s00521-022-07396-x
dc.identifier.issn0941-0643
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/39954
dc.publisherSPRINGER LONDON LTD
dc.source.beginpage13101
dc.source.endpage13117
dc.source.issue19
dc.source.journalNEURAL COMPUTING & APPLICATIONS
dc.source.numberofpages17
dc.source.volume37
dc.title

A framework for flexibly guiding learning agents

dc.typeJournal article
dspace.entity.typePublication
Files
Publication available in collections: