A framework for flexibly guiding learning agents

Elbarbari, Mahmoud; Delgrange, Florent; Vervlimmeren, Ivo; Efthymiadis, Kyriakos; Vanderborght, Bram; Nowe, Ann

doi:10.1007/s00521-022-07396-x

Simple item page Full metadata Statistics

dc.contributor.author	Elbarbari, Mahmoud
dc.contributor.author	Delgrange, Florent
dc.contributor.author	Vervlimmeren, Ivo
dc.contributor.author	Efthymiadis, Kyriakos
dc.contributor.author	Vanderborght, Bram
dc.contributor.author	Nowe, Ann
dc.contributor.imecauthor	Elbarbari, Mahmoud
dc.contributor.imecauthor	Vanderborght, Bram
dc.contributor.orcidimec	Elbarbari, Mahmoud::0000-0001-9094-4221
dc.contributor.orcidimec	Vanderborght, Bram::0000-0003-4881-9341
dc.date.accessioned	2022-06-15T02:21:34Z
dc.date.available	2022-06-15T02:21:34Z
dc.date.issued	2025
dc.description.abstract	Reinforcement Learning (RL) enables artificial agents to learn through direct interaction with the environment. However, it usually does not scale up well to large problems due to its sampling inefficiency. Reward Shaping is a well-established approach that allows for more efficient learning by incorporating domain knowledge in RL agents via supplementary rewards. In this work, we propose a novel methodology that automatically generates reward shaping functions from user-provided Linear Temporal Logic on finite traces ( ) formulas. in our work serves as a rich language that allows the user to communicate domain knowledge to the learning agent. In both single and multi-agent settings, we demonstrate that our approach performs at least as well as the baseline approach while providing essential advantages in terms of flexibility and ease of use. We elaborate on some of these advantages empirically by demonstrating that our approach can handle domain knowledge with different levels of accuracy, and provides the user with the flexibility to express aspects of uncertainty in the provided advice.
dc.description.wosFundingText	This research was supported by the Flemish Government under the "Onderzoeksprogramma Artificiele Intelligentie (AI) Vlaanderen" programme.
dc.identifier.doi	10.1007/s00521-022-07396-x
dc.identifier.issn	0941-0643
dc.identifier.uri	https://imec-publications.be/handle/20.500.12860/39954
dc.publisher	SPRINGER LONDON LTD
dc.source.beginpage	13101
dc.source.endpage	13117
dc.source.issue	19
dc.source.journal	NEURAL COMPUTING & APPLICATIONS
dc.source.numberofpages	17
dc.source.volume	37
dc.title	A framework for flexibly guiding learning agents
dc.type	Journal article
dspace.entity.type	Publication
Files
Publication available in collections:	Articles

A framework for flexibly guiding learning agents

Date