No more hand-tuning rewards: Masked constrained policy optimization for safe reinforcement learning

dc.contributor.author	Van Havermaet, Stef
dc.contributor.author	Khaluf, Yara
dc.contributor.author	Simoens, Pieter
dc.contributor.imecauthor	Van Havermaet, Stef
dc.contributor.imecauthor	Khaluf, Yara
dc.contributor.imecauthor	Simoens, Pieter
dc.contributor.orcidimec	Simoens, Pieter::0000-0002-9569-9373
dc.date.accessioned	2021-10-31T12:02:30Z
dc.date.available	2021-10-31T12:02:30Z
dc.date.embargo	9999-12-31
dc.date.issued	2021
dc.identifier.uri	https://imec-publications.be/handle/20.500.12860/37290
dc.identifier.url	http://www.ifaamas.org/Proceedings/aamas2021/pdfs/p1344.pdf
dc.source.beginpage	1344
dc.source.conference	AAMAS2021, the 20th International Conference on Autonomous Agents and Multiagent Systems
dc.source.conferencedate	3/05/2021
dc.source.conferencelocation	Online
dc.source.endpage	1352
dc.title	No more hand-tuning rewards: Masked constrained policy optimization for safe reinforcement learning
dc.type	Proceedings paper
dspace.entity.type	Publication
Files	Original bundle Name: 48442.pdf Size: 1.38 MB Format: Adobe Portable Document Format
Publication available in collections:	Conference contributions