Publication:

Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning

 
dc.contributor.authorLiu, Gaoyuan
dc.contributor.authorde Winter, Joris
dc.contributor.authorDurodie, Yuri
dc.contributor.authorSteckelmacher, Denis
dc.contributor.authorNowe, Ann
dc.contributor.authorVanderborght, Bram
dc.date.accessioned2026-01-19T14:28:14Z
dc.date.available2026-01-19T14:28:14Z
dc.date.issued2024
dc.description.abstractTask and motion planning (TAMP) for robotic manipulation necessitates long-horizon reasoning over versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing under certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for TAMP. In contrast, Reinforcement Learning (RL) excels at acquiring versatile, yet short-horizon, manipulation skills that are robust to uncertainty. In this letter, we design a method that integrates RL skills into TAMP pipelines. Besides the policy, an RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan-refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning approaches from both the TAMP and RL fields and illustrate the strengths of the method. The results show that by embedding RL skills, we extend the capability of TAMP to domains with probabilistic skills and improve planning efficiency compared to previous methods.
dc.identifier10.1109/LRA.2024.3398402
dc.identifier.doi10.1109/LRA.2024.3398402
dc.identifier.issn2377-3766
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/58665
dc.language.isoen
dc.publisherIEEE
dc.relation.ispartofIEEE ROBOTICS AND AUTOMATION LETTERS
dc.relation.ispartofseriesIEEE ROBOTICS AND AUTOMATION LETTERS
dc.source.beginpage5974
dc.source.endpage5981
dc.source.issue6
dc.source.journalIEEE Robotics and Automation Letters
dc.source.numberofpages8
dc.source.volume9
dc.subjectSAMPLING-BASED METHODS
dc.subjectManipulation planning
dc.subjectreinforcement learning
dc.subjecttask and motion planning
dc.subjectScience & Technology
dc.subjectTechnology
dc.titleOptimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning
dc.typeJournal article
dspace.entity.typePublication
oaire.citation.editionWOS.SCI
oaire.citation.endPage5981
oaire.citation.issue6
oaire.citation.startPage5974
oaire.citation.volume9
person.identifier.orcid0000-0002-9063-2751
person.identifier.orcid0000-0002-5818-7539
person.identifier.orcid0000-0003-1521-8494
person.identifier.orcid0000-0001-6346-4564
person.identifier.orcid0000-0003-4881-9341
person.identifier.ridA-1599-2008