Publication:

FOCUS: object-centric world models for robotic manipulation

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0003-3319-5986
cris.virtual.orcid0000-0001-8193-4240
cris.virtual.orcid0000-0002-7271-7479
cris.virtualsource.departmentba9dbec5-db1c-4340-9acd-b969f93c69b4
cris.virtualsource.departmenta3951998-889b-478d-8b37-96c5e29b8d88
cris.virtualsource.departmentc857a29f-e812-4c34-b941-a94d9bec46f5
cris.virtualsource.orcidba9dbec5-db1c-4340-9acd-b969f93c69b4
cris.virtualsource.orcida3951998-889b-478d-8b37-96c5e29b8d88
cris.virtualsource.orcidc857a29f-e812-4c34-b941-a94d9bec46f5
dc.contributor.authorFerraro, Stefano
dc.contributor.authorMazzaglia, Pietro
dc.contributor.authorVerbelen, Tim
dc.contributor.authorDhoedt, Bart
dc.contributor.imecauthorFerraro, Stefano
dc.contributor.imecauthorMazzaglia, Pietro
dc.contributor.imecauthorDhoedt, Bart
dc.contributor.orcidimecFerraro, Stefano::0000-0001-8193-4240
dc.contributor.orcidimecMazzaglia, Pietro::0000-0003-3319-5986
dc.contributor.orcidimecDhoedt, Bart::0000-0002-7271-7479
dc.date.accessioned2025-05-19T07:57:13Z
dc.date.available2025-05-17T05:44:53Z
dc.date.available2025-05-19T07:57:13Z
dc.date.issued2025
dc.description.abstractUnderstanding the world in terms of objects and the possible interactions with them is an important cognitive ability. However, current world models adopted in reinforcement learning typically lack this structure and represent the world state in a global latent vector. To address this, we propose FOCUS, a model-based agent that learns an object-centric world model. This novel representation also enables the design of an object-centric exploration mechanism, which encourages the agent to interact with objects and discover useful interactions. We benchmark FOCUS in several robotic manipulation settings, where we found that our method can be used to improve manipulation skills. The object-centric world model leads to more accurate predictions of the objects in the scene and it enables more efficient learning. The object-centric exploration strategy fosters interactions with the objects in the environment, such as reaching, moving, and rotating them, and it allows fast adaptation of the agent to sparse reward reinforcement learning tasks. Using a Franka Emika robot arm, we also showcase how FOCUS proves useful in real-world applications. Website: https://focus-manipulation.github.io/.
dc.description.wosFundingTextThe author(s) declare that financial support was received for the research and/or publication of this article. This research received funding from the Flemish Government (AI Research Program). Pietro Mazzaglia was funded by a Ph.D. grant from the Flanders Research Foundation (FWO).
dc.identifier.doi10.3389/fnbot.2025.1585386
dc.identifier.issn1662-5218
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/45673
dc.publisherFRONTIERS MEDIA SA
dc.source.beginpage1
dc.source.endpage11
dc.source.issue2025
dc.source.journalFRONTIERS IN NEUROROBOTICS
dc.source.numberofpages11
dc.source.volume19
dc.title

FOCUS: object-centric world models for robotic manipulation

dc.typeJournal article
dspace.entity.typePublication
Files

Original bundle

Name:
8801.pdf
Size:
1.66 MB
Format:
Adobe Portable Document Format
Description:
Published
Publication available in collections: