Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees

Van Puyvelde, Toon; Zareh, Mehran; Develder, Chris

doi:10.1145/3679240.3734671

Simple item page Full metadata Statistics

cris.virtual.department	#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department	#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid	0009-0002-0785-4028
cris.virtual.orcid	0000-0003-2707-4176
cris.virtualsource.department	4df51e0c-d8c7-4197-ab29-dead1d206d21
cris.virtualsource.department	620b024a-fd0a-4fbf-9967-6a13307ced87
cris.virtualsource.orcid	4df51e0c-d8c7-4197-ab29-dead1d206d21
cris.virtualsource.orcid	620b024a-fd0a-4fbf-9967-6a13307ced87
dc.contributor.author	Van Puyvelde, Toon
dc.contributor.author	Zareh, Mehran
dc.contributor.author	Develder, Chris
dc.date.accessioned	2026-06-08T14:27:25Z
dc.date.available	2026-06-08T14:27:25Z
dc.date.createdwos	2025-09-14
dc.date.issued	2025
dc.description.abstract	In recent years, deep reinforcement learning (DRL) algorithms have gained traction in home energy management systems. However, their adoption by energy management companies remains limited due to the black-box nature of DRL, which fails to provide transparent decision-making feedback. To address this, explainable reinforcement learning (XRL) techniques have emerged, aiming to make DRL decisions more transparent. Among these, soft differential decision tree (DDT) distillation provides a promising approach due to the clear decision rules they are based on, which can be efficiently computed. However, achieving high performance often requires deep, and completely full, trees, which reduces interpretability. To overcome this, we propose a novel asymmetric soft DDT construction method. Unlike traditional soft DDTs, our approach adaptively constructs trees by expanding nodes only when necessary. This improves the efficient use of decision nodes, which require a predetermined depth to construct full symmetric trees, enhancing both interpretability and performance. We demonstrate the potential of asymmetric DDTs to provide transparent, efficient, and high-performing decision-making in home energy management systems.
dc.description.wosFundingText	This research was funded by Daikin Europe NV under the PhD Framework Agreement with Ghent University.
dc.identifier.doi	10.1145/3679240.3734671
dc.identifier.uri	https://imec-publications.be/handle/20.500.12860/59639
dc.language.iso	eng
dc.provenance.editstepuser	greet.vanhoof@imec.be
dc.publisher	ASSOC COMPUTING MACHINERY
dc.source.beginpage	968
dc.source.conference	16th ACM International Conference on Future and Sustainable Energy Systems - E-Energy
dc.source.conferencedate	2025-06-17
dc.source.conferencelocation	Rotterdam
dc.source.endpage	972
dc.source.journal	PROCEEDINGS OF THE 2025 THE 16TH ACM INTERNATIONAL CONFERENCE ON FUTURE AND SUSTAINABLE ENERGY SYSTEMS, E-ENERGY 2025
dc.source.numberofpages	5
dc.title	Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees
dc.type	Proceedings paper
dspace.entity.type	Publication
imec.internal.crawledAt	2025-10-22
imec.internal.source	crawler
Files
Publication available in collections:	Conference contributions

Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees

Date