Publication:

Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees

Date

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0009-0002-0785-4028
cris.virtual.orcid0000-0003-2707-4176
cris.virtualsource.department4df51e0c-d8c7-4197-ab29-dead1d206d21
cris.virtualsource.department620b024a-fd0a-4fbf-9967-6a13307ced87
cris.virtualsource.orcid4df51e0c-d8c7-4197-ab29-dead1d206d21
cris.virtualsource.orcid620b024a-fd0a-4fbf-9967-6a13307ced87
dc.contributor.authorVan Puyvelde, Toon
dc.contributor.authorZareh, Mehran
dc.contributor.authorDevelder, Chris
dc.date.accessioned2026-06-08T14:27:25Z
dc.date.available2026-06-08T14:27:25Z
dc.date.createdwos2025-09-14
dc.date.issued2025
dc.description.abstractIn recent years, deep reinforcement learning (DRL) algorithms have gained traction in home energy management systems. However, their adoption by energy management companies remains limited due to the black-box nature of DRL, which fails to provide transparent decision-making feedback. To address this, explainable reinforcement learning (XRL) techniques have emerged, aiming to make DRL decisions more transparent. Among these, soft differential decision tree (DDT) distillation provides a promising approach due to the clear decision rules they are based on, which can be efficiently computed. However, achieving high performance often requires deep, and completely full, trees, which reduces interpretability. To overcome this, we propose a novel asymmetric soft DDT construction method. Unlike traditional soft DDTs, our approach adaptively constructs trees by expanding nodes only when necessary. This improves the efficient use of decision nodes, which require a predetermined depth to construct full symmetric trees, enhancing both interpretability and performance. We demonstrate the potential of asymmetric DDTs to provide transparent, efficient, and high-performing decision-making in home energy management systems.
dc.description.wosFundingTextThis research was funded by Daikin Europe NV under the PhD Framework Agreement with Ghent University.
dc.identifier.doi10.1145/3679240.3734671
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/59639
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherASSOC COMPUTING MACHINERY
dc.source.beginpage968
dc.source.conference16th ACM International Conference on Future and Sustainable Energy Systems - E-Energy
dc.source.conferencedate2025-06-17
dc.source.conferencelocationRotterdam
dc.source.endpage972
dc.source.journalPROCEEDINGS OF THE 2025 THE 16TH ACM INTERNATIONAL CONFERENCE ON FUTURE AND SUSTAINABLE ENERGY SYSTEMS, E-ENERGY 2025
dc.source.numberofpages5
dc.title

Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees

dc.typeProceedings paper
dspace.entity.typePublication
imec.internal.crawledAt2025-10-22
imec.internal.sourcecrawler
Files
Publication available in collections: