Publication:

Reinforcement learning based mass flow and supply temperature control for combined heat distribution

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0002-0159-175X
cris.virtual.orcid0000-0001-9355-6566
cris.virtual.orcid0000-0002-5523-0634
cris.virtual.orcid0000-0001-8029-4720
cris.virtualsource.departmentfd3cb8ca-82fb-4f47-9ae5-69bd1f453077
cris.virtualsource.department1cf77b59-f7f6-4d1d-af45-e08f88df7d20
cris.virtualsource.departmentf790b071-ce23-4ac6-8ece-af46054a6e2c
cris.virtualsource.departmenta09c2484-580c-4e75-992d-302438c7c31d
cris.virtualsource.orcidfd3cb8ca-82fb-4f47-9ae5-69bd1f453077
cris.virtualsource.orcid1cf77b59-f7f6-4d1d-af45-e08f88df7d20
cris.virtualsource.orcidf790b071-ce23-4ac6-8ece-af46054a6e2c
cris.virtualsource.orcida09c2484-580c-4e75-992d-302438c7c31d
dc.contributor.authorJacobs, Stef
dc.contributor.authorGhane, Sara
dc.contributor.authorAnwar, Ali
dc.contributor.authorMercelis, Siegfried
dc.contributor.authorHellinckx, Peter
dc.contributor.authorVerhaert, Ivan
dc.date.accessioned2026-03-19T15:52:28Z
dc.date.available2026-03-19T15:52:28Z
dc.date.createdwos2025-10-31
dc.date.issued2022
dc.description.abstractCombined heat distribution circuits (CHDCs) are increasingly used in apartment buildings. Here only one supply pipe distributes both space heating (SH) and domestic hot water (DHW). Currently, the supply temperature is set to the highest temperature needed by one of the end-users (i.e. 65ºC for DHW), even if low-temperature emitters are used for SH. However, using decentral storage tanks for DHW enable demand-based temperature controls to reduce unnecessary heat losses and poor efficiencies. This research uses reinforcement learning (RL), a machine learning technique, to develop new control strategies for CHDCs with underfloor heating and DHW storage tanks. The agent controls the supply temperature and the mass flow in the hybrid boiler room. Whether the RL agent is able to find the optimal control strategy depends on the definition of its Markov Decision Process (MDP) model elements, namely the states, the possible control actions and the reward function. The results show that an increasing gamma and decreasing learning rate during training leads to better performance and that the agent with the largest flexibility develops a better control strategy that resulted in up to 23% primary energy savings.
dc.description.wosFundingTextThis research was funded by two instances. Firstly, by the IOF-SBO project Smart Thermal Grids of the University of Antwerp, Belgium. Secondly, by FWO with grant number 1S08622N from the panel SBWT7B.
dc.identifier.doi10.1109/IECON49645.2022.9968547
dc.identifier.issn1553-572X
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/58892
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherIEEE
dc.source.conferenceIECON 2022 – 48th Annual Conference of the IEEE Industrial Electronics Society
dc.source.conferencedate2022-10-17
dc.source.conferencelocationBrussels
dc.source.journalIECON 2022 - 48TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY
dc.source.numberofpages6
dc.title

Reinforcement learning based mass flow and supply temperature control for combined heat distribution

dc.typeProceedings paper
dspace.entity.typePublication
imec.internal.crawledAt2025-10-22
imec.internal.sourcecrawler
Files
Publication available in collections: