Publication:

HephaestusForge: Optimal microservice deployment across the Compute Continuum via Reinforcement Learning

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0003-4824-1199
cris.virtual.orcid0000-0002-6276-2057
cris.virtualsource.department505a9fa2-2261-4859-8c77-73c2ba21244c
cris.virtualsource.department5fc1041b-34c9-4bdb-ba41-1a986f0c4c25
cris.virtualsource.orcid505a9fa2-2261-4859-8c77-73c2ba21244c
cris.virtualsource.orcid5fc1041b-34c9-4bdb-ba41-1a986f0c4c25
dc.contributor.authorPereira dos Santos, José Pedro
dc.contributor.authorZaccarini, Mattia
dc.contributor.authorPoltronieri, Filippo
dc.contributor.authorTortonesi, Mauro
dc.contributor.authorStefanelli, Cesare
dc.contributor.authorDi Cicco, Nicola
dc.contributor.authorDe Turck, Filip
dc.contributor.imecauthorSantos, Jose
dc.contributor.imecauthorDe Turck, Filip
dc.contributor.orcidimecDe Turck, Filip::0000-0003-4824-1199
dc.date.accessioned2025-02-06T07:31:37Z
dc.date.available2025-02-05T20:50:25Z
dc.date.available2025-02-06T07:31:37Z
dc.date.issued2025
dc.description.abstractWith the advent of containerization technologies, microservices have revolutionized application deployment by converting old monolithic software into a group of loosely coupled containers, aiming to offer greater flexibility and improve operational efficiency. This transition made applications more complex, consisting of tens to hundreds of microservices. Designing effective orchestration mechanisms remains a crucial challenge, especially for emerging distributed cloud paradigms such as the Compute Continuum (CC). Orchestration across multiple clusters is still not extensively explored in the literature since most works consider single-cluster scenarios. In the CC scenario, the orchestrator must decide the optimal locations for each microservice, deciding whether instances are deployed altogether or placed across different clusters, significantly increasing orchestration complexity. This paper addresses orchestration in a containerized CC environment by studying a Reinforcement Learning (RL) approach for efficient microservice deployment in Kubernetes (K8s) clusters, a widely adopted container orchestration platform. This work demonstrates the effectiveness of RL in achieving near-optimal deployment schemes under dynamic conditions, where network latency and resource capacity fluctuate. We extensively evaluate a multi-objective reward function that aims to minimize overall latency, reduce deployment costs, and promote fair distribution of microservice instances, and we compare it against typical heuristic-based approaches. The results from an implemented OpenAI Gym framework, named as HephaestusForge, show that RL algorithms achieve minimal rejection rates (as low as 0.002%, 90x less than the baseline Karmada scheduler). Cost-aware strategies result in lower deployment costs (2.5 units), and latency-aware functions achieve lower latency (268–290 ms), improving by 1.5x and 1.3x, respectively, over the best-performing baselines. HephaestusForge is available in a public open-source repository, allowing researchers to validate their own placement algorithms. This study also highlights the adaptability of the DeepSets (DS) neural network in optimizing microservice placement across diverse multi-cluster setups without retraining. The DS neural network can handle inputs and outputs as arbitrarily sized sets, enabling the RL algorithm to learn a policy not bound to a fixed number of clusters.
dc.description.wosFundingTextThis work has been partially supported by the Spoke 1 "FutureHPC & BigData" of the Italian Research Center on High-Performance Computing, Big Data and Quantum Computing (ICSC) funded by MUR Missione 4-Next Generation EU (NGEU) . Jose Santos is funded by the Research Foundation Flanders (FWO) , Belgium, grant number 1299323N.
dc.identifier.doi10.1016/j.future.2024.107680
dc.identifier.issn0167-739X
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/45175
dc.publisherELSEVIER
dc.source.beginpage107680
dc.source.issueMay
dc.source.journalFUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE
dc.source.numberofpages16
dc.source.volume166
dc.subject.keywordsSERVICE FUNCTION CHAIN
dc.subject.keywordsCLOUD
dc.subject.keywordsORCHESTRATION
dc.title

HephaestusForge: Optimal microservice deployment across the Compute Continuum via Reinforcement Learning

dc.typeJournal article
dspace.entity.typePublication
Files

Original bundle

Name:
8739.pdf
Size:
2.72 MB
Format:
Adobe Portable Document Format
Description:
Published
Publication available in collections: