Publication:
Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson
| cris.virtual.orcid | 0009-0007-7993-6686 | |
| cris.virtual.orcid | 0000-0001-5817-7886 | |
| cris.virtual.orcid | 0000-0002-1428-0301 | |
| cris.virtual.orcid | 0000-0003-4408-6523 | |
| cris.virtualsource.department | cd809ef4-6c63-4775-8d82-298a275c14a9 | |
| cris.virtualsource.department | c914e7c0-7efb-4c2b-87b4-ae881ddf37db | |
| cris.virtualsource.department | 891de1ef-83e1-4ca0-ae39-c3daab198fe5 | |
| cris.virtualsource.department | 48554e7b-ff43-44b9-9f84-0dcbd96416d7 | |
| cris.virtualsource.orcid | cd809ef4-6c63-4775-8d82-298a275c14a9 | |
| cris.virtualsource.orcid | c914e7c0-7efb-4c2b-87b4-ae881ddf37db | |
| cris.virtualsource.orcid | 891de1ef-83e1-4ca0-ae39-c3daab198fe5 | |
| cris.virtualsource.orcid | 48554e7b-ff43-44b9-9f84-0dcbd96416d7 | |
| dc.contributor.author | Chakraborty, Abhinaba | |
| dc.contributor.author | Tavernier, Wouter | |
| dc.contributor.author | Kourtis, Akis | |
| dc.contributor.author | Pickavet, Mario | |
| dc.contributor.author | Oikonomakis, Andreas | |
| dc.contributor.author | Colle, Didier | |
| dc.date.accessioned | 2026-03-30T08:18:31Z | |
| dc.date.available | 2026-03-30T08:18:31Z | |
| dc.date.createdwos | 2025-09-26 | |
| dc.date.issued | 2025 | |
| dc.description.abstract | The necessity of processing real-time data at the network edge is growing. Low-power AI accelerators, especially edge GPUs, help meet this demand by mitigating cloud-related latency and bandwidth issues. However, GPUs remain underutilised, even under heavy workloads, due to a limited understanding of resource sharing in edge computing. This work analyses key GPU metrics — utilisation, memory, streaming multiprocessors (SMs), and tensor cores — on NVIDIA Jetson devices under concurrent vision-inference workloads. Our findings show that while GPU utilisation can reach 100% with optimisations, SMs and tensor cores often run at only 15–30% capacity. | |
| dc.description.wosFundingText | The research work presented in this article has been supported by the European Commission under the Horizon Europe Programme and the OASEES project (no. 101092702). | |
| dc.identifier.doi | 10.1109/ISPASS64960.2025.00043 | |
| dc.identifier.isbn | 979-8-3315-0295-9 | |
| dc.identifier.issn | 2994-9513 | |
| dc.identifier.uri | https://imec-publications.be/handle/20.500.12860/58954 | |
| dc.language.iso | eng | |
| dc.provenance.editstepuser | greet.vanhoof@imec.be | |
| dc.publisher | IEEE COMPUTER SOC | |
| dc.source.beginpage | 359 | |
| dc.source.conference | IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) | |
| dc.source.conferencedate | 2025-05-11 | |
| dc.source.conferencelocation | Ghent | |
| dc.source.endpage | 361 | |
| dc.source.journal | 2025 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS | |
| dc.source.numberofpages | 3 | |
| dc.title | Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson | |
| dc.type | Proceedings paper | |
| dspace.entity.type | Publication | |
| imec.internal.crawledAt | 2025-10-22 | |
| imec.internal.source | crawler | |