Publication:

Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0009-0007-7993-6686
cris.virtual.orcid0000-0001-5817-7886
cris.virtual.orcid0000-0002-1428-0301
cris.virtual.orcid0000-0003-4408-6523
cris.virtualsource.departmentcd809ef4-6c63-4775-8d82-298a275c14a9
cris.virtualsource.departmentc914e7c0-7efb-4c2b-87b4-ae881ddf37db
cris.virtualsource.department891de1ef-83e1-4ca0-ae39-c3daab198fe5
cris.virtualsource.department48554e7b-ff43-44b9-9f84-0dcbd96416d7
cris.virtualsource.orcidcd809ef4-6c63-4775-8d82-298a275c14a9
cris.virtualsource.orcidc914e7c0-7efb-4c2b-87b4-ae881ddf37db
cris.virtualsource.orcid891de1ef-83e1-4ca0-ae39-c3daab198fe5
cris.virtualsource.orcid48554e7b-ff43-44b9-9f84-0dcbd96416d7
dc.contributor.authorChakraborty, Abhinaba
dc.contributor.authorTavernier, Wouter
dc.contributor.authorKourtis, Akis
dc.contributor.authorPickavet, Mario
dc.contributor.authorOikonomakis, Andreas
dc.contributor.authorColle, Didier
dc.date.accessioned2026-03-30T08:18:31Z
dc.date.available2026-03-30T08:18:31Z
dc.date.createdwos2025-09-26
dc.date.issued2025
dc.description.abstractThe necessity of processing real-time data at the network edge is growing. Low-power AI accelerators, especially edge GPUs, help meet this demand by mitigating cloud-related latency and bandwidth issues. However, GPUs remain underutilised, even in heavy workloads, due to a limited understanding of resource sharing in edge computing. This work analyses key GPU metrics: utilisation, memory, streaming multiprocessors (SMs), and tensorcores on NVIDIA Jetson devices under concurrent vision-inference workloads. Our findings show that while GPU utilisation can reach 100 % with optimisations, SMs and tensor cores often run at only 15-30 % capacity.
dc.description.wosFundingTextThe research work presented in this article has been supported by the European Commission under the Horizon Europe Programme and the OASEES project (no. 101092702).
dc.identifier.doi10.1109/ISPASS64960.2025.00043
dc.identifier.isbn979-8-3315-0295-9
dc.identifier.issn2994-9513
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/58954
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherIEEE COMPUTER SOC
dc.source.beginpage359
dc.source.conferenceIEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
dc.source.conferencedate2025-05-11
dc.source.conferencelocationGent
dc.source.endpage361
dc.source.journal2025 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS
dc.source.numberofpages3
dc.title

Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson

dc.typeProceedings paper
dspace.entity.typePublication
imec.internal.crawledAt2025-10-22
imec.internal.sourcecrawler
Files

Original bundle

Name:
8891.pdf
Size:
4.56 MB
Format:
Adobe Portable Document Format
Description:
Published
Name:
8891_acc.pdf
Size:
4.5 MB
Format:
Adobe Portable Document Format
Description:
Published
Publication available in collections: