Publication:

SideDRAM: Integrating SoftSIMD Datapaths near DRAM Banks for Energy-Efficient Variable Precision Computation

Date

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0002-1087-3433
cris.virtual.orcid0000-0002-3599-8515
cris.virtualsource.department92510db1-91b0-4865-a06f-c3b655429966
cris.virtualsource.department7a992f6f-feea-493d-b4d8-c297450cff52
cris.virtualsource.orcid92510db1-91b0-4865-a06f-c3b655429966
cris.virtualsource.orcid7a992f6f-feea-493d-b4d8-c297450cff52
dc.contributor.authorMorillas, Rafael Medina
dc.contributor.authorYu, Pengbo
dc.contributor.authorLevisse, Alexandre
dc.contributor.authorBiswas, Dwaipayan
dc.contributor.authorZapater, Marina
dc.contributor.authorAnsaloni, Giovanni
dc.contributor.authorCatthoor, Francky
dc.contributor.authorAtienza, David
dc.date.accessioned2026-02-02T12:40:13Z
dc.date.available2026-02-02T12:40:13Z
dc.date.createdwos2025-11-09
dc.date.issued2025
dc.description.abstractBy interfacing computing logic directly to the DRAM banks, bank-level Compute-near-Memory (CnM) architectures promise to mitigate the bottleneck at the memory interconnect. While this computation paradigm heavily reduces the energy requirements for data movement across the system, current solutions fail to co-optimize hardware and software to further increase efficiency. Instead, in this manuscript, we present SideDRAM, a co-designed bank-level CnM architecture to enable massively parallel and energy-efficient computations near DRAM. In contrast with past solutions, we support flexible data typing and heterogeneous quantization, relying on the robustness of workloads to employ small bitwidths, and enable a row-wide access to the banks to exploit parallelism and spatial locality. As a result, SideDRAM integrates (1) software-defined SIMD (SoftSIMD) datapaths, supporting low-energy computing with flexible precision, (2) an interface to the banks based on very wide registers (VWRs), enabling asymmetric data access to both utilize the full DRAM bank bandwidth and leverage data locality at the datapath, and (3) a low-overhead distributed control plane, allowing the efficient handling of variable data typing. We benchmark SideDRAM as a near-DRAM solution by analyzing the area, performance, and energy consumption of an HBM2 CnM channel executing heterogeneously quantized machine learning models. The results show that, compared to the state-of-the-art FIMDRAM design, energy improvements of up to 67% are achieved when a DeiT-S inference is executed with a batch size of 16 under the same area constraints, resulting in energy-delay-area product (EDAP) savings that reach 83%. When comparing to a massively parallel mixed-signal CnM solution, SideDRAM consistently obtains similar performance and better energy efficiency results (geomean of 15× improvement across workloads) at a lower area overhead.
dc.description.wosFundingTextThis work has been partially supported by a joint research grant for ESL-EPFL by IMEC, by the Swiss NSF Edge-Companions project (GA No. 10002812), by the EC H2020 FVLLMONTI project (GA No. 101016776) and by the Swiss State Secretariat for Education, Research, and Innovation (SERI) through the SwissChips research project. This research was partially conducted by ACCESS-AI Chip Center for Emerging Smart Systems, supported by the InnoHK initiative of the Innovation and Technology Commission of the Hong Kong Special Administrative Region Government.
dc.identifier.doi10.1145/3762641
dc.identifier.issn1539-9087
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/58760
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherASSOC COMPUTING MACHINERY
dc.source.beginpage111
dc.source.issue5
dc.source.journalACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS
dc.source.numberofpages24
dc.source.volume24
dc.subject.keywordsACCELERATOR
dc.subject.keywordsORGANIZATION
dc.title

SideDRAM: Integrating SoftSIMD Datapaths near DRAM Banks for Energy-Efficient Variable Precision Computation

dc.typeJournal article
dspace.entity.typePublication
imec.internal.crawledAt2025-11-20
imec.internal.sourcecrawler
Files
Publication available in collections: