Publication:

smartSim: simulation of splice aware single cell smart-seq3 data

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0002-2169-4588
cris.virtual.orcid0000-0002-2397-8269
cris.virtualsource.department9b4166a7-acf3-4fec-b419-a686774bae40
cris.virtualsource.department85c695be-df5e-4b5a-896a-a6d76cdde981
cris.virtualsource.orcid9b4166a7-acf3-4fec-b419-a686774bae40
cris.virtualsource.orcid85c695be-df5e-4b5a-896a-a6d76cdde981
dc.contributor.authorVan Hecke, Marie
dc.contributor.authorMarchal, Kathleen
dc.contributor.imecauthorVan Hecke, Marie
dc.contributor.imecauthorMarchal, Kathleen
dc.contributor.orcidimecVan Hecke, Marie::0000-0002-2397-8269
dc.contributor.orcidimecMarchal, Kathleen::0000-0002-2169-4588
dc.date.accessioned2025-08-29T03:57:13Z
dc.date.available2025-08-29T03:57:13Z
dc.date.issued2025
dc.description.abstractMotivation Smart-seq3 is a powerful full-length single-cell RNA sequencing protocol that enables transcript-level quantification and splicing analysis by preserving unique molecular identifier (UMI) information. However, benchmarking computational tools for isoform reconstruction and splicing quantification remains challenging due to the lack of ground truth datasets. Herein, we present smartSim, a Smart-seq3 read simulator designed to generate realistic sequencing data that accurately reflects the complexities of single-cell transcriptomics. Results smartSim simulates known and novel splicing events, generates both UMI-containing and internal reads, and mimics protocol-specific biases by leveraging empirical data distributions. Our results show that smartSim-generated data closely resembles real Smart-seq3 datasets in terms of fragment length distributions, internal read counts, and read quality scores. It generates raw sequencing reads in FASTQ format, making it compatible with both genome- and transcriptome-based alignment tools. By extending simulation beyond gene-level quantification, smartSim provides a crucial resource for evaluating and improving computational methods for alternative splicing detection and isoform reconstruction in single-cell RNA sequencing. Availability and implementation smartSim is available at https://github.com/MarchalLab/smartSim
dc.description.wosFundingTextThis work was supported by grants of Fonds Wetenschappelijk Onderzoek-Vlaanderen (FWO) (3G045620, 3G046318), UGent BOF (BOF 01J06219, BOF/ IOP/2022/045BOF) and SBO-FWO (S004824N).
dc.identifier.doi10.1093/bioadv/vbaf183
dc.identifier.issn2635-0041
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/46132
dc.publisherOXFORD UNIV PRESS
dc.source.beginpagevbaf183
dc.source.issue1
dc.source.journalBIOINFORMATICS ADVANCES
dc.source.numberofpages7
dc.source.volume5
dc.title

smartSim: simulation of splice aware single cell smart-seq3 data

dc.typeJournal article
dspace.entity.typePublication
Files

Original bundle

Name:
vbaf183.pdf
Size:
1.15 MB
Format:
Adobe Portable Document Format
Description:
Published
Publication available in collections: