Publication:

Inductive models for structured output prediction of lncRNA-disease associations

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0002-4884-9420
cris.virtual.orcid0000-0003-0983-256X
cris.virtualsource.department1818b505-f3d3-4102-86bb-f0b7724e974b
cris.virtualsource.departmentfdd92a25-30fb-4753-9761-308aae317a1a
cris.virtualsource.orcid1818b505-f3d3-4102-86bb-f0b7724e974b
cris.virtualsource.orcidfdd92a25-30fb-4753-9761-308aae317a1a
dc.contributor.authorNakano, Felipe Kenji
dc.contributor.authorBertoni, Livia
dc.contributor.authorCerri, Ricardo
dc.contributor.authorVens, Celine
dc.date.accessioned2026-04-14T08:49:52Z
dc.date.available2026-04-14T08:49:52Z
dc.date.createdwos2026-03-18
dc.date.issued2025
dc.description.abstractLong non-coding RNAs have gained significant attention due to their crucial roles in the pathogenesis of complex human diseases, such as neurological diseases, cardiovascular diseases, AIDS, diabetes, and various types of cancer. In the machine learning literature, lncRNA-disease association (LDA) has been widely investigated as a binary classification problem, where each lncRNA-disease pair is seen as an independent instance. This approach presents drawbacks as it does not exploit the correlation among the diseases, aggravates the already imbalanced dataset, and substantially increases the execution time. Furthermore, the literature focuses on the transductive setting where new disease associations are predicted in lncRNAs already seen by the model, which naturally restricts its application to already seen lncRNAs. As a solution, we propose to address LDA prediction as a structured output prediction problem, namely (hierarchical) multi-label classification, where all LDAs are predicted at once for a given lncRNA. We compared several LDA methods and their structured output variants with recent (hierarchical) multi-label classification methods in an inductive setting, e.g., disease associations are predicted in unseen lncRNAs. Our experiments reveal that approaching LDA prediction with structured output prediction leads to superior or competitive results while drastically reducing the running time.
dc.description.wosFundingTextThis study was financed by the Sao Paulo Research Foundation (FAPESP) grants 2022/02981-8, 2022/14762-9, and 2024/05438-9, and Research Fund Flanders (FWO) mandate 1235924N.
dc.identifier.doi10.1109/cibcb66090.2025.11177093
dc.identifier.issn2994-9351
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/59080
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherIEEE
dc.relation.ispartofseriesIEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology CIBCB
dc.source.beginpage225
dc.source.conference2025 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB
dc.source.conferencedate2025-08-20
dc.source.conferencelocationTainan, Taiwan
dc.source.endpage232
dc.source.journal2025 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB
dc.source.numberofpages8
dc.subject.keywordsLONG NONCODING RNAS
dc.subject.keywordsEXPRESSION
dc.subject.keywordsTREES
dc.title

Inductive models for structured output prediction of lncRNA-disease associations

dc.typeProceedings paper
dspace.entity.typePublication
imec.internal.crawledAt2026-04-07
imec.internal.sourcecrawler
imec.internal.wosCreatedAt2026-04-07
Files
Publication available in collections: