Publication:
Inductive models for structured output prediction of lncRNA-disease associations
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| cris.virtual.orcid | 0000-0002-4884-9420 | |
| cris.virtual.orcid | 0000-0003-0983-256X | |
| cris.virtualsource.department | 1818b505-f3d3-4102-86bb-f0b7724e974b | |
| cris.virtualsource.department | fdd92a25-30fb-4753-9761-308aae317a1a | |
| cris.virtualsource.orcid | 1818b505-f3d3-4102-86bb-f0b7724e974b | |
| cris.virtualsource.orcid | fdd92a25-30fb-4753-9761-308aae317a1a | |
| dc.contributor.author | Nakano, Felipe Kenji | |
| dc.contributor.author | Bertoni, Livia | |
| dc.contributor.author | Cerri, Ricardo | |
| dc.contributor.author | Vens, Celine | |
| dc.date.accessioned | 2026-04-14T08:49:52Z | |
| dc.date.available | 2026-04-14T08:49:52Z | |
| dc.date.createdwos | 2026-03-18 | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Long non-coding RNAs have gained significant attention due to their crucial roles in the pathogenesis of complex human diseases, such as neurological diseases, cardiovascular diseases, AIDS, diabetes, and various types of cancer. In the machine learning literature, lncRNA-disease association (LDA) has been widely investigated as a binary classification problem, where each lncRNA-disease pair is seen as an independent instance. This approach presents drawbacks as it does not exploit the correlation among the diseases, aggravates the already imbalanced dataset, and substantially increases the execution time. Furthermore, the literature focuses on the transductive setting where new disease associations are predicted in lncRNAs already seen by the model, which naturally restricts its application to already seen lncRNAs. As a solution, we propose to address LDA prediction as a structured output prediction problem, namely (hierarchical) multi-label classification, where all LDAs are predicted at once for a given lncRNA. We compared several LDA methods and their structured output variants with recent (hierarchical) multi-label classification methods in an inductive setting, e.g., disease associations are predicted in unseen lncRNAs. Our experiments reveal that approaching LDA prediction with structured output prediction leads to superior or competitive results while drastically reducing the running time. | |
| dc.description.wosFundingText | This study was financed by the Sao Paulo Research Foundation (FAPESP) grants 2022/02981-8, 2022/14762-9, and 2024/05438-9, and Research Fund Flanders (FWO) mandate 1235924N. | |
| dc.identifier.doi | 10.1109/cibcb66090.2025.11177093 | |
| dc.identifier.issn | 2994-9351 | |
| dc.identifier.uri | https://imec-publications.be/handle/20.500.12860/59080 | |
| dc.language.iso | eng | |
| dc.provenance.editstepuser | greet.vanhoof@imec.be | |
| dc.publisher | IEEE | |
| dc.relation.ispartofseries | IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology CIBCB | |
| dc.source.beginpage | 225 | |
| dc.source.conference | 2025 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB | |
| dc.source.conferencedate | 2025-08-20 | |
| dc.source.conferencelocation | Tainan, Taiwan | |
| dc.source.endpage | 232 | |
| dc.source.journal | 2025 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB | |
| dc.source.numberofpages | 8 | |
| dc.subject.keywords | LONG NONCODING RNAS | |
| dc.subject.keywords | EXPRESSION | |
| dc.subject.keywords | TREES | |
| dc.title | Inductive models for structured output prediction of lncRNA-disease associations | |
| dc.type | Proceedings paper | |
| dspace.entity.type | Publication | |
| imec.internal.crawledAt | 2026-04-07 | |
| imec.internal.source | crawler | |
| imec.internal.wosCreatedAt | 2026-04-07 | |
| Files | ||
| Publication available in collections: |