Publication:

Incremental Knowledge Graph Construction from Heterogeneous Data Sources

 
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid0000-0002-6645-1264
cris.virtual.orcid0000-0001-6917-2167
cris.virtual.orcid0000-0003-0248-0987
cris.virtual.orcid0000-0002-7195-9935
cris.virtualsource.departmentdd01cd39-80eb-41ed-821e-b536a72044dd
cris.virtualsource.department4fb1d6d7-4811-4069-ab93-685349dd45fa
cris.virtualsource.department54a7efb9-d406-4578-a72e-be6a136a978a
cris.virtualsource.department7660937d-c1c1-46b1-94dc-cc564c399a77
cris.virtualsource.orciddd01cd39-80eb-41ed-821e-b536a72044dd
cris.virtualsource.orcid4fb1d6d7-4811-4069-ab93-685349dd45fa
cris.virtualsource.orcid54a7efb9-d406-4578-a72e-be6a136a978a
cris.virtualsource.orcid7660937d-c1c1-46b1-94dc-cc564c399a77
dc.contributor.authorVan Assche, Dylan
dc.contributor.authorRojas Melendez, Julian
dc.contributor.authorDe Meester, Ben
dc.contributor.authorColpaert, Pieter
dc.date.accessioned2026-04-02T08:45:12Z
dc.date.available2026-04-02T08:45:12Z
dc.date.issued2026
dc.description.abstractSharing datasets that change (through creates, updates, deletes) poses challenges to data consumers, including reconciling historical versioning and managing frequent changes. This is evident for Knowledge Graphs (KGs), materialized from such datasets, where synchronization happens through frequent regeneration. However, this is time-consuming, loses history, and wastes computing resources through redundant processing. We present a KG generation approach that efficiently handles evolving data sources with different change signaling strategies. We investigate change signaling strategies of real-world datasets, propose corresponding change detection algorithms, and introduce a declarative approach based on the RDF Mapping Language (RML) and Function Ontology to materialize changes for evolving KGs. Detected changes can be automatically published as a Linked Data Event Stream (LDES), using the Activity Streams 2.0 vocabulary to describe changes and communicate them over the Web. We implement our approach in the RMLMapper as Incremental RML and evaluate it both functionally, and quantitatively using a modified version of the GTFS Madrid Benchmark and several real-world data sources. Our approach reduces storage and computing requirements for generating and storing multiple KG versions (up to 315.83x less storage, 4.59x less CPU time, and 1.51x less memory) and reduces KG construction time up to 4.41x. Performance gains are more pronounced for larger datasets, while our approach's overhead partially offsets benefits for smaller ones. Overall, our approach lowers the cost of publishing and maintaining KGs and, via LDES, supports timely, Web-native dissemination of changes. We plan to optimize our change detection algorithms and use windowing to support streaming data.
dc.identifier.doi10.1177/22104968251412270
dc.identifier.issn1570-0844
dc.identifier.issn2210-4968
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/58997
dc.language.isoeng
dc.provenance.editstepusergreet.vanhoof@imec.be
dc.publisherSage
dc.source.beginpageN/A
dc.source.issue2
dc.source.journalSemantic Web: – Interoperability, Usability, Applicability
dc.source.numberofpages33
dc.source.volume17
dc.title

Incremental Knowledge Graph Construction from Heterogeneous Data Sources

dc.typeJournal article
dspace.entity.typePublication
imec.internal.crawledAt2026-03-02
imec.internal.sourcecrawler
Files

Original bundle

Name:
van-assche-et-al-2026-incremental-knowledge-graph-construction-from-heterogeneous-data-sources.pdf
Size:
5.05 MB
Format:
Adobe Portable Document Format
Description:
Published
Publication available in collections: