Algorithm Design and Optimization for Knowledge Fusion: A Graph Matching-based Approach

Chunguang Li
Haixia Shang
Xiaolei Wu
Yuanyuan Wu
Lei Liu

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Knowledge fusion plays a crucial role in the construction of knowledge graphs, aiming to integrate knowledge from different data sources and improve the accuracy and completeness of the knowledge graph. Knowledge graphs built from different data sources often suffer from issues such as inconsistency, redundancy, and missing information, which negatively impact the quality and application effectiveness of the knowledge graph. Therefore, how to effectively fuse knowledge from different data sources has become an important topic in knowledge graph research. This paper proposes a graph matching-based algorithm for knowledge fusion, aiming to merge knowledge graphs from different sources into a unified one. The algorithm achieves knowledge fusion by identifying and matching identical or similar nodes and edges in different knowledge graphs, involving several steps including node matching, edge matching, and conflict resolution. Node matching identifies identical or similar nodes by calculating the similarity of their attributes, while edge matching further matches the relationships between nodes based on node matching. Conflict resolution is handled using rules or statistical methods to address conflicting information from different data sources, ensuring consistency and accuracy in the fusion results. To improve the efficiency and effectiveness of the algorithm, this paper proposes a series of optimization methods: feature selection and weight allocation, iterative matching and fusion, as well as parallel computing. Feature selection and weight allocation optimize the similarity calculation process by selecting key features and assigning appropriate weights to improve the accuracy of node and edge matching. Iterative matching and fusion continuously optimize fusion results by gradually performing node and edge matching. Parallel computing utilizes parallel computing techniques to accelerate the fusion process of large-scale knowledge graphs, enhancing algorithm processing capabilities and efficiency. Experimental results demonstrate significant effectiveness of the proposed algorithm in knowledge graph fusion. By testing with multiple publicly available knowledge graph datasets, the algorithm shows significant improvements in node matching accuracy, edge matching accuracy, and overall fusion quality compared to traditional fusion methods. Specifically, the average node matching accuracy has increased by over 10%, edge matching accuracy has significantly improved, and the overall fusion quality is also superior to the comparative methods. The experiments validate that the graph-based knowledge fusion algorithm can effectively enhance the quality and accuracy of knowledge graphs, demonstrating wide application prospects and practical value.

Version published to 10.21203/rs.3.rs-4641408/v1 on Research Square
Jul 23, 2024

Design of Knowledge Service Model Combining Dynamic Knowledge Graph and Enterprise Risk Management based on Bidirectional Encoder Representation from Transformers Bidirectional Long Short- Term Memory

This article has 5 authors:
1. Yu Jia-yin
2. Jiang Jiang
3. Ya-dong Shi
4. Zi-zhen LI
5. Shen Wu
This article has no evaluationsLatest version Jun 17, 2025
Intelligent Question-Answering on Geomorphology Knowledge Based on Knowledge Graph Retrieval-Augmented Generation Technology

This article has 4 authors:
1. Xueying Zhang
2. Junxi Du
3. Bohang Guo
4. Maosen Xiang
This article has no evaluationsLatest version May 16, 2025
M2M: Subgraph Matching Based on Minimum Spanning Tree and Candidate Graph

This article has 5 authors:
1. Haitao Ma
2. Ming Xu
3. Yufan Chen
4. Yuhai Zhao
5. Changyong Yu
This article has no evaluationsLatest version Jun 18, 2025

Listed in

Abstract

Article activity feed

Related articles

Design of Knowledge Service Model Combining Dynamic Knowledge Graph and Enterprise Risk Management based on Bidirectional Encoder Representation from Transformers Bidirectional Long Short- Term Memory

Intelligent Question-Answering on Geomorphology Knowledge Based on Knowledge Graph Retrieval-Augmented Generation Technology

M2M: Subgraph Matching Based on Minimum Spanning Tree and Candidate Graph