Options
November 3, 2023
Doctoral Thesis
Title
Investigating Graph Representation Learning Methods for Link Prediction in Knowledge Graphs
Abstract
Knowledge graphs (KGs) have become a fundamental approach to represent structured data and are employed in academic and industrial applications. KGs are used in various machine learning applications, such as question answering, dialogue systems, and recommendation systems. Although realworld KGs contain up to billions of links, they are usually still incomplete, which can severely impact downstream applications. Link prediction in KGs is the task of predicting missing links and can be performed in a transductive or inductive setting. In the past, a wide range of link prediction approaches have been proposed, encompassing rule-based and machine learning-based approaches. One promising line of research has been link prediction based on graph representation learning methods. In particular, a large number of knowledge graph embedding models (KGEMs) have been proposed and recently, also graph neural network (GNN) based approaches are used for link prediction within KGs. Despite the intensive research efforts in KGEMs, their capabilities are often not transparent. It has been shown that baseline models can obtain competitive results to the stateof-the-art models when configured appropriately, indicating that the performance of a KGEM may not merely depend on its model architecture, but on the interplay of various components. Link prediction within KGs has been investigated mainly within the transductive setting, prohibiting inference over unseen entities. However, lately, inductive link prediction approaches have obtained increased attention since they are capable of predicting links involving unseen entities. In this thesis, we propose an extensive ecosystem for investigating the performance of KGEM-based link prediction. We used the developed ecosystem to first perform a reproducibility study in which we investigated the reproducibility crisis of KGEM-based link prediction experiments. Second, we performed the most extensive KGEM-based link prediction study in which we investigated whether incremental performance improvements reported for KGEMs can solely be attributed to the model architectures or the combination of the KGEM’s components. After providing an in-depth analysis of transductive link prediction within triple-based KGs, we focus on inductive link prediction within hyper-relational KGs. We bridge the concepts of inductive link prediction and hyper-relational KGs and demonstrate that hyper-relational information improves semi- and fully-inductive link prediction. Finally, we demonstrate the effectiveness of knowledge graph representation learning for addressing biomedical applications.
Thesis Note
Bonn, Univ., Diss., 2023
Rights
Under Copyright
Language
English