• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Increasing Interpretability in Outside Knowledge Visual Question Answering
 
  • Details
  • Full
Options
June 22, 2024
Conference Paper
Title

Increasing Interpretability in Outside Knowledge Visual Question Answering

Abstract
The field of Visual Question Answering (VQA) bridges the disciplines of vision- and language-based reasoning by combining scene understanding and the answering of arbitrary questions regarding a given image. The number of questions that can be answered is limited by the visual information given in an image, but it can be expanded by utilizing external knowledge from different sources. Recently, the Outside Knowledge Visual Question Answering (OK-VQA) task was introduced to facilitate research in this field. Several current state-of-the-art solutions incorporate Graph Neural Networks (GNNs) for this task. Like other Neural Network-based architectures, GNNs usually behave as black boxes. The interpretability of the reasoning behind predictions from GNNs is, however, a desirable property. Especially in the context of Knowledge Management within organizations, it can be important (and in some cases, is also required by law) to know how the reasoning behind decisions made by utilizing GNNs came to be. Nonetheless, increasing the interpretability can come at the cost of decreasing the overall performance of a model. The following investigation concludes that this does not have to be the case in every scenario by evaluating a GNN-based model developed for the OK-VQA task and a selection of proposed updates to this model, which are based on the attention mechanism. Furthermore, potential interpretation techniques are explored, which focus on considering the attention values.
Author(s)
Upravitelev, Max
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Krauss, Christopher  orcid-logo
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Kuhlmann, Isabelle
Mainwork
Knowledge Management in Organisations. 18th International Conference, KMO 2024. Proceedings  
Journal
Knowledge Management in Organisations
Communications in Computer and Information Science
Conference
International Conference on Knowledge Management in Organisations 2024  
DOI
10.1007/978-3-031-63269-3_24
Language
English
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Keyword(s)
  • Graph Neural Networks

  • Interpretability

  • Visual Question Answering

  • Technology transfer

  • Technology transfer

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024