2024
Conference Paper
Title

A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models

Abstract
The self-rationalising capabilities of LLMs are appealing because the generated explanations can give insights into the plausibility of the predictions. However, how faithful the explanations are to the predictions is questionable, raising the need to explore the patterns behind them further. To this end, we propose a hypothesis-driven statistical framework. We use a Bayesian network to implement a hypothesis about how a task (in our example, natural language inference) is solved, and its internal states are translated into natural language with templates. Those explanations are then compared to LLM-generated free-text explanations using automatic and human evaluations. This allows us to judge how similar the LLM’s and the Bayesian network’s decision processes are. We demonstrate the usage of our framework with an example hypothesis and two realisations in Bayesian networks. The resulting models do not exhibit a strong similarity to GPT-3.5. We discuss the implications of this as well as the framework’s potential to approximate LLM decisions better in future work.
Author(s)
Braun, Marc
Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA  
Kunz, Jenny
Mainwork
EACL 2024, 18th Conference of the European Chapter of the Association for Computational Linguistics. Proceedings of the Student Research Workshop  
Conference
Association for Computational Linguistics, European Chapter (EACL Conference) 2024  
Student Research Workshop 2024  
Language
English