Sudhi, VijuVijuSudhiBhat, Sinchana RamakanthSinchana RamakanthBhatRudat, MaxMaxRudatTeucher, RomanRomanTeucher2024-08-282024-08-282024-07-10https://publica.fraunhofer.de/handle/publica/47418110.1145/3626772.36576602-s2.0-85200595421Owing to their size and complexity, large language models (LLMs) hardly explain why they generate a response. This effectively reduces the trust and confidence of end users in LLM-based applications, including Retrieval Augmented Generation (RAG) for Question Answering (QA) tasks. In this work, we introduce RAG-Ex, a model- and language-agnostic explanation framework that presents approximate explanations to the users revealing why the LLMs possibly generated a piece of text as a response, given the user input. Our framework is compatible with both open-source and proprietary LLMs. We report the significance scores of the approximated explanations from our generic explainer in both English and German QA tasks and also study their correlation with the downstream performance of LLMs. In the extensive user studies, our explainer yields an F1-score of 76.9% against the end user annotations and attains almost on-par performance with model-intrinsic approaches.enexplainabilitylarge language modelsretrieval augmented generationInformation retrievalEnd-usersGeneric frameworksLanguage modelModel-based OPCQuestion Answering TaskRetrieval augmented generationUser inputComputational linguisticsRAG-Ex: A Generic Framework for Explaining Retrieval Augmented Generationconference paper