Title: MaxSimE: Explaining Transformer-based Semantic Similarity via Contextualized Best Matching Token Pairs
Authors: Brito Chacon, Eduardo Alfredo; Iser, Henri
Document type: conference paper
Language: English
License: CC BY 4.0
Publication date: 2023-07-18
Added to repository: 2024-02-09
Handle: https://publica.fraunhofer.de/handle/publica/461966
DOI: 10.24406/h-461966 (https://doi.org/10.24406/h-461966)
Publisher DOI: 10.1145/3539618.3592017
Scopus ID: 2-s2.0-85168669787

Abstract: Current semantic search approaches rely on black-box language models, such as BERT, which limit their interpretability and transparency. In this work, we propose MaxSimE, an explanation method for language models applied to measure semantic similarity. Our approach is inspired by the explainable-by-design ColBERT architecture and generates explanations by matching contextualized query tokens to the most similar tokens from the retrieved document according to the cosine similarity of their embeddings. Unlike existing post-hoc explanation methods, which may lack fidelity to the model and thus fail to provide trustworthy explanations in critical settings, we demonstrate that MaxSimE can generate faithful explanations under certain conditions and show how it improves the interpretability of semantic search results on ranked documents from the LoTTE benchmark, demonstrating its potential for trustworthy information retrieval.

Keywords: ad-hoc explanations; explainable search; neural models; semantic similarity; trustworthy information retrieval
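
The matching step described in the abstract can be illustrated with a minimal sketch. The function name maxsim_explanation and the toy tokens below are hypothetical and not taken from the paper or its released code; the sketch only assumes access to contextualized token embeddings (e.g., from a ColBERT-style encoder) and pairs each query token with its most similar document token by cosine similarity.

    # Illustrative sketch, not the authors' implementation: pair each query token
    # with its most similar document token by cosine similarity (ColBERT-style MaxSim).
    import numpy as np

    def maxsim_explanation(query_tokens, query_emb, doc_tokens, doc_emb):
        """Return (query_token, best_doc_token, cosine_sim) triples and the summed score.

        query_emb: array of shape (num_query_tokens, dim)
        doc_emb:   array of shape (num_doc_tokens, dim)
        """
        # L2-normalize so the dot product equals cosine similarity.
        q = query_emb / np.linalg.norm(query_emb, axis=1, keepdims=True)
        d = doc_emb / np.linalg.norm(doc_emb, axis=1, keepdims=True)

        sim = q @ d.T                      # (num_query_tokens, num_doc_tokens)
        best = sim.argmax(axis=1)          # best-matching doc token per query token
        pairs = [(query_tokens[i], doc_tokens[j], float(sim[i, j]))
                 for i, j in enumerate(best)]
        score = float(sim.max(axis=1).sum())  # ColBERT-style MaxSim relevance score
        return pairs, score

    # Toy usage with random embeddings standing in for a real encoder's output.
    rng = np.random.default_rng(0)
    q_toks = ["what", "causes", "rain"]
    d_toks = ["rain", "forms", "when", "water", "condenses"]
    pairs, score = maxsim_explanation(q_toks, rng.normal(size=(3, 8)),
                                      d_toks, rng.normal(size=(5, 8)))
    for q_tok, d_tok, s in pairs:
        print(f"{q_tok:>8} -> {d_tok:<10} sim={s:.3f}")
    print("score:", round(score, 3))

The returned token pairs are the explanation itself: each query token is shown alongside the document token that contributed its maximum similarity to the overall relevance score, which is what makes the scoring decision inspectable.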