Auto Encoding Explanatory Examples with Stochastic Paths

Ojeda, César; Sánchez, Ramsés J.; Cvejoski, Kostadin; Schücker, Jannis; Bauckhage, Christian; Georgiev, Bogdan

doi:10.1109/ICPR48806.2021.9413267

May 5, 2021

Conference Paper

Abstract

In this paper we ask for the main factors that determine a classifiers decision making process and uncover such factors by studying latent codes produced by auto-encoding frameworks. To deliver an explanation of a classifiers behaviour, we propose a method that provides series of examples highlighting semantic differences between the classifiers decisions. These examples are generated through interpolations in latent space. We introduce and formalize the notion of a semantic stochastic path, as a suitable stochastic process defined in feature (data) space via latent code interpolations. We then introduce the concept of semantic Lagrangians as a way to incorporate the desired classifiers behaviour and find that the solution of the associated variational problem allows for highli ghting differences in the classifier decision. Very importantly, within our framework the classifier is used as a black-box, and only its evaluation is required.