Supporting verification of news articles with automated search for semantically similar articles

Gupta, Vishwani; Beckh, Katharina; Giesselbach, Sven; Wegener, Dennis; Wirtz, Tim

2021

Conference Paper

Abstract

Fake information poses one of the major threats to society in the 21st century. Identifying misinformation has become a key challenge due to the amount of fake news that is published daily. Yet no established approach addresses the dynamics and versatility of fake news editorials. Instead of classifying content, we propose an evidence retrieval approach to handle fake news. The learning task is formulated as an unsupervised machine learning problem. For validation purposes, we provide the user with a set of news articles from reliable news sources that support the hypothesis of the queried news article, and the final decision is left to the user. Technically, we propose a two-step process: (i) Aggregation step: with information extracted from the given text, we query for similar content from reliable news sources. (ii) Refining step: we narrow the supporting evidence down by measuring the semantic distance between the text and the collection from step (i). The distance is calculated based on Word2Vec and the Word Mover's Distance. In our experiments, only content that is below a certain distance threshold is considered as supporting evidence. We find that our approach is agnostic to concept drift, i.e. the machine learning task is independent of the hypotheses in a text. This makes it highly adaptable in times where fake news is as diverse as classical news. Our pipeline offers the possibility for further analysis in the future, such as investigating bias and differences in news reporting.
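The refining step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the 2-D vectors in `TOY_VECTORS` are made-up stand-ins for Word2Vec embeddings, the function names and the threshold value are hypothetical, and the exact Word Mover's Distance is replaced by the relaxed variant (a lower bound that matches each word to its nearest counterpart), so that the example runs with the Python standard library only.

```python
import math

# Toy 2-D word vectors; illustrative stand-ins for Word2Vec embeddings (assumption).
TOY_VECTORS = {
    "president": (1.0, 0.0),
    "leader": (0.9, 0.1),
    "speaks": (0.0, 1.0),
    "talks": (0.1, 0.9),
    "banana": (-1.0, -1.0),
    "ripens": (-0.9, -1.1),
}


def _one_sided(doc_a, doc_b, vectors):
    """Mean Euclidean distance from each word in doc_a to its nearest word in doc_b."""
    total = 0.0
    for word_a in doc_a:
        total += min(math.dist(vectors[word_a], vectors[word_b]) for word_b in doc_b)
    return total / len(doc_a)


def relaxed_wmd(doc_a, doc_b, vectors):
    """Relaxed Word Mover's Distance: a lower bound on the exact WMD.

    Each token carries equal weight; out-of-vocabulary tokens are dropped.
    """
    a = [w for w in doc_a if w in vectors]
    b = [w for w in doc_b if w in vectors]
    if not a or not b:
        return math.inf
    return max(_one_sided(a, b, vectors), _one_sided(b, a, vectors))


def refine(query, candidates, vectors, threshold):
    """Keep candidates whose distance to the query is below the threshold, nearest first."""
    scored = [(relaxed_wmd(query, cand, vectors), cand) for cand in candidates]
    scored.sort(key=lambda pair: pair[0])
    return [cand for dist, cand in scored if dist < threshold]


# Usage: the query and candidates stand for tokenized articles from step (i).
query = ["president", "speaks"]
candidates = [["banana", "ripens"], ["leader", "talks"]]
evidence = refine(query, candidates, TOY_VECTORS, threshold=0.5)
```

With these toy vectors, only the candidate whose words lie close to the query's words passes the threshold. In a realistic setting the vectors would come from a trained Word2Vec model, and the threshold would have to be tuned on data.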

Author(s)

Gupta, Vishwani

Beckh, Katharina

Giesselbach, Sven

Wegener, Dennis

Wirtz, Tim

Main work

Workshop Reducing Online Misinformation Through Credible Information Retrieval, ROMCIR 2021. Online resource

Conference

Workshop Reducing Online Misinformation through Credible Information Retrieval (ROMCIR) 2021

European Conference on IR Research (ECIR) 2021
