Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Towards German Word Embeddings: A Use Case with Predictive Sentiment Analysis

: Brito, Eduardo; Sifa, Rafet; Cvejowski, Kostadin; Ojeda, César; Bauckhage, Christian


Haber, Peter; Lampoltshammer, Thomas; Mayr, Manfred:
Data Science - Analytics and Applications : Proceedings of the 1st International Data Science Conference iDSC2017
Berlin: Springer Vieweg, 2017
ISBN: 978-3-658-19286-0
ISBN: 978-3-658-19287-7
International Data Science Conference (iDSC) <1, 2017, Salzburg>
Conference Paper
Fraunhofer IAIS ()

Despite the research boom on words embeddings and their text mining applications from the last years, the vast majority of publications focus only on the English language. Furthermore, hyperparameter tuning is a rarely well documented process (specially for non English text) that is necessary to obtain high quality word representations. In this work, we present how different hyperparameter combinations impact the resulting German word vectors and how these word representations can be part of more complex models. In particular, we perform first an intrinsic evaluation of our German word embeddings, which are later used within a predictive sentiment analysis model. The latter does not only serve as an extrinsic evaluation of the German word embeddings but also shows the feasibility of predic ting preferences only from document embeddings.