• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. GiCCS: A German in-Context Conversational Similarity Benchmark
 
  • Details
  • Full
Options
December 2022
Conference Paper
Title

GiCCS: A German in-Context Conversational Similarity Benchmark

Abstract
The Semantic textual similarity (STS) task is commonly used to evaluate the semantic representations that language models (LMs) learn from texts, under the assumption that good-quality representations will yield accurate similarity estimates. When it comes to estimating the similarity of two utterances in a dialogue, however, the conversational context plays a particularly important role. We argue for the need of benchmarks specifically created using conversational data in order to evaluate conversational LMs in the STS task. We introduce GiCCS, a first conversational STS evaluation benchmark for German. We collected the similarity annotations for GiCCS using best-worst scaling and presenting the target items in context, in order to obtain highly-reliable context-dependent similarity scores. We present benchmarking experiments for evaluating LMs on capturing the similarity of utterances. Results suggest that pretraining LMs on conversational data and providing conversational context can be useful for capturing similarity of utterances in dialogues. GiCCS will be publicly available to encourage benchmarking of conversational LMs.
Author(s)
Asaadi, Shima
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Kolagar, Zahra  orcid-logo
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Liebel, Alina
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Zarcone, Alessandra
Hochschule Augsburg
Mainwork
2nd Workshop on Natural Language Generation, Evaluation, and Metrics, GEM 2022. Proceedings  
Project(s)
Konzept für eine KI-basierte Sprachassistenzplattform und -ökosystem made in Germany  
Funder
Bundesministerium für Wirtschaft und Klimaschutz  
Conference
Workshop on Natural Language Generation, Evaluation, and Metrics 2022  
Conference on Empirical Methods in Natural Language Processing 2022  
Link
Link
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • STS

  • Semantic textual similarity

  • context-dependent similarity scores

  • benchmarking of conversational LMs

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024