ELOQUENT Lab at CLEF 2026: Evaluation of Generative Language Model Quality

Karlgren, Jussi; Barrett, Maria; Bojar, Ondřej; Engels, Marie Isabel; Fabre, Diandra; Goeuriot, Lorraine; Mothe, Josiane; Mulhem, Philippe; Piacentini, Mario; Madriz, Luis Francisco Vargas; Schwab, Didier; Šindelář, Pavel; Stampoulidis, Georgios; Thomas, Katherina; Vartampetian, Markarit

doi:10.1007/978-3-032-21321-1_36

2026

Conference Paper

Abstract

The ELOQUENT lab for evaluation of generative language model quality and usefulness addresses high-level quality criteria for generative language models through a set of open-ended shared tasks implemented, where possible, to minimise human effort in assessment, and with an objective to study how much the languages that the foundation model has been trained on make a difference in its responses. In this third ELOQUENT edition, the three planned tasks investigate how human-like text generated by language models can be (the Voight-Kampff task), how reliably a language model handles varied but equivalent input across languages (the Robustness and Consistency task), and if a generative language model can be used productively to generate and score topical quizzes without diverging into general knowledge acquired in foundational training (the PISA task). All tasks are continued evolved versions of previous editions’ tasks.

Author(s)

Karlgren, Jussi

Silo AI

Barrett, Maria

Silo AI

Bojar, Ondřej

Charles University

Engels, Marie Isabel

Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Fabre, Diandra

Université Grenoble Alpes

Goeuriot, Lorraine

Université Grenoble Alpes

Mothe, Josiane

Université de Toulouse

Mulhem, Philippe

Université Grenoble Alpes

Piacentini, Mario

L'Organisation de Coopération et de Développement Economiques

Madriz, Luis Francisco Vargas

L'Organisation de Coopération et de Développement Economiques

Schwab, Didier

Université Grenoble Alpes

Šindelář, Pavel

Charles University

Stampoulidis, Georgios

Silo AI

Thomas, Katherina

L'Organisation de Coopération et de Développement Economiques

Vartampetian, Markarit

Université Grenoble Alpes

Mainwork

Advances in Information Retrieval. 48th European Conference on Information Retrieval, ECIR 2026. Proceedings. Part IV

Conference

European Conference on Information Retrieval 2026

Options

ELOQUENT Lab at CLEF 2026: Evaluation of Generative Language Model Quality