Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Why reinvent the wheel. Let's build question answering systems together

: Singh, Kuldeep; Radhakrishna, Arun Sethupat; Both, Andreas; Shekarpour, Saeedeh; Lytra, Ioanna; Usbeck, Ricardo; Vyas, Akhilesh; Khikmatullaev, Akmal; Punjani, Dharmen; Lange, Christoph; Vidal, Maria-Esther; Lehmann, Jens; Auer, Sören

Volltext urn:nbn:de:0011-n-4999897 (997 KByte PDF)
MD5 Fingerprint: 3cdad21d419240910b3b9855cf33eab7
(CC) by
Erstellt am: 10.7.2018

Association for Computing Machinery -ACM-:
WWW 2018, World Wide Web Conference. Proceedings : Lyon, France, April 23 - 27, 2018
New York: ACM, 2018
ISBN: 978-1-4503-5639-8
World Wide Web Conference (WWW) <2018, Lyon>
European Commission EC
H2020; 642795; WDAqua
Answering Questions using Web Data
European Commission EC
H2020; 644564; BigDataEurope
Integrating Big Data, Software and Communities for Addressing Europe’s Societal Challenges
Konferenzbeitrag, Elektronische Publikation
Fraunhofer IAIS ()

Modern question answering (QA) systems need to flexibly integrate a number of components specialised to fulfil specific tasks in a QA pipeline. Key QA tasks include Named Entity Recognition and Disambiguation, Relation Extraction, and Query Building. Since a number of different software components exist that implement different strategies for each of these tasks, it is a major challenge to select and combine the most suitable components into a QA system, given the characteristics of a question. We study this optimisation problem and train classifiers, which take features of a question as input and have the goal of optimising the selection of QA components based on those features. We then devise a greedy algorithm to identify the pipelines that include the suitable components and can effectively answer the given question. We implement this model within Frankenstein, a QA framework able to select QA components and compose QA pipelines. We evaluate the effectiveness of the pipelines generated by Frankenstein using the QALD and LC-QuAD benchmarks. These results not only suggest that Frankenstein precisely solves the QA optimisation problem but also enables the automatic composition of optimised QA pipelines, which outperform the static Baseline QA pipeline. Thanks to this flexible and fully automated pipeline generation process, new QA components can be easily included in Frankenstein, thus improving the performance of the generated pipelines.