Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Ontology-based entity recognition and annotation

: Hoppe, Thomas; Al Qundus, Jamal; Peikert, Silvio

Fulltext urn:nbn:de:0011-n-5750260 (589 KByte PDF)
MD5 Fingerprint: f2f5cadff47741a62d978692bfa91d5e
(CC) by
Created on: 5.2.2020

Paschke, Adrian (Ed.); Neudecker, Clemens (Ed.); Rehm, Georg (Ed.); Al Qundus, Jamal (Ed.); Pintscher, Lydia (Ed.) ; Fraunhofer-Institut für Offene Kommunikationssysteme -FOKUS-, Berlin:
Qurator 2020. Conference on Digital Curation Technologies. Online resource : Proceedings of the Conference on Digital Curation Technologies (Qurator 2020). Berlin, Germany, January 20th to 21st, 2020
Berlin: CEUR, 2020 (CEUR Workshop Proceedings 2535)
URN: urn:nbn:de:0074-2535-7
ISSN: 1613-0073
Paper 4, 11 pp.
International Conference on Digital Curation Technologies (Qurator) <1, 2020, Berlin>
Bundesministerium für Bildung und Forschung BMBF (Deutschland)
03WKDA1F; Qurator
Wachstumskern Qurator - Corporate Smart Insights
Conference Paper, Electronic Publication
Fraunhofer FOKUS ()
Ontology; Complex Entity Recognition; Text Annotation; DBpedia Spotlight; BioPortal; Annotator

The majority of transmitted information consists of written text, either printed or electronically. Extraction of this information from digital resources requires the identification of important entities. While Named Entity Recognition (NER) is an important task for the extraction of factual information and the construction of knowledge graphs, other information such as terminological concepts and relations between entities are of similar importance in the context of knowledge engineering, knowledge base enhancement and semantic search. While the majority of approaches focusses on NER recognition in the context of the World-Wide-Web and thus needs to cover the broad range of common knowledge, we focus in the present work on the recognition of entities in highly specialized domains and describe our approach to ontology-based entity recognition and annotation (OER). Our approach, implemented as a first prototype, outperforms existing approaches in precision of extracted entities, especially in the recognition of compound terms such as German Federal Ministry of Education and Research and inflected terms.