
Publica
Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten. Ontology-based entity recognition and annotation
:
Volltext urn:nbn:de:0011-n-5750260 (589 KByte PDF) MD5 Fingerprint: f2f5cadff47741a62d978692bfa91d5e Erstellt am: 5.2.2020 |
| Paschke, Adrian (Ed.); Neudecker, Clemens (Ed.); Rehm, Georg (Ed.); Al Qundus, Jamal (Ed.); Pintscher, Lydia (Ed.) ; Fraunhofer-Institut für Offene Kommunikationssysteme -FOKUS-, Berlin: Qurator 2020. Conference on Digital Curation Technologies. Online resource : Proceedings of the Conference on Digital Curation Technologies (Qurator 2020). Berlin, Germany, January 20th to 21st, 2020 Berlin: CEUR, 2020 (CEUR Workshop Proceedings 2535) http://ceur-ws.org/Vol-2535 URN: urn:nbn:de:0074-2535-7 ISSN: 1613-0073 Paper 4, 11 S. |
| International Conference on Digital Curation Technologies (Qurator) <1, 2020, Berlin> |
| Bundesministerium für Bildung und Forschung BMBF (Deutschland) 03WKDA1F; Qurator Wachstumskern Qurator - Corporate Smart Insights |
|
| Englisch |
| Konferenzbeitrag, Elektronische Publikation |
| Fraunhofer FOKUS () |
| Ontology; Complex Entity Recognition; Text Annotation; DBpedia Spotlight; BioPortal; Annotator |
Abstract
The majority of transmitted information consists of written text, either printed or electronically. Extraction of this information from digital resources requires the identification of important entities. While Named Entity Recognition (NER) is an important task for the extraction of factual information and the construction of knowledge graphs, other information such as terminological concepts and relations between entities are of similar importance in the context of knowledge engineering, knowledge base enhancement and semantic search. While the majority of approaches focusses on NER recognition in the context of the World-Wide-Web and thus needs to cover the broad range of common knowledge, we focus in the present work on the recognition of entities in highly specialized domains and describe our approach to ontology-based entity recognition and annotation (OER). Our approach, implemented as a first prototype, outperforms existing approaches in precision of extracted entities, especially in the recognition of compound terms such as German Federal Ministry of Education and Research and inflected terms.