Options
2014
Book Article
Title
Linguistics to structure unstructured information
Abstract
The extraction of semantics of unstructured documents requires the recognition and classification of textual patterns, their variability, and their inter-relationships, i.e., the analysis of the linguistic structure of documents. Being the integral part of a larger real-life application, this linguistic analysis process must be robust, fast and adaptable. This creates a big challenge for the development of the necessary linguistic base components. In this drill-down, we present several dimensions of this challenge and show how they have been successfully tackled in Ordo.