Options
2011
Conference Paper
Title
Multilingual content extraction extended with background knowledge for military intelligence
Abstract
Written information for military purposes is available in abundance. Documents are written in many languages. The question is how we can automate the content extraction of these documents. One possible approach is based on shallow parsing (information extraction) with application specific combination of analysis results. The ZENON research system is an example, it does a partial content analysis of some English, Dari and Tajik texts. Another principal approach for content extraction is based on a combination of deep and shallow parsing with logical inferences on the analysis results. In the project "Multilingual content analysis with semantic inference on military relevant texts" (mIE) we followed the second approach. In this paper we present the results of the mIE project. First, we briey contrast the ZENON project to the mIE project. In the main part of the paper, the mIE project is presented. After explaining the combined deep and shallow parsing approach with Head-driven Phrase Structured Grammars, the inference process is introduced. Then, we show how background knowledge is integrated into the logical inferences to increase the extent, quality and accuracy of the content extraction. The prototype is also presented.