Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Entity and relation extraction in texts with semi-supervised extensions

: Paaß, G.; Kindermann, J.

Gal, C.S.:
Security informatics and terrorism: patrolling the Web : Social and technical problems of detecting and controlling terrorists' use of the World Wide Web ; proceedings of the NATO Advanced Research Workshop on Security Informatics and Terrorism - Patrolling the Web, Beer-Sheva, Israel, 4-5 June 2007
Amsterdam: IOS Press, 2008 (NATO science for peace and security series 15)
ISBN: 978-1-58603-848-9
ISBN: 1-58603-848-6
Advanced Research Workshop on Security Informatics and Terrorism - Patrolling the Web <2008, Beer-Sheva>
Fraunhofer IAIS ()
named entity recognition; relation extraction; semi-supervised learning; conditional random fields

In the last few years the Internet has become a prominent vehicle for communication with the side effect that digital media also has become more relevant for criminal and terrorist activities. This necessitates the surveillance of these activities on the Internet. A simple way to monitor content is the spotting of suspicious words and phrases in texts. Yet one of the problems with simply looking for words is the ambiguity of words, whose meaning often depends on context. Information extraction aims at recovering the meaning of words and phrases from the neighboring words. We give an overview of term and relation extraction methods based on pattern matching and trainable statistical methods and report on experiments of semi-supervised training of such methods.