Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Using commonsense knowledge to automatically create (noisy) training examples from text

 
: Natarajan, S.; Picado, J.; Khot, T.; Kersting, K.; Re, C.; Shavlik, J.

Association for the Advancement of Artificial Intelligence -AAAI-:
Statistical relational artificial intelligence. Papers from the 2013 AAAI workshop : Collocated with AAAI-13 and held Monday, July 15, 2013 in Bellevue, Washington, USA
AI Access Foundation, 2013 (Technical Report AAAI-WS-13-16)
ISBN: 978-1-57735-627-1
S.31-36
Workshop Statistical Relational Artificial Intelligence <2013, Bellevue/Wash.>
Conference on Artificial Intelligence (AAAI) <27, 2013, Bellevue/Wash.>
Englisch
Konferenzbeitrag
Fraunhofer IAIS ()

Abstract
One of the challenges to information extraction is the requirement of human annotated examples. Current successful approaches alleviate this problem by employing some form of distant supervision i.e., look into knowledge bases such as Freebase as a source of supervision to create more examples. While this is perfectly reasonable, most distant supervision methods rely on a hand coded background knowledge that explicitly looks for patterns in text. In this work, we take a different approach - we create weakly supervised examples for relations by using commonsense knowledge. The key innovation is that this commonsense knowledge is completely independent of the natural language text. This helps when learning the full model for information extraction as against simply learning the parameters of a known CRF or MLN. We demonstrate on two domains that this form of weak supervision yields superior results when learning structure compared to simply using the gold standard labels. Copyright

: http://publica.fraunhofer.de/dokumente/N-350637.html