• English
  • Deutsch
  • Log In
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Learning Protein Protein Interaction Extraction using Distant Supervision
 
  • Details
  • Full
Options
2011
  • Konferenzbeitrag

Titel

Learning Protein Protein Interaction Extraction using Distant Supervision

Alternative
Learning to Extract Protein-Protein Interactions using Distant Supervision
Abstract
Most relation extraction methods, especially in the domain of biology, rely on machine learning methods to classify a co-occurring pair of entities in a sentence to be related or not. Such an approach requires a training corpus, which involves expert annotation and is tedious, time-consuming, and expensive. We overcome this problem by the use of existing knowledge in structured databases to automatically generate a training corpus for protein-protein interactions. An extensive evaluation of different instance selection strategies is performed to maximize robustness on this presumably noisy resource. Successful strategies to consistently improve performance include a majority voting ensemble of classifiers trained on subsets of the training corpus and the use of knowledge bases consisting of proven non-interactions. Our best configured model built without manually annotated data shows very competitive results on several publicly available benchmark corpora.
Author(s)
Thomas, P.
Solt, I.
Klinger, R.
Leser, U.
Hauptwerk
Proceedings of Workshop on Robust Unsupervised and Semisupervised Methods in Natural Language Processing, ROBUS 2011
Konferenz
Workshop on Robust Unsupervised and Semisupervised Methods in Natural Language Processing (ROBUS) 2011
Thumbnail Image
Externer Link
Externer Link
Language
Englisch
google-scholar
SCAI
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022