Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Chemical structure reconstruction with chemoCR

 
: Zimmermann, M.

Voorhees, E.M. ; National Institute of Standards and Technology -NIST-:
The Twentieth Text REtrieval Conference, TREC 2011. Proceedings : November 2011, Gaithersburg, Maryland, USA
Gaithersburg, MD: NIST, 2011 (NIST special publication 500-295)
Text REtrieval Conference (TREC) <20, 2011, Gaithersburg/Md.>
Englisch
Konferenzbeitrag
Fraunhofer SCAI ()

Abstract
chemoCR makes chemical information contained in depictions of chemical structures accessible as connection table for computer programs. In order to solve the problem of recognizing and translating chemical structures in image documents, our chemoCR system combines pattern recognition techniques with supervised machine learning concepts. The method is based on the idea of identifying from structural formulas the most significant semantic entities. Semantic entities are for example chiral bonds, superatoms and reaction arrows. The workflow consists of three phases: image preprocessing, semantic entity recognition, and molecule reconstruction plus validation of the result. All steps of the process make use of chemical knowledge in order to detect and fix errors. The system can be trained and adapted to different sources of input images. The reconstructed connection table can be used by all chemical software.

: http://publica.fraunhofer.de/dokumente/N-254001.html