Fraunhofer-Gesellschaft

Publica

Here you can find scientific publications from the Fraunhofer Institutes.

Text mining in full text articles - methodical and representation issues

3rd International Biocuration Conference, Berlin, 16-19 April 2009
 
Authors: Klinger, R.; Pesch, R.; Mevissen, T.; Fluck, J.

Fulltext urn:nbn:de:0011-n-936860 (1.4 MByte PDF)
MD5 Fingerprint: eb74f44a09962834761d9b35f3cddce4
Created on: 26.8.2009


Nature Precedings. Online preprint server (2009), 22 April, 1 p.
http://precedings.nature.com/
International Biocuration Conference <3, 2009, Berlin>
English
Poster, Electronic Publication
Fraunhofer SCAI
visualization; PDF; text mining; full text; text parsing; HTML; journals; parsing; ProMiner; publishing; biodatabase; biocurator

Abstract
In many cases, the information in abstracts of biomedical publications is not sufficient for annotating database entries. Text mining systems that support curators of biodatabases should therefore be able to process full-text articles. Besides the technical problems arising from full-text parsing, the representation of the annotated full text is an important issue. Journal articles are mostly available electronically in PDF or HTML format. Even where more easily manageable XML formats exist, readers would like to see annotations and semantic enrichment visualised directly in the PDF or HTML. We summarize the technical problems arising from parsing HTML and PDF journal full texts and show first results of visualisation in both formats.
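
To illustrate what visualising annotations directly in HTML can involve, the following is a minimal sketch in Python. It is not the authors' implementation and does not use ProMiner: a plain dictionary lookup stands in for the entity recognition step, and the names `AnnotatingParser`, `annotate_html`, the `annotation` CSS class and the identifiers `ID-1` and `ID-2` are all hypothetical. The point it shows is that matches should be wrapped only in text nodes, so that tag attributes, entities, scripts and styles pass through unchanged.

```python
import re
from html import escape
from html.parser import HTMLParser


class AnnotatingParser(HTMLParser):
    """Re-emit HTML, wrapping dictionary terms found in text nodes in <span> tags."""

    def __init__(self, terms):
        # convert_charrefs=False reports entities such as &amp; as separate
        # events, so they can be written back unchanged.
        super().__init__(convert_charrefs=False)
        if not terms:
            raise ValueError("terms must not be empty")
        self.terms = terms      # hypothetical mapping: surface form -> identifier
        self.out = []           # output fragments, joined at the end
        self.skip_depth = 0     # > 0 while inside <script> or <style>
        # Longest terms first, so longer names win over their prefixes.
        ordered = sorted(terms, key=len, reverse=True)
        self.pattern = re.compile(
            r"\b(" + "|".join(re.escape(t) for t in ordered) + r")\b"
        )

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self.skip_depth += 1
        self.out.append(self.get_starttag_text())  # raw tag, attributes untouched

    def handle_startendtag(self, tag, attrs):
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            self.skip_depth = max(0, self.skip_depth - 1)
        self.out.append(f"</{tag}>")

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")

    def handle_data(self, data):
        if self.skip_depth:
            self.out.append(data)  # never annotate script or style content
        else:
            self.out.append(self.pattern.sub(self._wrap, data))

    def _wrap(self, match):
        identifier = escape(self.terms[match.group(1)], quote=True)
        return f'<span class="annotation" title="{identifier}">{match.group(1)}</span>'


def annotate_html(html_text, terms):
    """Return html_text with every dictionary term in a text node wrapped in a span."""
    parser = AnnotatingParser(terms)
    parser.feed(html_text)
    parser.close()
    return "".join(parser.out)


# Placeholder identifiers, not real database accessions.
example_terms = {"BRCA1": "ID-1", "TP53": "ID-2"}
example_html = '<p title="BRCA1 info">Mutations in BRCA1 and TP53 &amp; more.</p>'
annotated = annotate_html(example_html, example_terms)
```

In this sketch the `title` attribute of the paragraph is left alone even though it contains a dictionary term, because only text nodes are searched. Real journal HTML is often not well formed, and PDF offers no comparable tag structure at all, so a production system would need considerably more than this.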

URL: http://publica.fraunhofer.de/documents/N-93686.html