Fraunhofer-Gesellschaft

Publica

Here you will find scientific publications from the Fraunhofer Institutes.

Streaming transformation of XML to RDF using XPath-based mappings

 
Authors: Huang, J.-Y.; Lange, C.; Auer, S.

In:

Association for Computing Machinery -ACM-:
11th International Conference on Semantic Systems, SEMANTiCS 2015. Proceedings : 16-17 September 2015, Vienna, Austria
New York: ACM, 2015
ISBN: 978-1-4503-3462-4
pp. 129-136
International Conference on Semantic Systems (SEMANTICS) <11, 2015, Vienna>
English
Conference paper
Fraunhofer IAIS

Abstract
The Extensible Markup Language (XML) has become a widely adopted data interchange format. With the rise of Linked Data published using the Resource Description Framework (RDF), a number of tools for transforming XML to RDF have been developed. Specifying XML-RDF mappings for these tools often requires skills in programming languages such as XSLT or XQuery. Moreover, these tools are rarely able to deal with large XML inputs. We introduce an XML to RDF transformation approach, which is based on mappings comprising RDF triple templates that employ simple XPath expressions. Thanks to the restricted XPath expressions, which can be evaluated against a stream of XML data, our implementation can handle extremely large input XML files. To process the XML input efficiently, we employ XML filtering techniques and a strategy for selecting relevant XML nodes to generate RDF triples from. We show that the time complexity of our mapping algorithm is linear in the size of the XML input and also prove its practical efficiency with an evaluation on large real-world data.
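
Illustrative sketch (not the authors' implementation): the general idea of triple templates evaluated against a stream of XML can be shown with Python's standard `xml.etree.ElementTree.iterparse`. The mapping format, the `stream_triples` function, the sample data, and the example.org URIs are all hypothetical, and the paths are limited to plain child steps and one attribute lookup, which is far narrower than the XPath subset and filtering techniques the abstract refers to.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical mapping: every element at the absolute path `match` becomes
# one subject; each triple template pairs a predicate URI with a relative
# child path whose text becomes the object.
MAPPING = {
    "match": "/library/book",
    "subject": ("http://example.org/book/{}", "id"),  # URI template, attribute
    "triples": [
        ("http://purl.org/dc/terms/title", "title"),
        ("http://purl.org/dc/terms/creator", "author"),
    ],
}

def stream_triples(source, mapping):
    """Yield (subject, predicate, object) tuples while parsing incrementally."""
    target = mapping["match"].strip("/").split("/")
    stack = []  # currently open elements, root first
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            stack.append(elem)
            continue
        # "end" event: the element and its whole subtree have been read.
        if [e.tag for e in stack] == target:
            template, attr = mapping["subject"]
            subject = template.format(elem.get(attr))
            for predicate, child_path in mapping["triples"]:
                for node in elem.findall(child_path):
                    yield (subject, predicate, node.text)
            # Detach the processed subtree so memory does not grow
            # with the number of matched elements.
            if len(stack) > 1:
                stack[-2].remove(elem)
        stack.pop()

SAMPLE = (
    b"<library>"
    b"<book id='b1'><title>Dune</title><author>Herbert</author></book>"
    b"<book id='b2'><title>Emma</title></book>"
    b"</library>"
)

triples = list(stream_triples(io.BytesIO(SAMPLE), MAPPING))
for s, p, o in triples:
    print(f'<{s}> <{p}> "{o}" .')
```

Each parse event is handled once and each matched subtree is discarded after its triples are emitted, so this sketch reads the input in a single pass; it makes no claim about matching the algorithm or complexity analysis of the paper.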

URL: http://publica.fraunhofer.de/dokumente/N-418123.html