Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Applying Heuristic and Machine Learning Strategies to Product Resolution

 
: Strauß, Oliver; Almheidat, Ahmad; Kett, Holger

:
Volltext ()

Bozzon, Alessandro (Ed.) ; Institute for Systems and Technologies of Information, Control and Communication -INSTICC-, Setubal:
WEBIST 2019, 15th International Conference on Web Information Systems and Technologies. Proceedings : September 18-20, 2019, Vienna, Austria
Setúbal: SciTePress, 2019
ISBN: 978-989-758-386-5
S.242-249
International Conference on Web Information Systems and Technologies (WEBIST) <15, 2019, Vienna>
Bundesministerium für Bildung und Forschung BMBF (Deutschland)
01QE1632B; EUROSTARS
Intelligente Produktdatenextraktion und Product Resolution
Englisch
Konferenzbeitrag, Elektronische Publikation
Fraunhofer IAO ()

Abstract
In order to analyze product data obtained from different web shops a process is needed to determine which product descriptions refer to the same product (product resolution). Based on string similarity metrics and existing product resolution approaches a new approach is presented with the following components: a) extraction of information from the unstructured product title extracted from the e-shops, b) inclusion of additional information in the matching process, c) a method to compute a product similarity metric from the available data, d) optimization and adaption of model parameters to the characteristics of the underlying data via a genetic algorithm and e) a framework to automatically evaluate the matching method on the basis of realistic test data. The approach achieved a precision of 0.946 and a recall of 0.673.

: http://publica.fraunhofer.de/dokumente/N-565067.html