Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Towards extending bag-of-words-models using context features for an 2D inverted index

: Manger, Daniel; Herrmann, C.; Willersinn, Dieter


Institute of Electrical and Electronics Engineers -IEEE-:
International Conference on Digital Image Computing: Techniques and Applications, DICTA 2016 : Gold Coast, Australia, 30 November - 02 December 2016
Piscataway, NJ: IEEE, 2016
ISBN: 978-1-5090-2897-9
ISBN: 978-1-5090-2896-2
5 S.
International Conference on Digital Image Computing - Techniques and Applications (DICTA) <2016, Gold Coast>
Fraunhofer IOSB ()
image and object retrieval; Bag-of-Words-Model; 2D inverted index

This paper addresses the image retrieval problem of finding images in a large dataset that contain similar scenes or objects to a given query image. Often, this task is performed with the popular Bag-of-Words (BoW)-Model which quantizes local features such as SIFT for speeding up the retrieval by using an inverted file indexing scheme. We focus on the limits of the model for very large-scale datasets since the quantization of the individual feature descriptors impairs their discriminative power. Thus, with growing datasets, the model gets increasingly distracted by irrelevant images that occasionally result in similar signatures. Our goal is to also consider neighboring features and their geometry and to condense them into a new context-feature which is meant to be quantized as well. As this new quantized context information introduces a second dimension in the BoW-Model, it supports both performance and accuracy during the retrieval step. Using the public datasets Oxford5k and Holidays, we define an appropriate framework and evaluate different ways of context feature construction, dimensionality reduction and quantization.