Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Assisted descriptor selection based on visual comparative data analysis

: Bremm, Sebastian; Landesberger, Tatiana von; Bernard, Jürgen; Schreck, Tobias


Computer graphics forum 30 (2011), Nr.3, S.891-900
ISSN: 0167-7055
ISSN: 1467-8659
Symposium on Visualization (EuroVis) <13, 2011, Bergen>
Zeitschriftenaufsatz, Konferenzbeitrag
Fraunhofer IGD ()
interactive visualization; feature selection; self-organizing Maps (SOM); Forschungsgruppe Visual Search and Analysis (VISA)

Exploration and selection of data descriptors representing objects using a set of features are important components in many data analysis tasks. Usually, for a given dataset, an optimal data description does not exist, as the suitable data representation is strongly use case dependent. Many solutions for selecting a suitable data description have been proposed. In most instances, they require data labels and often are black box approaches. Non-expert users have difficulties to comprehend the coherency of input, parameters, and output of these algorithms. Alternative approaches, interactive systems for visual feature selection, overburden the user with an overwhelming set of options and data views. Therefore, it is essential to offer the users guidance in this analytical process.
In this paper, we present a novel system for data description selection, which facilitates the user's access to the data analysis process. As finding of suitable data description consists of several steps, we support the user with guidance. Our system combines automatic data analysis with interactive visualizations. By this, the system provides a recommendation for suitable data descriptor selections. It supports the comparison of data descriptors with differing dimensionality for unlabeled data. We propose specialized scores and interactive views for descriptor comparison. The visualization techniques are scatterplot-based and grid-based. For the latter case, we apply Self-Organizing Maps as adaptive grids which are well suited for large multi-dimensional data sets. As an example, we demonstrate the usability of our system on a real-world biochemical application.