Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Visual-interactive exploration of interesting multivariate relations in mixed research data sets

: Bernard, Jürgen; Steiger, Martin; Widmer, Sven; Lücke-Tieke, Hendrik; May, Thorsten; Kohlhammer, Jörn


Computer graphics forum 33 (2014), No.3, pp.291-300
ISSN: 0167-7055
ISSN: 1467-8659
Conference on Visualization (Vis) <16, 2014, Swansea>
Journal Article, Conference Paper
Fraunhofer IGD ()
Visual analytics; information visualization; Multivariate data; visualization of multidimensional feature spaces; decision support; data exploration; similarity measures; subspace mapping

The analysis of research data plays a key role in data-driven areas of science. Varieties of mixed research data sets exist and scientists aim to derive or validate hypotheses to find undiscovered knowledge. Many analysis techniques identify relations of an entire dataset only. This may level the characteristic behavior of different subgroups in the data. Like automatic subspace clustering, we aim at identifying interesting subgroups and attribute sets. We present a visual-interactive system that supports scientists to explore interesting relations between aggregated bins of multivariate attributes in mixed data sets. The abstraction of data to bins enables the application of statistical dependency tests as the measure of interestingness. An overview matrix view shows all attributes, ranked with respect to the interestingness of bins. Complementary, a node-link view reveals multivariate bin relations by positioning dependent bins close to each other. The system supports information drill-down based on both expert knowledge and algorithmic support. Finally, visual-interactive subset clustering assigns multivariate bin relations to groups. A list-based cluster result representation enables the scientist to communicate multivariate findings at a glance. We demonstrate the applicability of the system with two case studies from the earth observation domain and the prostate cancer research domain. In both cases, the system enabled us to identify the most interesting multivariate bin relations, to validate already published results, and, moreover, to discover unexpected relations.