Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Combining statistical independence testing, visual attribute selection and automated analysis to find relevant attributes for classification

: May, Thorsten; Davey, James


Institute of Electrical and Electronics Engineers -IEEE-:
IEEE Symposium on Visual Analytics Science and Technology 2010. Proceedings : VAST 2010, October 24 - 28, Salt Lake City, Utah, USA
Piscataway: IEEE Computer Society, 2010
ISBN: 978-1-4244-9486-6
Symposium on Visual Analytics Science and Technology (VAST) <2010, Salt Lake City/Utah>
Conference Paper
Fraunhofer IGD ()
cluster analysis; content filtering; classification method

We present an iterative strategy for finding a relevant subset of attributes for the purpose of classification in high-dimensional, heterogeneous data sets. The attribute subset is used for the construction of a classifier function. In order to cope with the challenge of scalability, the analysis is split into an overview of all attributes and a detailed analysis of small groups of attributes. The overview provides generic information on statistical dependencies between attributes. With this information the user can select groups of attributes and an analytical method for their detailed analysis.
The detailed analysis involves the identification of redundant attributes (via classification or regression) and the creation of summarizing attributes (via clustering or dimension reduction). Our strategy does not prescribe specific analytical methods. Instead, we recursively combine the results of different methods to find or generate a subset of attributes to use for classification.