Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Online k-Maxoids clustering

: Sifa, Rafet; Bauckhage, Christian


Institute of Electrical and Electronics Engineers -IEEE-; Association for Computing Machinery -ACM-; American Statistical Association -ASA-:
International Conference on Data Science and Advanced Analytics, DSAA 2017 : Tokyo, Japan, 19-21 October 2017; Proceedings
Piscataway, NJ: IEEE, 2017
ISBN: 978-1-5090-5004-8
ISBN: 978-1-5090-5005-5 (Print)
International Conference on Data Science and Advanced Analytics (DSAA) <4, 2017, Tokyo>
Fraunhofer IAIS ()
clustering algorithm; algorithm design and analysis; prototype; robustness; Data Science; training; optimization; Archetypal Analysis; unsupervised learning; online cluster analysis; behavioral profiling

We present an online learning algorithm to extract extremal prototypes from a set of data. As an online algorithm, our method can continue to learn during the application phase of a system. However, as a greedy update procedure, it may be sensitive to outliers. We therefore consider the use of extreme value theory for self-assessment and discuss how to incorporate Weibull statistics so as to increase robustness. We evaluate our approaches on synthetic as well as real world datasets to perform profiling. Our empirical results show that incorporating self-assessment not only results in better data representations but also reveals interpretable insights about the analyzed dataset.