Publica
Hier finden Sie wissenschaftliche Publikationen aus den FraunhoferInstituten. Optimal cluster preserving embedding of nonmetric proximity data
 IEEE Transactions on Pattern Analysis and Machine Intelligence 25 (2003), Nr.12, S.15401551 ISSN: 01628828 

 Englisch 
 Zeitschriftenaufsatz 
 Fraunhofer FIRST () 
Abstract
For several major applications of data analysis, objects are often not represented as feature vectors in a vector space, but rather by a matrix gathering pairwise proximites. Such pairwise data often violates metricity and, therefore, cannot be naturally embedded in a vector space. Concerning the problem of unsupervised structure detection or clustering, in this paper, a new embedding method for pairwise data into Euclidean vector spaces is introduced. We show that all clustering methods, which are invariant under additive shifts of the pairwise proximities, can be reformulated as grouping problems in Euclidian spaces. The most prominent property of this constant shift embedding framework is the complete preservation of the cluster structure in the embedding space. Restating pairwise clustering problems in vector spaces has several important consequences, such as the statistical description of the clusters by way of cluster prototypes, the generic extension of the grouping procedure to a discriminative prediction rule, and the applicability of standard preprocessing methods like denoising or dimensionality reduction.