• English
  • Deutsch
  • Log In
    Password Login
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Document clustering using a graph covering with pseudostable sets
 
  • Details
  • Full
Options
2017
Conference Paper
Titel

Document clustering using a graph covering with pseudostable sets

Abstract
In text mining, document clustering describes the efforts to assign unstructured documents to clusters, which in turn usually refer to topics. Clustering is widely used in science for data retrieval and organisation. In this paper we present a new graph theoretical approach to document clustering and its application on a real-world data set. We will show that the well-known graph partition to stable sets or cliques can be generalized to pseudostable sets or pseudocliques. This allows to make a soft clustering as well as a hard clustering. We will present an integer linear programming and a greedy approach for this NP-complete problem and discuss some results on random instances and some real world data for different similarity measures.
Author(s)
Dörpinghaus, Jens
Schaaf, Sebastian
Fluck, Juliane
Jacobs, Marc
Hauptwerk
Federated Conference on Computer Science and Information Systems 2017. Proceedings
Konferenz
Federated Conference on Computer Science and Information Systems (FedCSIS) 2017
International Workshop on Language Technologies and Applications 2017
Thumbnail Image
DOI
10.15439/2017F84
Language
English
google-scholar
Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022