• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Toolkit-based high-performance data mining of large data on MapReduce clusters
 
  • Details
  • Full
Options
2009
Conference Paper
Title

Toolkit-based high-performance data mining of large data on MapReduce clusters

Abstract
The enormous growth of data in a variety of applications has increased the need for high performance data mining based on distributed environments. However, standard data mining toolkits per se do not allow the usage of computing clusters. The success of MapReduce for analyzing large data has raised a general interest in applying this model to other, data intensive applications. Unfortunately current research has not lead to an integration of GUI based data mining toolkits with distributed file system based MapReduce systems. This paper defines novel principles for modeling and design of the user interface, the storage model and the computational model necessary for the integration of such systems. Additionally, it introduces a novel system architecture for interactive GUI based data mining of large data on clusters based on MapReduce that overcomes the limitations of data mining toolkits. As an empirical demonstration we show an implementation based on Weka and Hadoop.
Author(s)
Wegener, Dennis  orcid-logo
Mock, Michael  
Adranale, D.
Wrobel, Stefan  
Mainwork
Ninth IEEE International Conference on Data Mining Workshops, ICDMW 2009  
Conference
International Conference on Data Mining (ICDM) 2009  
Workshop on Large-Scale Data Mining - Theory and Applications 2009  
Open Access
File(s)
Download (715.13 KB)
Rights
Use according to copyright law
DOI
10.1109/ICDMW.2009.34
10.24406/publica-r-363459
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • MapReduce

  • Hadoop

  • Weka

  • data mining toolkit

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024