• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Direct local pattern sampling by efficient two-step random procedures
 
  • Details
  • Full
Options
2011
Conference Paper
Title

Direct local pattern sampling by efficient two-step random procedures

Abstract
We present several exact and highly scalable local pattern sampling algorithms. They can be used as an alternative to exhaustive local pattern discovery methods (e.g, frequent set mining or optimistic-estimator-based subgroup discovery) and can substantially improve eficiency as well as controllability of pattern discovery processes. While previous sampling approaches mainly rely on the Markov chain Monte Carlo method, our procedures are direct, i.e., non processsimulating, sampling algorithms. The advantages of these direct methods are an almost optimal time complexity per pattern as well as an exactly controlled distribution of the produced patterns. Namely, the proposed algorithms can sample (item-)sets according to frequency, area, squared frequency, and a class discriminativity measure. Experiments demonstrate that these procedures can improve the accuracy of pattern-based models similar to frequent sets and often also lead to substantial gains in terms of scalability.
Author(s)
Boley, Mario  
Lucchese, Claudio
Paurat, Daniel  
Gärtner, Thomas  
Mainwork
17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2011. Proceedings  
Project(s)
LIFT  
Funder
European Commission EC  
Conference
International Conference on Knowledge Discovery and Data Mining (KDD) 2011  
Open Access
File(s)
Download (536.27 KB)
Rights
Use according to copyright law
DOI
10.1145/2020408.2020500
10.24406/publica-r-373478
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • local pattern discovery

  • sampling

  • patternbased classification

  • frequent sets

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024