• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. Effective approximation of parametrized closure systems over transactional data streams
 
  • Details
  • Full
Options
2020
Journal Article
Title

Effective approximation of parametrized closure systems over transactional data streams

Abstract
Strongly closed itemsets, defined by a parameterized closure operator, are a generalization of ordinary closed itemsets. Depending on the strength of closedness, the family of strongly closed itemsets typically forms a tiny subfamily of ordinary closed itemsets that is stable against changes in the input. In this paper we consider the problem of mining strongly closed itemsets from transactional data streams. Utilizing their algebraic and algorithmic properties, we propose an algorithm based on reservoir sampling for approximating this type of itemsets in the landmark streaming setting, prove its correctness, and show empirically that it yields a considerable speed-up over a straightforward naive algorithm without any significant loss in precision and recall. We motivate the problem setting considered by two practical applications. In particular, we first experimentally demonstrate that the above properties, i.e., compactness and stability, make strongly closed itemsets an excellent indicator of certain types of concept drifts in transactional data streams. As a second application we consider computer-aided product configuration, a real-world problem raised by an industrial project. For this problem, which is essentially exact concept identification, we propose a learning algorithm based on a certain type of subset queries formed by strongly closed itemsets and show on real-world datasets that it requires significantly less query evaluations than a naive algorithm based on membership queries.
Author(s)
Trabold, Daniel  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Horvath, Tamas  
Wrobel, Stefan  
Journal
Machine learning  
Project(s)
ML2R
Funder
Bundesministerium für Bildung und Forschung BMBF (Deutschland)  
DOI
10.1007/s10994-019-05827-w
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • data stream

  • itemset mining

  • closed itemsets

  • concept drift detection

  • application

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024