• English
  • Deutsch
  • Log In
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. The Impact of Event Log Subset Selection on the Performance of Process Discovery Algorithms
 
  • Details
  • Full
Options
2019
  • Konferenzbeitrag

Titel

The Impact of Event Log Subset Selection on the Performance of Process Discovery Algorithms

Abstract
Process discovery algorithms automatically discover process models on the basis of event data, captured during the execution of business processes. These algorithms tend to use all of the event data to discover a process model. When dealing with large event logs, it is no longer feasible using standard hardware in limited time. A straightforward approach to overcome this problem is to down-size the event data by means of sampling. However, little research has been conducted on selecting the right sample, given the available time and characteristics of event data. This paper evaluates various subset selection methods and evaluates their performance on real event data. The proposed methods have been implemented in both the ProM and the RapidProM platforms. Our experiments show that it is possible to speed up discovery considerably using ranking-based strategies. Furthermore, results show that biased selection of the process instances compared to random selection of them will result in process models with higher quality.
Author(s)
Fani Sani, M.
Zelst, S.J. van
Aalst, W.M.P. van der
Hauptwerk
New Trends in Databases and Information Systems. ADBIS 2019 Short Papers, Workshops BBIGAP, QAUCA, SemBDM, SIMPDA, M2P, MADEISD and Doctoral Consortium. Proceedings
Konferenz
European Conference on Advances in Databases and Information Systems (ADBIS) 2019
Workshop on Semantics in Big Data Management (SemBDM) and Data-Driven Process Discovery and Analysis (SIMPDA) 2019
Thumbnail Image
DOI
10.1007/978-3-030-30278-8_39
Language
Englisch
google-scholar
FIT
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022