• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. Benchmarking of Data Preprocessing Methods for Machine Learning-Applications in Production
 
  • Details
  • Full
Options
2021
Journal Article
Title

Benchmarking of Data Preprocessing Methods for Machine Learning-Applications in Production

Abstract
The application of machine learning (ML) is becoming increasingly common in production. However, many ML-projects in production fail due to poor data quality. To increase the quality, data needs to be preprocessed. Hundreds of methods exist for data preprocessing (DPP) that are selected manually depending on use-case requirements. For these reasons, DPP is currently performed unstructured and accounts for 80 % of ML-projects duration. Thus, we introduce a structured DPP-approach, in which DPP-methods are recommended based on production use-case requirements by benchmarking identified DPP-methods according to ML-model performance on five data sets. The approach is validated through two new use-cases.
Author(s)
Frye, Maik  
Fraunhofer-Institut für Produktionstechnologie IPT  
Mohren, Johannes
Fraunhofer-Institut für Produktionstechnologie IPT  
Schmitt, Robert  
Fraunhofer-Institut für Produktionstechnologie IPT  
Journal
Procedia CIRP  
Project(s)
Centre of Excellence in Production Informatics and Control  
Funder
European Commission EC  
Conference
Conference on Manufacturing Systems (CMS) 2021  
Open Access
DOI
10.1016/j.procir.2021.11.009
Language
English
Fraunhofer-Institut für Produktionstechnologie IPT  
Keyword(s)
  • data preprocessing

  • data preparation

  • machine learning

  • artificial intelligence

  • production

  • manufacturing

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024