2016
Conference Paper
Title

Generic error identification in data sets

Abstract
Manual data acquisition is common in many areas, for example at the United States Environmental Protection Agency (EPA) [1]. This type of acquisition can introduce many errors into a data set, and such errors can distort the rules and patterns that data mining algorithms extract. One example of a wrong entry is a fuel consumption value that is far too high for a vehicle because a decimal separator is missing. A customer who considers buying this vehicle and looks up its fuel consumption in the EPA database could be influenced in the purchase decision by the incorrect entry. Manual inspection of a data set is very time-consuming and impractical for large data sets, so inspection needs automatic procedures to remain accurate. This paper illustrates an approach that identifies errors using association rules. The association rules are generated by combining various algorithms from the fields of clustering and association analysis. These rules can also help prevent erroneous data entries in advance.
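The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the algorithms used, so fixed-threshold binning stands in for the clustering step and a simple pairwise rule miner stands in for the association analysis. All function names, thresholds, and vehicle values below are hypothetical and invented for illustration.

```python
from collections import Counter

def discretize(value, edges):
    """Map a numeric value to a bin label (assumed stand-in for clustering)."""
    for i, edge in enumerate(edges):
        if value < edge:
            return f"bin{i}"
    return f"bin{len(edges)}"

def mine_rules(records, min_support=3, min_confidence=0.75):
    """Mine rules of the form (attr_a = x) -> (attr_b = y) from dict records."""
    antecedent_counts = Counter()
    pair_counts = Counter()
    for rec in records:
        items = list(rec.items())
        for a in items:
            antecedent_counts[a] += 1
            for b in items:
                if a[0] != b[0]:
                    pair_counts[(a, b)] += 1
    rules = []
    for (a, b), n in pair_counts.items():
        confidence = n / antecedent_counts[a]
        if n >= min_support and confidence >= min_confidence:
            rules.append((a, b, confidence))
    return rules

def find_violations(records, rules):
    """Return (record index, rule) pairs where the antecedent holds but the consequent does not."""
    flagged = []
    for idx, rec in enumerate(records):
        for (a_attr, a_val), (b_attr, b_val), confidence in rules:
            if rec.get(a_attr) == a_val and rec.get(b_attr) != b_val:
                flagged.append((idx, ((a_attr, a_val), (b_attr, b_val), confidence)))
    return flagged

# Invented toy data: (vehicle class, fuel consumption). The fifth entry
# imitates a lost decimal separator (5.9 entered as 59.0).
raw = [
    ("compact", 6.1), ("compact", 5.8), ("compact", 6.4), ("compact", 6.0),
    ("compact", 59.0),
    ("suv", 11.2), ("suv", 12.0), ("suv", 10.8), ("suv", 11.5),
]
edges = [8.0, 15.0]  # hypothetical bin boundaries
records = [{"class": c, "consumption": discretize(v, edges)} for c, v in raw]

rules = mine_rules(records)
flagged = find_violations(records, rules)
for idx, ((a_attr, a_val), (b_attr, b_val), confidence) in flagged:
    print(f"record {idx}: {a_attr}={a_val} usually implies {b_attr}={b_val} "
          f"(confidence {confidence:.2f})")
```

On this toy data only the record with the lost decimal separator violates a mined rule. In practice the support and confidence thresholds would need tuning, since rare but correct entries can also violate high-confidence rules.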
Author(s)
El Bekri, Nadia
Peinsipp, Byma
Mainwork
25th International Conference on Software Engineering and Data Engineering, SEDE 2016  
Conference
International Conference on Software Engineering and Data Engineering (SEDE) 2016  
International Conference on Computer Applications in Industry and Engineering (CAINE) 2016  
Rights
Use according to copyright law
DOI
10.24406/publica-fhg-394199
Language
English
Fraunhofer-Institut für Optronik, Systemtechnik und Bildauswertung IOSB  
Keyword(s)
  • data mining
  • clustering
  • association analysis
  • association rules
