Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Multi-objective hyperparameter tuning and feature selection using filter ensembles

 
: Binder, M.; Moosbauer, J.; Thomas, J.; Bischl, B.

:
Volltext ()

Coello Coello, C.A. ; Association for Computing Machinery -ACM-, Special Interest Group on Genetic and Evolutionary Computation:
Genetic and Evolutionary Computation Conference, GECCO 2020. Proceedings : Cancún, Mexico, July, 2020
New York: ACM, 2020
ISBN: 978-1-4503-7128-5
S.471-479
Genetic and Evolutionary Computation Conference (GECCO) <2020, Online>
Englisch
Konferenzbeitrag, Elektronische Publikation
Fraunhofer IIS ()

Abstract
Both feature selection and hyperparameter tuning are key tasks in machine learning. Hyperparameter tuning is often useful to increase model performance, while feature selection is undertaken to attain sparse models. Sparsity may yield better model interpretability and lower cost of data acquisition, data handling and model inference. While sparsity may have a beneficial or detrimental effect on predictive performance, a small drop in performance may be acceptable in return for a substantial gain in sparseness. We therefore treat feature selection as a multi-objective optimization task. We perform hyperparameter tuning and feature selection simultaneously because the choice of features of a model may influence what hyperparameters perform well. We present, benchmark, and compare two different approaches for multi-objective joint hyperparameter optimization and feature selection: The first uses multi-objective model-based optimization. The second is an evolutionary NSGA-I I-based wrapper approach to feature selection which incorporates specialized sampling, mutation and recombination operators. Both methods make use of parameterized filter ensembles. While model-based optimization needs fewer objective evaluations to achieve good performance, it incurs computational overhead compared to the NSGA-II, so the preferred choice depends on the cost of evaluating a model on given data.

: http://publica.fraunhofer.de/dokumente/N-614638.html