2019
Conference Paper
Title

Architecture-aware Bayesian Optimization for Neural Network Tuning

Abstract
Hyperparameter optimization of a neural network is a non-trivial task. It is time-consuming to evaluate a hyperparameter setting, no analytical expression of the impact of the hyperparameters is available, and the evaluations are noisy in the sense that the result depends on the training process and weight initialization. Bayesian optimization is a powerful tool to handle these problems. However, hyperparameter optimization of neural networks poses additional challenges, since the hyperparameters can be integer-valued, categorical, and/or conditional, whereas Bayesian optimization often assumes variables to be real-valued. In this paper, we present an architecture-aware transformation of neural networks applied in the kernel of a Gaussian process to boost the performance of hyperparameter optimization. The empirical experiment in this paper demonstrates that introducing an architecture-aware transformation of the kernel clearly improves the performance of the Bayesian optimizer over a naïve implementation, and that the results are comparable to other state-of-the-art methods.
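The abstract does not spell out the transformation itself, so the following is only a minimal sketch of the general idea of applying a transformation inside the kernel, not the authors' method. Everything in it is an assumption made for illustration: the configuration layout (`num_layers`, `units`), the bound `MAX_LAYERS`, the hypothetical `embed` function that zeroes out inactive layers, and the choice of a squared-exponential kernel. It contrasts a kernel evaluated on a naive real-valued encoding with one evaluated on the transformed representation.

```python
import math

MAX_LAYERS = 4  # assumed upper bound on network depth (illustrative only)


def embed(config):
    """Hypothetical architecture-aware transformation.

    Maps a configuration to a fixed-length vector of log2 layer widths.
    Layers beyond `num_layers` are inactive and are padded with 0.0, so
    their (conditional) width settings cannot influence the kernel.
    Widths are assumed to be >= 2 so an active layer never maps to 0.0.
    """
    depth = config["num_layers"]
    active = [math.log2(w) for w in config["units"][:depth]]
    return active + [0.0] * (MAX_LAYERS - depth)


def naive_vector(config):
    """Naive encoding: every raw hyperparameter is treated as a real number."""
    return [float(config["num_layers"])] + [math.log2(u) for u in config["units"]]


def rbf(x, y, length_scale=1.0):
    """Squared-exponential kernel on two equal-length vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-0.5 * sq_dist / length_scale ** 2)


def architecture_aware_kernel(c1, c2):
    """Kernel evaluated on the transformed (embedded) configurations."""
    return rbf(embed(c1), embed(c2))


def naive_kernel(c1, c2):
    """Kernel evaluated directly on the raw hyperparameter values."""
    return rbf(naive_vector(c1), naive_vector(c2))


# Two configurations that describe the same 2-layer network. They differ
# only in the widths of layers 3 and 4, which are inactive.
a = {"num_layers": 2, "units": [64, 32, 16, 16]}
b = {"num_layers": 2, "units": [64, 32, 256, 128]}

k_aware = architecture_aware_kernel(a, b)
k_naive = naive_kernel(a, b)
print(k_aware, k_naive)
```

Under these assumptions, the transformed kernel assigns the two configurations a similarity of exactly 1, because they define the same network, while the naive kernel treats them as nearly unrelated. In a Gaussian process surrogate, that difference determines whether an observation of one configuration informs the prediction for the other.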
Author(s)
Sjöberg, Anders
Fraunhofer-Chalmers Research Centre for Industrial Mathematics FCC
Önnheim, Magnus
Fraunhofer-Chalmers Research Centre for Industrial Mathematics FCC
Gustavsson, Emil
Fraunhofer-Chalmers Research Centre for Industrial Mathematics FCC
Jirstrand, Mats
Fraunhofer-Chalmers Research Centre for Industrial Mathematics FCC
Mainwork
Artificial Neural Networks and Machine Learning - ICANN 2019. Deep Learning  
Conference
International Conference on Artificial Neural Networks (ICANN) 2019  
DOI
10.1007/978-3-030-30484-3_19
Language
English
FCC  
Fraunhofer-Institut für Techno- und Wirtschaftsmathematik ITWM  