• English
  • Deutsch
  • Log In
    Password Login
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Policy learning using SPSA
 
  • Details
  • Full
Options
2018
Conference Paper
Titel

Policy learning using SPSA

Abstract
We analyze the use of simultaneous perturbation stochastic approximation (SPSA), a stochastic optimization technique, for solving reinforcement learning problems. In particular, we consider settings of partial observability and leverage the short-term memory capabilities of echo state networks (ESNs) to learn parameterized control policies. Using SPSA, we propose three different variants to adapt the weight matrices of an ESN to the task at hand. Experimental results on classic control problems with both discrete and continuous action spaces reveal that ESNs trained using SPSA approaches outperform conventional ESNs trained using temporal difference and policy gradient methods.
Author(s)
Ramamurthy, Rajkumar
Bauckhage, Christian
Sifa, Rafet
Wrobel, Stefan
Hauptwerk
Artificial Neural Networks and Machine Learning - ICANN 2018. Proceedings, Part III
Project(s)
ML2R
Funder
Bundesministerium für Bildung und Forschung BMBF (Deutschland)
Konferenz
International Conference on Artificial Neural Networks (ICANN) 2018
Thumbnail Image
DOI
10.1007/978-3-030-01424-7_1
Language
English
google-scholar
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022