• English
  • Deutsch
  • Log In
    Password Login
    Have you forgotten your password?
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Policy learning using SPSA
 
  • Details
  • Full
Options
2018
Conference Paper
Title

Policy learning using SPSA

Abstract
We analyze the use of simultaneous perturbation stochastic approximation (SPSA), a stochastic optimization technique, for solving reinforcement learning problems. In particular, we consider settings of partial observability and leverage the short-term memory capabilities of echo state networks (ESNs) to learn parameterized control policies. Using SPSA, we propose three different variants to adapt the weight matrices of an ESN to the task at hand. Experimental results on classic control problems with both discrete and continuous action spaces reveal that ESNs trained using SPSA approaches outperform conventional ESNs trained using temporal difference and policy gradient methods.
Author(s)
Ramamurthy, Rajkumar  
Bauckhage, Christian  
Sifa, Rafet  
Wrobel, Stefan  
Mainwork
Artificial Neural Networks and Machine Learning - ICANN 2018. Proceedings, Part III  
Project(s)
ML2R
Funder
Bundesministerium für Bildung und Forschung BMBF (Deutschland)  
Conference
International Conference on Artificial Neural Networks (ICANN) 2018  
DOI
10.1007/978-3-030-01424-7_1
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024