2023
Conference Paper
Title

Quantum Natural Policy Gradients: Towards Sample-Efficient Reinforcement Learning

Abstract
Reinforcement learning is a growing field in AI in which intelligent behavior is learned automatically through trial-and-error interaction with an environment. However, this learning process is often costly. Using variational quantum circuits as function approximators can potentially reduce this cost. To this end, we propose the quantum natural policy gradient (QNPG) algorithm, a second-order gradient-based routine that takes advantage of an efficient approximation of the quantum Fisher information matrix. We experimentally demonstrate that QNPG outperforms first-order training on different contextual bandit environments in terms of convergence speed and stability, and that it also reduces the sample complexity. Furthermore, we provide evidence for the practical feasibility of our approach by training on a 12-qubit hardware device.
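For readers unfamiliar with natural policy gradients, the following is a minimal classical sketch of the general idea the abstract refers to: the policy gradient is preconditioned with the inverse of a Fisher information matrix before the parameter update. It is not the QNPG algorithm from the paper. It assumes a linear softmax policy and the classical Fisher matrix in place of a variational quantum circuit and the approximated quantum Fisher information matrix, and the names `softmax`, `natural_gradient_step`, `lr`, and `damping` are hypothetical.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over a 1-D array of logits."""
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()


def natural_gradient_step(theta, context, action, reward, lr=0.1, damping=1e-3):
    """One natural policy gradient step for a linear softmax policy.

    theta   : (n_actions, n_features) policy parameters (illustrative model,
              standing in for the circuit parameters of the paper)
    context : (n_features,) observed context of the bandit
    action  : index of the action that was played
    reward  : scalar reward received for that action
    """
    probs = softmax(theta @ context)
    n_actions = probs.shape[0]

    # Score function: gradient of log pi(action | context) w.r.t. theta.
    onehot = np.eye(n_actions)[action]
    grad_logp = np.outer(onehot - probs, context).ravel()

    # Classical Fisher matrix for this context: E_a[ g_a g_a^T ].
    fisher = np.zeros((theta.size, theta.size))
    for a in range(n_actions):
        g = np.outer(np.eye(n_actions)[a] - probs, context).ravel()
        fisher += probs[a] * np.outer(g, g)

    # Damping keeps the (rank-deficient) Fisher matrix invertible.
    fisher += damping * np.eye(theta.size)

    # Natural gradient: solve F x = reward * grad_logp instead of inverting F.
    nat_grad = np.linalg.solve(fisher, reward * grad_logp)
    return theta + lr * nat_grad.reshape(theta.shape)
```

A first-order update would use `reward * grad_logp` directly; the only difference in this sketch is the linear solve against the damped Fisher matrix, which rescales the step according to how strongly each parameter direction changes the policy distribution.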
Author(s)
Meyer, Nico
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Scherer, Daniel David
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Plinge, Axel  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Mutschler, Christopher  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Hartmann, Michael J.
Mainwork
IEEE International Conference on Quantum Computing and Engineering, QCE 2023. Vol.2  
Conference
International Conference on Quantum Computing and Engineering 2023  
Workshop "Quantum Machine Learning - From Foundations to Applications" 2023  
DOI
10.1109/QCE57702.2023.10181
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • contextual bandits
  • natural gradient
  • policy gradient
  • reinforcement learning
  • variational quantum computing
