• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Imitation learning in relational domains: A functional-gradient boosting approach
 
  • Details
  • Full
Options
2011
Conference Paper
Title

Imitation learning in relational domains: A functional-gradient boosting approach

Abstract
Imitation learning refers to the problem of learning how to behave by observing a teacher in action. We consider imitation learning in relational domains, in which there is a varying number of objects and relations among them. In prior work, simple relational policies are learned by viewing imitation learning as supervised learning of a function from states to actions. For propositional worlds, functional gradient methods have been proved to be beneficial. They are simpler to implement than most existing methods, more efficient, more naturally satisfy common constraints on the cost function, and better represent our prior beliefs about the form of the function. Building on recent generalizations of functional gradient boosting to relational representations, we implement a functional gradient boosting approach to imitation learning in relational domains. In particular, given a set of traces from the human teacher, our system learns a policy in the form of a set of relati onal regression trees that additively approximate the functional gradients. The use of multiple additive trees combined with relational representation allows for learning more expressive policies than what has been done before. We demonstrate the usefulness of our approach in several different domains.
Author(s)
Natarajan, S.
Joshi, S.
Tadepalli, P.
Kersting, Kristian  
Shavlik, J.
Mainwork
Twenty-Second International Joint Conference on Artificial Intelligence, IJCAI-11. Proceedings. Vol.2  
Conference
International Joint Conference on Artificial Intelligence (IJCAI) 2011  
DOI
10.5591/978-1-57735-516-8/IJCAI11-239
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024