Mehmet D. Erbas
Embodied imitation-enhanced reinforcement learning in multi-agent systems
Erbas, Mehmet D.; Winfield, Alan F.T.; Bull, Larry
Authors
Alan Winfield Alan.Winfield@uwe.ac.uk
Professor in Robotics
Lawrence Bull Larry.Bull@uwe.ac.uk
School Director (Research & Enterprise) and Professor
Abstract
Imitation is an example of social learning in which an individual observes and copies another's actions. This paper presents a new method for using imitation as a way of enhancing the learning speed of individual agents that employ a well-known reinforcement learning algorithm, namely Q-learning. Compared with other research that uses imitation with reinforcement learning, our method uses imitation of purely observed behaviours to enhance learning, with no internal state access or sharing of experiences between agents. The paper evaluates our imitation-enhanced reinforcement learning approach in both simulation and with real robots in continuous space. Both simulation and real robot experimental results show that the learning speed of the group is improved. © The Author(s) 2013.
Journal Article Type | Article |
---|---|
Online Publication Date | Aug 29, 2013 |
Publication Date | Feb 1, 2014 |
Publicly Available Date | Jun 6, 2019 |
Journal | Adaptive Behavior |
Print ISSN | 1059-7123 |
Electronic ISSN | 1741-2633 |
Publisher | SAGE Publications |
Peer Reviewed | Peer Reviewed |
Volume | 22 |
Issue | 1 |
Pages | 31-50 |
DOI | https://doi.org/10.1177/1059712313500503 |
Keywords | embodied imitation, reinforcement q-learning, social learning, multi-agent systems |
Public URL | https://uwe-repository.worktribe.com/output/821497 |
Publisher URL | http://dx.doi.org/10.1177/1059712313500503 |
Files
Erbas_etal_ImitationEnhancedLearning.pdf
(912 Kb)
PDF
You might also like
Towards the evolution of vertical-axis wind turbines using supershapes
(2014)
Journal Article
Evolving unipolar memristor spiking neural networks
(2015)
Journal Article
A brief history of learning classifier systems: from CS-1 to XCS and its variants
(2015)
Journal Article
Discrete and fuzzy dynamical genetic programming in the XCSF learning classifier system
(2013)
Journal Article
Evolving spiking networks with variable resistive memories
(2014)
Journal Article
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search