Abstract
We propose a method to improve the performance of R-learning, a reinforcement learning algorithm, by using multiple state-action value tables. Unlike Q- or Sarsa learning, R-learning learns a policy to maximize undiscounted rewards. Multiple state-action value tables cause substantial explorations as needed and make R-learning work well. Efficiency of the proposed method is verified through experiments in a simulated environment.
Original language | English |
---|---|
Pages (from-to) | 34-47 |
Number of pages | 14 |
Journal | Electrical Engineering in Japan (English translation of Denki Gakkai Ronbunshi) |
Volume | 159 |
Issue number | 3 |
DOIs | |
Publication status | Published - 2007 May |
Externally published | Yes |
Keywords
- Autonomous mobile robot
- R-learning
- Reinforcement learning
ASJC Scopus subject areas
- Energy Engineering and Power Technology
- Electrical and Electronic Engineering