TY - GEN
T1 - Reinforcement learning for balancer embedded humanoid locomotion
AU - Yamaguchi, Akihiko
AU - Hyon, Sang Ho
AU - Ogasawara, Tsukasa
PY - 2010
Y1 - 2010
N2 - Reinforcement learning (RL) applications in robotics are of great interest because of their wide applicability, however many RL applications suffer from large learning costs. We study a new learning-walking scheme where a humanoid robot is embedded with a primitive balancing controller for safety. In this paper, we investigate some RL methods for the walking task. The system has two modes: double stance and single stance, and the selectable action spaces (sub-action spaces) change according to the mode. Thus, a hierarchical RL and a function approximator (FA) approaches are compared in simulation. To handle the sub-action spaces, we introduce the structured FA. The results demonstrate that non-hierarchical RL algorithms with the structured FA is much faster than the hierarchical RL algorithm. The robot can obtain appropriate walking gaits in around 30 episodes (20∼30 min), which is considered to be applicable to a real humanoid robot.
AB - Reinforcement learning (RL) applications in robotics are of great interest because of their wide applicability, however many RL applications suffer from large learning costs. We study a new learning-walking scheme where a humanoid robot is embedded with a primitive balancing controller for safety. In this paper, we investigate some RL methods for the walking task. The system has two modes: double stance and single stance, and the selectable action spaces (sub-action spaces) change according to the mode. Thus, a hierarchical RL and a function approximator (FA) approaches are compared in simulation. To handle the sub-action spaces, we introduce the structured FA. The results demonstrate that non-hierarchical RL algorithms with the structured FA is much faster than the hierarchical RL algorithm. The robot can obtain appropriate walking gaits in around 30 episodes (20∼30 min), which is considered to be applicable to a real humanoid robot.
UR - http://www.scopus.com/inward/record.url?scp=79851485696&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79851485696&partnerID=8YFLogxK
U2 - 10.1109/ICHR.2010.5686296
DO - 10.1109/ICHR.2010.5686296
M3 - Conference contribution
AN - SCOPUS:79851485696
SN - 9781424486885
T3 - 2010 10th IEEE-RAS International Conference on Humanoid Robots, Humanoids 2010
SP - 308
EP - 313
BT - 2010 10th IEEE-RAS International Conference on Humanoid Robots, Humanoids 2010
T2 - 2010 10th IEEE-RAS International Conference on Humanoid Robots, Humanoids 2010
Y2 - 6 December 2010 through 8 December 2010
ER -