Qdsega による多足ロボットの歩行運動の獲得

Transactions of the Japanese Society for Artificial Intelligence 17:363-372 (2002)
  Copy   BIBTEX

Abstract

Reinforcement learning is very effective for robot learning. Because it does not need priori knowledge and has higher capability of reactive and adaptive behaviors. In our previous works, we proposed new reinforcement learning algorithm: “Q-learning with Dynamic Structuring of Exploration Space Based on Genetic Algorithm (QDSEGA)”. It is designed for complicated systems with large action-state space like a robot with many redundant degrees of freedom. And we applied it to 50 link manipulator and effective behavior is acquired. However optimality and fault tolerance of the proposed algorithm were not considered and to demonstrate effectiveness of the proposed algorithm other applications are necessary. Acquiring of locomotion patterns by a multi-legged robot is a very interesting problem. As it has many redundant degrees of freedom, application of usual reinforcement learning is difficult and an optimal locomotion has not been acquired using previous reinforcement learning algorithm. And the redundancy of the robot is effective to the fault tolerance and various locomotion patterns can be acquired for adapting the faults of the legs. In this paper, we applied QDSEGA to acquiring of locomotion pattern by the multi-legged robot and considered the optimality and fault tolerance. Effective behavior has been obtained by using our proposed algorithm.

Other Versions

No versions found

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 101,060

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Ga により探索空間の動的生成を行う Q 学習.Matsuno Fumitoshi Ito Kazuyuki - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16:510-520.
強化学習を用いた自律移動型ロボットの行動計画法の提案.五十嵐 治一 - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16:501-509.
ノード使用頻度に依存した交叉による進化ロボティクスの高速化.山田 誠二 片上 大輔 - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16:392-399.
合理的政策形成アルゴリズムの連続値入力への拡張.木村 元 宮崎 和光 - 2007 - Transactions of the Japanese Society for Artificial Intelligence 22 (3):332-341.

Analytics

Added to PP
2014-03-24

Downloads
30 (#747,543)

6 months
7 (#698,214)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references