Proceedings Vol. 9 (2003)
ENGINEERING MECHANICS 2003
May 11 – 15, 2003, Svratka, Czech Republic
Copyright © 2003 Institute of Theoretical and Applied Mechanics, Academy of Sciences of the Czech Republic, v.v.i., Prague
ISSN 1805-8248 (printed)
ISSN 1805-8256 (electronic)
list of papers scientific commitee
pages 36 - +6p., full text
The possible method of walking policy obtaining of four-legged robot through Q-learning is discussed in the contribution. Q-learning is implemented using architecture represented by non-deterministic state machine that defines both possible discrete states and admissible transitions between them. Discrete state is designed as indicators vector of goals achievement by single simultaneously activated instances of two basic controllers. Only simultaneous activations that guarantee static stability of robot are admissible even in the case when single activations cold not achieve its goals. The controllers attempt to achieve its goals using on-line minimization process. Q-learning sequentially improves an estimation of future benefit from usage of admissible simultaneous activations in single discrete states. Walking policy is generated through activations with the highest estimation of future benefit.
back to list of papers
Text and facts may be copied and used freely, but credit should be given to these Proceedings.
All papers were reviewed by members of the scientific committee.