Learning of Quadruped Robot Motor Skills Based on Policy Constrained TD3 - Details

Author：

Indexed by：

Abstract：

The　agile　movement　of　quadruped　robot　requires　rich　professional　knowledge　and　tedious　manual　adjustment.　However,　reinforcement　learning　does　not　require　any　professional　knowledge　to　enable　the　quadruped　robot　to　learn　better　movement　gait　and　skills.　The　TD3　algorithm　is　widely　used　in　continuous　motion　control,　but　it　tends　to　converge　to　boundary　actions,　resulting　in　inability　to　learn　the　optimal　strategy　and　overfitting.　Inspired　by　behavior　cloning,　this　paper　proposes　a　policy　constrained　TD3　algorithm(PC-　TD3),　which　adds　behavioral　constraints　during　the　policy　update　process　of　TD3,　updates　the　policy　in　the　direction　of　expected　behavior,　and　reduces　boundary　actions.　The　experiment　was　conducted　on　the　Pybullet　platform.　The　experimental　results　show　that　the　algorithm　proposed　in　this　paper　can　enable　the　quadruped　Robot　learning　to　learn　the　walking　skills.　Compared　with　other　mainstream　algorithms,　the　experiment　shows　that　PC-　TD3　has　better　performance.　©　2023　IEEE.

Keyword：

Reinforcement learning Multipurpose robots Cloning Clone cells

Author Community：

[ 1 ] [Zhu, Xiaoyu]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 2 ] [Zhu, Xiaoqing]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 3 ] [Chen, Jiangtao]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 4 ] [Zhang, Siyuan]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 5 ] [Nan, Borui]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 6 ] [Bi, Lanyue]Beijing University of Technology, Faculty of Information Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Multi-robot formation control using reinforcement learning method
2010，1st International Conference on Advances in Swarm Intelligence, ICSI 2010
Gait Learning of Quadruped Robot Based on Deep Arbitration Strategy
2023，Transaction of Beijing Institute of Technology
Quadruped Robot Get Bionic Learning Method Based on Intelligent Memory Soft Actor-Critic
2023，35th Chinese Control and Decision Conference, CCDC 2023
Environmental Features Assessment Network Aided Deep Reinforcement Learning for Quadrupedal Locomotion in Tough Terrain
2023，2023 China Automation Congress, CAC 2023

Source ：

Year： 2023

Page： 4910-4914

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to