• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Li, C. (Li, C..) | Zhu, X. (Zhu, X..) | Ruan, X. (Ruan, X..) | Liu, X. (Liu, X..) | Zhang, S. (Zhang, S..)

Indexed by:

Scopus

Abstract:

Bionic gait learning of quadruped robots based on reinforcement learning has become a hot research topic. The proximal policy optimization (PPO) algorithm has a low probability of learning a successful gait from scratch due to problems such as reward sparsity. To solve the problem, we propose a experience evolution proximal policy optimization (EEPPO) algorithm which integrates PPO with priori knowledge highlighting by evolutionary strategy. We use the successful trained samples as priori knowledge to guide the learning direction in order to increase the success probability of the learning algorithm. To verify the effectiveness of the proposed EEPPO algorithm, we have conducted simulation experiments of the quadruped robot gait learning task on Pybullet. Experimental results show that the central pattern generator based radial basis function (CPG-RBF) network and the policy network are simultaneously updated to achieve the quadruped robot’s bionic diagonal trot gait learning task using key information such as the robot’s speed, posture and joints information. Experimental comparison results with the traditional soft actor-critic (SAC) algorithm validate the superiority of the proposed EEPPO algorithm, which can learn a more stable diagonal trot gait in flat terrain. © 2023, Shanghai Jiao Tong University.

Keyword:

quadruped robot proximal policy optimization (PPO) priori knowledge TP 242 evolutionary strategy A bionic gait learning

Author Community:

  • [ 1 ] [Li C.]Faculty of Information Technology, Beijing University of Technology
  • [ 2 ] Beijing Key Laboratory of Computational Intelligence and Intelligent System
  • [ 3 ] Engineering Research Center of Digital Community of Ministry of Education, Beijing, 100124, China
  • [ 4 ] [Zhu X.]Faculty of Information Technology, Beijing University of Technology
  • [ 5 ] Beijing Key Laboratory of Computational Intelligence and Intelligent System
  • [ 6 ] Engineering Research Center of Digital Community of Ministry of Education, Beijing, 100124, China
  • [ 7 ] [Ruan X.]Faculty of Information Technology, Beijing University of Technology
  • [ 8 ] Beijing Key Laboratory of Computational Intelligence and Intelligent System
  • [ 9 ] Engineering Research Center of Digital Community of Ministry of Education, Beijing, 100124, China
  • [ 10 ] [Liu X.]Faculty of Information Technology, Beijing University of Technology
  • [ 11 ] Beijing Key Laboratory of Computational Intelligence and Intelligent System
  • [ 12 ] Engineering Research Center of Digital Community of Ministry of Education, Beijing, 100124, China
  • [ 13 ] [Zhang S.]Faculty of Information Technology, Beijing University of Technology
  • [ 14 ] Beijing Key Laboratory of Computational Intelligence and Intelligent System
  • [ 15 ] Engineering Research Center of Digital Community of Ministry of Education, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

Journal of Shanghai Jiaotong University (Science)

ISSN: 1007-1172

Year: 2023

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Affiliated Colleges:

Online/Total:392/10601432
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.