• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zhu, Xiao-Qing (Zhu, Xiao-Qing.) | Liu, Xin-Yuan (Liu, Xin-Yuan.) | Ruan, Xiao-Gang (Ruan, Xiao-Gang.) | Zhang, Si-Yuan (Zhang, Si-Yuan.) | Li, Chun-Yang (Li, Chun-Yang.) | Li, Peng (Li, Peng.)

Indexed by:

EI Scopus

Abstract:

Learning ability is a typical characteristic of higher animal intelligence. In order to explore the learning mechanism of quadruped motor skills, this paper studies the gait learning task of quadruped robots, and reproduces the rhythmic gait learning process of quadruped animals from scratch. In recent years, proximal policy optimization (PPO) algorithm, as a typical representative algorithm of deep reinforcement learning, has been widely used in gait learning tasks for quadruped robots, with good experimental results and fewer hyperparameters required. However, in the multidimensional input and output scenario, it is easy to converge to the local optimum point, in the experimental environment of this study, the gait rhythm signals of the trained quadruped robot were irregular, and the center of gravity oscillates. To solve the above problems, inspired by meta-learning, based on the advantage of meta-learning in characterizing the high-dimensional abstract representation of learning processes, this paper proposes an meta proximal policy optimization (MPPO) algorithm that combines meta-learning and PPO algorithms. This algorithm can enable quadruped robots to learn better gait. The simulation results on the PyBullet simulation platform show that the algorithm proposed in this paper can enable quadruped robots to learn walking skills. Compared with soft actor-critic (SAC) and PPO algorithms, the MPPO algorithm proposed in this paper has advantages such as more regular gait rhythm signals and faster walking speed. © 2024 South China University of Technology. All rights reserved.

Keyword:

Reinforcement learning Simulation platform Deep learning Learning algorithms Multipurpose robots Animals

Author Community:

  • [ 1 ] [Zhu, Xiao-Qing]Faulty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China)
  • [ 2 ] [Liu, Xin-Yuan]Faulty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China)
  • [ 3 ] [Ruan, Xiao-Gang]Faulty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China)
  • [ 4 ] [Zhang, Si-Yuan]Faulty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China)
  • [ 5 ] [Li, Chun-Yang]Faulty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China)
  • [ 6 ] [Li, Peng]Faulty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China)

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Control Theory and Applications

ISSN: 1000-8152

Year: 2024

Issue: 1

Volume: 41

Page: 155-162

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Affiliated Colleges:

Online/Total:761/10654403
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.