A quadruped robot kinematic skill learning method integrating meta-learning and PPO algorithms

Indexed by：

EI Scopus

Abstract：

Learning ability is a typical characteristic of higher animal intelligence. To explore the learning mechanism of quadruped motor skills, this paper studies the gait learning task of quadruped robots and reproduces, from scratch, the rhythmic gait learning process of quadruped animals. In recent years, the proximal policy optimization (PPO) algorithm, a representative deep reinforcement learning algorithm, has been widely used in gait learning tasks for quadruped robots, with good experimental results and relatively few hyperparameters required. However, in scenarios with multidimensional inputs and outputs, it easily converges to a local optimum. In the experimental environment of this study, the gait rhythm signals of the trained quadruped robot were irregular, and its center of gravity oscillated. To solve these problems, and drawing on the advantage of meta-learning in characterizing high-dimensional abstract representations of learning processes, this paper proposes a meta proximal policy optimization (MPPO) algorithm that combines meta-learning and PPO. The algorithm enables quadruped robots to learn a better gait. Simulation results on the PyBullet simulation platform show that the proposed algorithm enables quadruped robots to learn walking skills. Compared with the soft actor-critic (SAC) and PPO algorithms, the proposed MPPO algorithm produces more regular gait rhythm signals and a faster walking speed. © 2024 South China University of Technology. All rights reserved.

Keyword：

Reinforcement learning; Simulation platform; Deep learning; Learning algorithms; Multipurpose robots; Animals

Author Community：

[ 1 ] [Zhu, Xiao-Qing] Faculty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China
[ 2 ] [Liu, Xin-Yuan] Faculty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China
[ 3 ] [Ruan, Xiao-Gang] Faculty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China
[ 4 ] [Zhang, Si-Yuan] Faculty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China
[ 5 ] [Li, Chun-Yang] Faculty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China
[ 6 ] [Li, Peng] Faculty of Information Technology, Beijing University of Technology, Beijing 100020, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100020, China

Related Keywords：

Optimization-based parallel learning of quadruped robot locomotion skills
2024, Journal of Tsinghua University
Multi-strategy Central Pattern Generator and Reinforcement Learning Integration for Quadruped Locomotion
2025, 3rd International Conference on Machine Learning, Cloud Computing and Intelligent Mining, MLCCIM 2024
Gait learning method of quadruped robot based on policy distillation
2025, Journal of Beijing University of Aeronautics and Astronautics
Gait Learning of Quadruped Robot Based on Deep Arbitration Strategy
2023, Transaction of Beijing Institute of Technology

Source ：

Control Theory and Applications

ISSN： 1000-8152

Year： 2024

Issue： 1

Volume： 41

Page： 155-162

Cited Count：

WoS CC Cited Count： 0

ESI Highly Cited Papers on the List： 0
