Author:

Zuo, Guoyu (左国玉) | Lu, Jiahao | Chen, Kexin | Yu, Jianjun | Huang, Xiangsheng

Indexed by:

EI

Abstract:

This paper proposes a robotic imitation learning method that integrates deterministic off-policy reinforcement learning with a generative adversarial network. The method allows the robot to carry out the grasping task rapidly by learning the reward function from demonstration data. First, the discriminator learns the reward function from demonstrations, which guides the generator to complete the robot grasping task. Second, the deep deterministic policy gradient method is used as the generator to learn the action policy on the basis of the discriminator. In particular, the demonstration data are also fed into the generator to ensure its performance. Finally, three experiments on the Push and Pick-and-Place tasks are conducted in the GYM robotic environment. Results show that the method learns much faster than the stochastic GAIL method and can train effectively from demonstration data in different states of the task. The proposed method can complete the robot grasping task quickly without an environmental reward and improves the stability of the training process. © 2018 IEEE
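The abstract does not state the loss or reward formulas, so the following is only a minimal sketch of how a discriminator output is commonly turned into a reward in GAIL-style methods. The function names and the reward form r = -log(1 - D) are assumptions for illustration, not the authors' implementation.

```python
import math

def discriminator_reward(d_prob, eps=1e-8):
    """Map a discriminator probability to a reward.

    d_prob is the (assumed) probability that a state-action pair
    came from the demonstrations. The form r = -log(1 - D) is a
    common GAIL choice, not one confirmed by the abstract.
    """
    d = min(max(d_prob, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -math.log(1.0 - d)

def discriminator_loss(expert_probs, policy_probs, eps=1e-8):
    """Binary cross-entropy with demonstration pairs labelled 1
    and generator (policy) pairs labelled 0."""
    expert_term = -sum(math.log(max(p, eps)) for p in expert_probs) / len(expert_probs)
    policy_term = -sum(math.log(max(1.0 - p, eps)) for p in policy_probs) / len(policy_probs)
    return expert_term + policy_term
```

Under these assumptions, pairs the discriminator judges more demonstration-like receive a higher reward, which a deterministic off-policy learner such as DDPG could then use in place of an environmental reward.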

Keyword:

Educational robots; Reinforcement learning; Robot learning; Robotics; Stochastic systems; Demonstrations; Robots; Agricultural robots; Gradient methods

Author Community:

  • [ 1 ] [Zuo, Guoyu]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Zuo, Guoyu]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
  • [ 3 ] [Lu, Jiahao]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Lu, Jiahao]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
  • [ 5 ] [Chen, Kexin]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 6 ] [Chen, Kexin]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
  • [ 7 ] [Yu, Jianjun]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 8 ] [Yu, Jianjun]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
  • [ 9 ] [Huang, Xiangsheng]Institute of Automation, Chinese Academy of Sciences, Beijing; 100190, China

Reprint Author's Address:

  • Zuo, Guoyu (左国玉)

    [Zuo, Guoyu]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China; [Zuo, Guoyu]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China

Source :

Year: 2019

Volume: 2019-August

Page: 803-808

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 3

ESI Highly Cited Papers on the List: 0