• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Tao, Jun-Yuan (Tao, Jun-Yuan.) | Sun, Jin-Wei (Sun, Jin-Wei.) | Li, De-Sheng (Li, De-Sheng.) (Scholars:李德胜)

Indexed by:

EI Scopus PKU CSCD

Abstract:

A reinforcement learning algorithm based on linear average is proposed, which is used to solve non-convergent problems of reinforcement learning function approximation in continuous state space. According to contraction theory, this algorithm is based on gradient descent method, which adopts linear average as performance evaluation of value function. So the iterative process of value function becomes a convergent process to a fixed value. A standard reinforcement learning problem, Mountain Car Problem, is used to verify the performance of the algorithm. Results show the effectiveness, feasibility and quick convergence of the algorithm.

Keyword:

Gradient methods Reinforcement learning Learning algorithms Approximation algorithms Automation

Author Community:

  • [ 1 ] [Tao, Jun-Yuan]School of Electrical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China
  • [ 2 ] [Sun, Jin-Wei]School of Electrical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China
  • [ 3 ] [Li, De-Sheng]School of Mechanical Engineering and Applied Electronic Technology, Beijing University of Technology, Beijing 100022, China

Reprint Author's Address:

Show more details

Related Keywords:

Related Article:

Source :

Journal of Jilin University (Engineering and Technology Edition)

ISSN: 1671-5497

Year: 2008

Issue: 6

Volume: 38

Page: 1407-1411

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 6

Online/Total:600/10598705
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.