Reinforcement learning function approximation algorithm based on linear average

Author：

Tao, Jun-Yuan | Sun, Jin-Wei | Li, De-Sheng (Scholars：李德胜)

Indexed by：

EI Scopus PKU CSCD

Abstract：

A reinforcement learning algorithm based on linear averaging is proposed to address the non-convergence of function approximation in reinforcement learning over continuous state spaces. Following contraction theory, the algorithm builds on the gradient descent method and adopts a linear average as the performance evaluation of the value function, so that the iteration of the value function converges to a fixed value. The Mountain Car problem, a standard reinforcement learning benchmark, is used to verify the performance of the algorithm. The results show that the algorithm is effective, feasible and converges quickly.
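
The abstract does not give the update rule, so the following is only an illustrative sketch and not the authors' algorithm. It assumes one common reading of "linear average": the value at a continuous state is a convex combination (bilinear interpolation) of parameters stored on a grid, trained with a semi-gradient Q-learning step on the standard Mountain Car dynamics. The grid size `N`, the step size, the exploration rate and all function names are assumptions made for the example.

```python
import math
import random

X_MIN, X_MAX = -1.2, 0.5      # position bounds (standard Mountain Car)
V_MIN, V_MAX = -0.07, 0.07    # velocity bounds
ACTIONS = (-1, 0, 1)          # reverse, coast, forward
N = 9                         # grid points per dimension (assumed)


def step(x, v, a):
    """One step of the standard Mountain Car dynamics; reward is -1 per step."""
    v = min(max(v + 0.001 * a - 0.0025 * math.cos(3 * x), V_MIN), V_MAX)
    x = min(max(x + v, X_MIN), X_MAX)
    if x == X_MIN:
        v = 0.0               # inelastic wall on the left
    return x, v, -1.0, x >= X_MAX


def averaging_weights(x, v):
    """Bilinear weights over the 4 surrounding grid nodes.

    The weights are non-negative and sum to 1, so the approximate value
    is a linear (convex) average of the node parameters.
    """
    fx = (x - X_MIN) / (X_MAX - X_MIN) * (N - 1)
    fv = (v - V_MIN) / (V_MAX - V_MIN) * (N - 1)
    i = min(int(fx), N - 2)
    j = min(int(fv), N - 2)
    dx, dv = fx - i, fv - j
    return [
        (i * N + j, (1 - dx) * (1 - dv)),
        (i * N + j + 1, (1 - dx) * dv),
        ((i + 1) * N + j, dx * (1 - dv)),
        ((i + 1) * N + j + 1, dx * dv),
    ]


def q_value(theta, x, v, a_idx):
    """Averaged action value for action index a_idx at state (x, v)."""
    return sum(w * theta[a_idx][k] for k, w in averaging_weights(x, v))


def train(episodes=20, alpha=0.5, gamma=0.99, epsilon=0.1,
          max_steps=300, seed=0):
    """Semi-gradient Q-learning with the averaging approximator (a sketch)."""
    rng = random.Random(seed)
    theta = [[0.0] * (N * N) for _ in ACTIONS]
    n_actions = len(ACTIONS)
    for _ in range(episodes):
        x, v = rng.uniform(-0.6, -0.4), 0.0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                a_idx = rng.randrange(n_actions)
            else:
                qs = [q_value(theta, x, v, k) for k in range(n_actions)]
                a_idx = qs.index(max(qs))
            x2, v2, r, done = step(x, v, ACTIONS[a_idx])
            target = r
            if not done:
                target += gamma * max(
                    q_value(theta, x2, v2, k) for k in range(n_actions))
            delta = target - q_value(theta, x, v, a_idx)
            # The gradient of the averaged value w.r.t. a node is its weight.
            for k, w in averaging_weights(x, v):
                theta[a_idx][k] += alpha * w * delta
            if done:
                break
            x, v = x2, v2
    return theta
```

Calling `train()` returns one list of grid parameters per action. Because the interpolation weights form a convex combination, each predicted value lies between the smallest and largest parameter of its four neighbouring nodes, which is the property that averaging-style approximators rely on in contraction arguments. Whether this matches the paper's construction cannot be confirmed from the abstract alone.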

Keyword：

Gradient methods; Reinforcement learning; Learning algorithms; Approximation algorithms; Automation

Author Affiliations：

[ 1 ] [Tao, Jun-Yuan] School of Electrical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China
[ 2 ] [Sun, Jin-Wei] School of Electrical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China
[ 3 ] [Li, De-Sheng] School of Mechanical Engineering and Applied Electronic Technology, Beijing University of Technology, Beijing 100022, China

Reprint Author's Address：

Email：

tjy1975@126.com

Source ：

Journal of Jilin University (Engineering and Technology Edition)

ISSN： 1671-5497

Year： 2008

Issue： 6

Volume： 38

Page： 1407-1411

Affiliated Colleges：

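College of Materials Science and Engineering (data not clearly attributed to a specific unit within this college/department)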
