A Novel Value Iteration Scheme With Adjustable Convergence Rate - Details

Author：

Ha, Mingming (Ha, Mingming.) | Wang, Ding (Wang, Ding.) (Scholars：王鼎) | Liu, Derong (Liu, Derong.)

Indexed by：

EI Scopus SCIE

Abstract：

In　this　article,　a　novel　value　iteration　scheme　is　developed　with　convergence　and　stability　discussions.　A　relaxation　factor　is　introduced　to　adjust　the　convergence　rate　of　the　value　function　sequence.　The　convergence　conditions　with　respect　to　the　relaxation　factor　are　given.　The　stability　of　the　closed-loop　system　using　the　control　policies　generated　by　the　present　VI　algorithm　is　investigated.　Moreover,　an　integrated　VI　approach　is　developed　to　accelerate　and　guarantee　the　convergence　by　combining　the　advantages　of　the　present　and　traditional　value　iterations.　Also,　a　relaxation　function　is　designed　to　adaptively　make　the　developed　value　iteration　scheme　possess　fast　convergence　property.　Finally,　the　theoretical　results　and　the　effectiveness　of　the　present　algorithm　are　validated　by　numerical　examples.

Keyword：

Numerical stability reinforcement learning (RL) Stability criteria Adaptive dynamic programming (ADP) discrete-time nonlinear systems value iteration convergence rate Heuristic algorithms Approximation algorithms Optimal control Convergence admissible control policy Iterative algorithms

Author Community：

[ 1 ] [Ha, Mingming]Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[ 2 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
[ 4 ] [Liu, Derong]Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA

Reprint Author's Address：

Email：

hamingming_0705@foxmail.com |
dingwang@bjut.edu.cn |
derong@uic.edu

Show more details

Related Keywords：

Offline and Online Adaptive Critic Control Designs With Stability Guarantee Through Value Iteration
2021，IEEE TRANSACTIONS ON CYBERNETICS
Novel Discounted Adaptive Critic Control Designs With Accelerated Learning Formulation
2023，IEEE TRANSACTIONS ON CYBERNETICS
Evolving and Incremental Value Iteration Schemes for Nonlinear Discrete-Time Zero-Sum Games
2022，IEEE TRANSACTIONS ON CYBERNETICS
Dual Event-Triggered Constrained Control Through Adaptive Critic for Discrete-Time Zero-Sum Games
2022，IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS

Source ：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

ISSN： 2162-237X

Year： 2022

Issue： 10

Volume： 34

Page： 7430-7442

1 0 . 4

JCR@2022

1 0 . 4 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：46

JCR Journal Grade：1

CAS Journal Grade：1

Cited Count：

WoS CC Cited Count： 38

SCOPUS Cited Count： 44

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 5

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to