Novel Discounted Adaptive Critic Control Designs With Accelerated Learning Formulation - Details

Author：

Ha, Mingming (Ha, Mingming.) | Wang, Ding (Wang, Ding.) (Scholars：王鼎) | Liu, Derong (Liu, Derong.)

Indexed by：

EI Scopus SCIE

Abstract：

Inspired　by　the　successive　relaxation　method,　a　novel　discounted　iterative　adaptive　dynamic　programming　framework　is　developed,　in　which　the　iterative　value　function　sequence　possesses　an　adjustable　convergence　rate.　The　different　convergence　properties　of　the　value　function　sequence　and　the　stability　of　the　closed-loop　systems　under　the　new　discounted　value　iteration　(VI)　are　investigated.　Based　on　the　properties　of　the　given　VI　scheme,　an　accelerated　learning　algorithm　with　convergence　guarantee　is　presented.　Moreover,　the　implementations　of　the　new　VI　scheme　and　its　accelerated　learning　design　are　elaborated,　which　involve　value　function　approximation　and　policy　improvement.　A　nonlinear　fourth-order　ball-and-beam　balancing　plant　is　used　to　verify　the　performance　of　the　developed　approaches.　Compared　with　the　traditional　VI,　the　present　discounted　iterative　adaptive　critic　designs　greatly　accelerate　the　convergence　rate　of　the　value　function　and　reduce　the　computational　cost　simultaneously.

Keyword：

Iterative methods Stability criteria adaptive dynamic programming (ADP) Cost function reinforcement learning Power system stability Adaptive critic designs Closed loop systems fast convergence rate Optimal control Convergence value iteration (VI) discrete-time nonlinear systems

Author Community：

[ 1 ] [Ha, Mingming]Ant Grp, MYbank, Beijing 100020, Peoples R China
[ 2 ] [Ha, Mingming]Univ Sci & Technol Beijing, Sch Automation & Elect Engn, Beijing 100083, Peoples R China
[ 3 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
[ 4 ] [Liu, Derong]Southern Univ Sci & Technol, Sch Syst Design & Intelligent Mfg, Shenzhen 518055, Peoples R China
[ 5 ] [Liu, Derong]Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA

Reprint Author's Address：

[Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China;;

Email：

hamingming.hmm@mybank.cn |
dingwang@bjut.edu.cn |
liudr@sustech.edu.cn

Show more details

Related Keywords：

Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control
2022，IEEE-CAA JOURNAL OF AUTOMATICA SINICA
Advanced value iteration for discrete-time intelligent critic control: A survey
2023，ARTIFICIAL INTELLIGENCE REVIEW
Iterative Q-Learning for Model-Free Optimal Control With Adjustable Convergence Rate
2024，IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS
A Novel Value Iteration Scheme With Adjustable Convergence Rate
2022，IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Source ：

IEEE TRANSACTIONS ON CYBERNETICS

ISSN： 2168-2267

Year： 2023

Issue： 5

Volume： 54

Page： 3003-3016

1 1 . 8 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count： 19

SCOPUS Cited Count： 25

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 12

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to