Abstract:
In this paper, a value-iteration-based off-policy Q-learning algorithm is developed to solve the optimal regulation problem for nonlinear systems with unknown dynamics. Under the off-policy mechanism, the algorithm uses a behavioral policy for full exploration, which helps prevent the target policy from converging to a local optimum. In addition, a relaxation factor is introduced to adjust the convergence rate of the cost-function sequence. To implement the algorithm, a critic network and an action network are used to approximate the optimal Q-function and the optimal control policy, respectively. Finally, a simulation example demonstrates the effectiveness of the proposed algorithm. © 2024 IEEE.
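The abstract describes the update scheme only at a high level. Below is a minimal sketch of a relaxed value-iteration Q-learning update of this general kind, assuming a scalar plant f, a quadratic stage cost r, a polynomial critic feature map phi, a discretized control set, and a uniform behavioral policy; all of these choices are illustrative assumptions, not the paper's actual formulation or its neural-network implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def f(x, u):
    # Assumed plant dynamics; treated as unknown by the learner and used
    # here only to generate off-policy transition data.
    return 0.8 * np.sin(x) + u

def r(x, u):
    # Assumed quadratic stage cost for the regulation problem.
    return x**2 + u**2

def phi(x, u):
    # Quadratic feature map so the critic is Q(x, u) = w . phi(x, u).
    x, u = np.broadcast_arrays(np.asarray(x, float), np.asarray(u, float))
    return np.stack([x**2, x * u, u**2], axis=-1)

U = np.linspace(-2.0, 2.0, 41)   # discretized control set for the greedy min

# Behavioral policy: uniform exploration over states and controls, so the
# data cover the region of interest independently of the target policy.
X = rng.uniform(-3.0, 3.0, 2000)
Ub = rng.uniform(-2.0, 2.0, 2000)
Xn = f(X, Ub)

w = np.zeros(3)     # critic weights, Q_0 = 0
lam = 0.5           # relaxation factor: adjusts the convergence rate

for k in range(200):
    # Greedy target: r(x, u) + min_{u'} Q_k(x', u') at the sampled next states.
    Qn = np.stack([phi(Xn, u) @ w for u in U], axis=1)
    target = r(X, Ub) + Qn.min(axis=1)
    # Relaxed value-iteration update:
    # Q_{k+1}(x, u) = (1 - lam) Q_k(x, u) + lam [r(x, u) + min_{u'} Q_k(x', u')]
    relaxed = (1.0 - lam) * (phi(X, Ub) @ w) + lam * target
    w_new, *_ = np.linalg.lstsq(phi(X, Ub), relaxed, rcond=None)
    if np.linalg.norm(w_new - w) < 1e-8:
        break
    w = w_new

def policy(x):
    # Target policy recovered greedily from the converged critic; the paper
    # trains an action network instead, which this lookup merely stands in for.
    return U[int(np.argmin([phi(x, u) @ w for u in U]))]

print("critic weights:", w, " u(1.0) =", policy(1.0))
```

With lam = 1 the loop reduces to the standard value-iteration update; smaller values blend the new Bellman target with the current critic, which is the convergence-rate role the abstract attributes to the relaxation factor.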
Year: 2024
Page: 2717-2722
Language: English