A new Q-function structure for model-free adaptive optimal tracking control with asymmetric constrained inputs - Details

Author：

Zhao, M. (Zhao, M..) | Wang, D. (Wang, D..) | Li, M. (Li, M..) | Gao, N. (Gao, N..) | Qiao, J. (Qiao, J..)

Indexed by：

EI Scopus SCIE

Abstract：

This　article　aims　to　design　a　model-free　adaptive　tracking　controller　for　discrete-time　nonlinear　systems　with　unknown　dynamics　and　asymmetric　control　constraints.　First,　a　new　Q-function　structure　is　designed　by　introducing　the　control　input　into　the　tracking　error　of　the　next　moment,　in　order　to　eliminate　the　final　tracking　error,　avoid　the　steady　control,　and　ignore　the　discount　factor.　Second,　via　system　transformation,　a　general　performance　index　is　developed　to　overcome　the　challenge　caused　by　asymmetric　constraints　of　implicit　control　inputs.　By　this　operation,　the　constrained　tracking　problem　is　converted　to　an　unconstrained　optimal　tracking　problem　without　the　traditional　nonquadratic　performance　function　that　is　only　applicable　to　explicit　control　inputs.　Then,　a　value-iteration-based　Q-learning　(VIQL)　algorithm　is　derived　to　seek　the　optimal　Q-function　and　the　optimal　control　policy　by　using　offline　data　rather　than　the　mathematical　model.　Next,　the　convergence,　monotonicity,　and　stability　properties　of　VIQL　are　investigated　to　demonstrate　that　the　iterative　Q-function　sequence　can　converge　to　the　optimal　Q-function　under　ideal　conditions.　To　realize　the　VIQL　algorithm,　the　critic　neural　network　is　employed　to　approximate　the　Q-function.　Finally,　simulation　results　and　comparative　experiments　are　conducted　to　demonstrate　the　validity　and　effectiveness　of　the　present　VIQL　scheme.　©　2024　John　Wiley　&　Sons　Ltd.

Keyword：

adaptive dynamic programming model-free adaptive optimal tracking asymmetric control constraints value-iteration-based Q-learning

Author Community：

[ 1 ] [Zhao M.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 2 ] [Zhao M.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
[ 3 ] [Zhao M.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China
[ 4 ] [Zhao M.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China
[ 5 ] [Wang D.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 6 ] [Wang D.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
[ 7 ] [Wang D.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China
[ 8 ] [Wang D.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China
[ 9 ] [Li M.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 10 ] [Li M.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
[ 11 ] [Li M.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China
[ 12 ] [Li M.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China
[ 13 ] [Gao N.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 14 ] [Gao N.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
[ 15 ] [Gao N.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China
[ 16 ] [Gao N.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China
[ 17 ] [Qiao J.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 18 ] [Qiao J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
[ 19 ] [Qiao J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China
[ 20 ] [Qiao J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Stability Analysis of Model-Free Control under Iterative Q-learning Algorithms
2023，
Decentralized controller design with asymmetric input constraints for unknown unmatched interconnected systems; [未知不匹配互联系统的非对称输入约束分散控制器设计]
2024，Chinese Journal of Engineering
Improved value iteration for nonlinear tracking control with accelerated learning
2024，International Journal of Robust and Nonlinear Control
An advanced robust integral reinforcement learning scheme with the fuzzy inference system
2024，INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL

Source ：

International Journal of Adaptive Control and Signal Processing

ISSN： 0890-6327

Year： 2024

Issue： 5

Volume： 38

Page： 1561-1578

3 . 1 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 4

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 12

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to