Event-based online learning control design with eligibility trace for discrete-time unknown nonlinear systems - Details

Author：

Wang, D. (Wang, D..) | Wang, J. (Wang, J..) | Hu, L. (Hu, L..) | Zhao, M. (Zhao, M..)

Indexed by：

EI Scopus SCIE

Abstract：

As　a　heuristic　algorithm　to　solve　the　nonlinear　optimal　control　problem,　adaptive　dynamic　programming　is　commonly　constructed　from　one-step　temporal　difference　learning.　Eligibility　trace　can　effectively　speed　up　controller　learning　by　considering　the　multi-step　information　in　reinforcement　learning.　However,　eligibility　trace　will　bring　in　additional　computational　consumption　and　increase　the　learning　burden.　This　paper　attempts　to　take　advantage　of　eligibility　trace　and　avoids　the　high　learning　consumption　at　the　same　time.　Therefore,　an　event-based　neural　dynamic　programming　(λ)　[ENDP(λ)]　algorithm　via　the　actor–critic　framework　is　constructed　to　solve　the　near-optimal　control　problem　of　unknown　discrete-time　systems.　First,　the　modified　forward　view　with　eligibility　trace　is　derived,　which　is　suitable　for　engineering　practice.　Second,　based　on　the　event-triggered　mechanism,　ENDP(λ)　is　designed　to　relieve　the　pressure　of　communication　consumption.　Then,　the　event-based　system　is　proven　to　ensure　the　input-to-state　stability　under　a　suitable　triggering　condition.　Moreover,　three　neural　networks　are　given　to　approximate　the　one-step　cost　function,　the　n-step　cost　function,　and　the　control　law,　respectively.　Finally,　two　typical　experimental　simulation　examples　are　presented　to　verify　the　effectiveness　of　the　ENDP(λ)　algorithm.　©　2023　Elsevier　Ltd

Keyword：

Neural control Nonaffine nonlinear systems Eligibility trace Adaptive critic design Reinforcement learning Temporal difference

Author Community：

[ 1 ] [Wang D.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 2 ] [Wang D.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 3 ] [Wang D.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 4 ] [Wang D.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 5 ] [Wang J.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 6 ] [Wang J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 7 ] [Wang J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 8 ] [Wang J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 9 ] [Hu L.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 10 ] [Hu L.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 11 ] [Hu L.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 12 ] [Hu L.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 13 ] [Zhao M.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 14 ] [Zhao M.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 15 ] [Zhao M.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 16 ] [Zhao M.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Event-Based Approximate Neuro-Optimal Tracking Control Design Involving a Wastewater Treatment Application
2022，
Event-Based Approximate Neuro-Optimal Tracking Control Design Involving aWastewater Treatment Application
2022，2022 41ST CHINESE CONTROL CONFERENCE (CCC)
Adaptive critic design with weight allocation for intelligent learning control of wastewater treatment plants
2024，Engineering Applications of Artificial Intelligence
Approximate neural optimal control with reinforcement learning for a torsional pendulum device
2019，NEURAL NETWORKS

Source ：

Engineering Applications of Artificial Intelligence

ISSN： 0952-1976

Year： 2023

Volume： 123

8 . 0 0 0

JCR@2022

ESI Discipline： ENGINEERING;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 5

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 4

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to