Adjustable behavior-guided adaptive dynamic programming for neural learning control - Details

Author：

Tang, Guohan (Tang, Guohan.) | Wang, Ding (Wang, Ding.) | Liu, Ao (Liu, Ao.) | Qiao, Junfei (Qiao, Junfei.)

Indexed by：

EI Scopus SCIE

Abstract：

In　this　article,　an　adjustable　behavior-guided　adaptive　dynamic　programming　(BGADP)　algorithm　is　designed　to　solve　the　optimal　regulation　problem　for　discrete-time　systems.　In　conventional　adaptive　dynamic　programming　methods,　gradient　information　of　system　dynamics　is　necessary　for　conducting　policy　improvement.　However,　these　methods　face　challenges　when　gradient　information　cannot　be　computed　or　when　the　system　dynamics　is　non-differentiable.　To　overcome　these　limitations,　a　human-behavior-inspired　swarm　intelligence　approach　is　used　to　search　for　superior　policies　during　the　iterative　process,　eliminating　the　need　for　gradient　information.　Additionally,　a　relaxation　factor　is　introduced　into　the　value　function　update　to　accelerate　the　convergence　speed　of　the　algorithm.　The　monotonicity　and　convergence　properties　of　the　iterative　value　function　are　rigorously　analyzed.　Finally,　the　effectiveness　and　practicality　of　the　adjustable　BGADP　algorithm　are　validated　through　two　simulation　studies,　which　are　implemented　using　the　actor-critic　framework　with　neural　networks.

Keyword：

Optimal control Brain storm optimization Neural networks Adaptive dynamic programming Convergence rate Swarm intelligence

Author Community：

[ 1 ] [Wang, Ding]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
[ 2 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
[ 3 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
[ 4 ] [Wang, Ding]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

Reprint Author's Address：

[Wang, Ding]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China

Email：

tangguohan@emails.bjut.edu.cn |
dingwang@bjut.edu.cn |
liuao@emails.bjut.edu.cn |
adqiao@bjut.edu.cn

Show more details

Related Keywords：

Improved value iteration for neural-network-based stochastic optimal control design
2020，NEURAL NETWORKS
Iterative Q-Learning for Model-Free Optimal Control With Adjustable Convergence Rate
2024，IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS
Novel Discounted Adaptive Critic Control Designs With Accelerated Learning Formulation
2023，IEEE TRANSACTIONS ON CYBERNETICS
Decentralized Optimal Neurocontroller Design for Mismatched Interconnected Systems via Integral Policy Iteration
2024，IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS

Source ：

NEUROCOMPUTING

ISSN： 0925-2312

Year： 2025

Volume： 636

6 . 0 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to