General multi-step value iteration for optimal learning control - Details

Author：

Wang, D. (Wang, D..) | Wang, J. (Wang, J..) | Liu, D. (Liu, D..) | Qiao, J. (Qiao, J..)

Indexed by：

EI Scopus SCIE

Abstract：

Learning　control　methods　have　been　widely　enhanced　by　reinforcement　learning,　but　it　is　challenging　to　analyze　the　effects　of　incorporating　extra　system　information.　This　paper　presents　a　novel　multi-step　framework　that　utilizes　extra　multi-step　system　information　to　solve　optimal　control　problems.　Within　this　framework,　we　establish　and　classify　general　multi-step　value　iteration　(MsVI)　algorithms　based　on　the　uniformity　between　policy　evaluation　and　improvement　stages.　According　to　this　uniformity　concept,　the　convergence　condition　and　the　acceleration　conclusion　are　analyzed　for　different　kinds　of　MsVI　algorithms.　Besides,　we　introduce　a　swarm　policy　optimizer　to　relieve　limitations　of　the　traditional　gradient　optimizer.　Specifically,　we　implement　general　MsVI　using　an　actor–critic　scheme,　where　the　swarm　optimizer　and　neural　networks　are　employed　for　policy　improvement　and　evaluation,　respectively.　Furthermore,　the　approximation　error　caused　by　the　approximator　is　also　considered　to　verify　the　advantage　of　using　multi-step　system　information.　Finally,　we　apply　the　proposed　method　to　a　nonlinear　benchmark　system,　demonstrating　superior　learning　ability　and　control　performance　compared　to　traditional　methods.　©　2025　The　Authors

Keyword：

Multi-step learning Adaptive dynamic programming Optimal control Value iteration Approximation error Particle swarm optimization Neural networks

Author Community：

[ 1 ] [Wang D.]School of Information Science and Technology, Beijing University of Technology, Beijing, 100124, China
[ 2 ] [Wang D.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 3 ] [Wang D.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 4 ] [Wang D.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 5 ] [Wang J.]School of Information Science and Technology, Beijing University of Technology, Beijing, 100124, China
[ 6 ] [Wang J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 7 ] [Wang J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 8 ] [Wang J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 9 ] [Liu D.]School of Automation and Intelligent Manufacturing, Southern University of Science and Technology, Shenzhen, 518055, China
[ 10 ] [Liu D.]Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, 60607, IL, United States
[ 11 ] [Qiao J.]School of Information Science and Technology, Beijing University of Technology, Beijing, 100124, China
[ 12 ] [Qiao J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 13 ] [Qiao J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 14 ] [Qiao J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Swarm-intelligence-based value iteration for optimal regulation of continuous-time nonlinear systems
2025，SWARM AND EVOLUTIONARY COMPUTATION
Improved value iteration for neural-network-based stochastic optimal control design
2020，NEURAL NETWORKS
Self-organizing neural intelligent control for nonlinear discrete-time systems with particle swarm optimization
2024，NONLINEAR DYNAMICS
Value-Iteration-Based Robust Adaptive Critic for Disturbed Non-Affine Continuous-Time Systems
2025，INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL

Source ：

Automatica

ISSN： 0005-1098

Year： 2025

Volume： 175

6 . 4 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 9

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to