Dichotomy value iteration with parallel learning design towards discrete-time zero-sum games - Details

Author：

Wang, J. (Wang, J..) | Wang, D. (Wang, D..) | Li, X. (Li, X..) | Qiao, J. (Qiao, J..)

Indexed by：

EI Scopus SCIE

Abstract：

In　this　paper,　a　novel　parallel　learning　framework　is　developed　to　solve　zero-sum　games　for　discrete-time　nonlinear　systems.　Briefly,　the　purpose　of　this　study　is　to　determine　a　tentative　function　according　to　the　prior　knowledge　of　the　value　iteration　(VI)　algorithm.　The　learning　process　of　the　parallel　controllers　can　be　guided　by　the　tentative　function.　That　is　to　say,　the　neighborhood　of　the　optimal　cost　function　can　be　compressed　within　a　small　range　via　two　typical　exploration　policies.　Based　on　the　parallel　learning　framework,　a　novel　dichotomy　VI　algorithm　is　established　to　accelerate　the　learning　speed.　It　is　shown　that　the　parallel　controllers　will　converge　to　the　optimal　policy　from　contrary　initial　policies.　Finally,　two　typical　systems　are　used　to　demonstrate　the　learning　performance　of　the　constructed　dichotomy　VI　algorithm.　©　2023　Elsevier　Ltd

Keyword：

Value iteration Parallel learning Zero-sum games Artificial neural networks Adaptive critic Nonlinear systems

Author Community：

[ 1 ] [Wang J.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 2 ] [Wang J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 3 ] [Wang J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 4 ] [Wang J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 5 ] [Wang D.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 6 ] [Wang D.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 7 ] [Wang D.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 8 ] [Wang D.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 9 ] [Li X.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 10 ] [Li X.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 11 ] [Li X.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 12 ] [Li X.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
[ 13 ] [Qiao J.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 14 ] [Qiao J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
[ 15 ] [Qiao J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
[ 16 ] [Qiao J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Improved value iteration for neural-network-based stochastic optimal control design
2020，NEURAL NETWORKS
Neural critic learning with accelerated value iteration for nonlinear model predictive control
2024，Neural Networks
Swarm-intelligence-based value iteration for optimal regulation of continuous-time nonlinear systems
2025，SWARM AND EVOLUTIONARY COMPUTATION
Asymmetric Constrained Optimal Tracking Control With Critic Learning of Nonlinear Multiplayer Zero-Sum Games
2022，IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Source ：

Neural Networks

ISSN： 0893-6080

Year： 2023

Volume： 167

Page： 751-762

7 . 8 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 5

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 8

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to