• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, Y. (Wang, Y..) | Wang, D. (Wang, D..) | Zhao, M. (Zhao, M..) | Liu, N. (Liu, N..) | Qiao, J. (Qiao, J..)

Indexed by:

EI Scopus SCIE

Abstract:

In this paper, an adjustable Q-learning scheme is developed to solve the discrete-time nonlinear zero-sum game problem, which can accelerate the convergence rate of the iterative Q-function sequence. First, the monotonicity and convergence of the iterative Q-function sequence are analyzed under some conditions. Moreover, by employing neural networks, the model-free tracking control problem can be overcome for zero-sum games. Second, two practical algorithms are designed to guarantee the convergence with accelerated learning. In one algorithm, an adjustable acceleration phase is added to the iteration process of Q-learning, which can be adaptively terminated with convergence guarantee. In another algorithm, a novel acceleration function is developed, which can adjust the relaxation factor to ensure the convergence. Finally, through a simulation example with the practical physical background, the fantastic performance of the developed algorithm is demonstrated with neural networks. © 2024 Elsevier Ltd

Keyword:

Zero-sum games Q-learning Adaptive dynamic programming Neural networks Convergence rate Optimal tracking control

Author Community:

  • [ 1 ] [Wang Y.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 2 ] [Wang Y.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 3 ] [Wang Y.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
  • [ 4 ] [Wang Y.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
  • [ 5 ] [Wang D.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 6 ] [Wang D.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 7 ] [Wang D.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
  • [ 8 ] [Wang D.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
  • [ 9 ] [Zhao M.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 10 ] [Zhao M.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 11 ] [Zhao M.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
  • [ 12 ] [Zhao M.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
  • [ 13 ] [Liu N.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 14 ] [Liu N.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 15 ] [Liu N.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
  • [ 16 ] [Liu N.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
  • [ 17 ] [Qiao J.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 18 ] [Qiao J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 19 ] [Qiao J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
  • [ 20 ] [Qiao J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

Neural Networks

ISSN: 0893-6080

Year: 2024

Volume: 175

7 . 8 0 0

JCR@2022

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 3

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:295/10608067
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.