• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, Ding (Wang, Ding.) | Tang, Guohan (Tang, Guohan.) | Ren, Jin (Ren, Jin.) | Zhao, Mingming (Zhao, Mingming.) | Qiao, Junfei (Qiao, Junfei.)

Indexed by:

EI Scopus SCIE

Abstract:

In this article, an advanced accelerated Q-learning (AQL) approach is designed to address the nonlinear discrete-time optimal tracking problem of zero-sum games with unknown dynamics. Different from conventional adaptive dynamic programming methods, the advanced Q-learning algorithm incorporates both the control input and the disturbance signal into the tracking error, which obviates the quadratic form of control and disturbance inputs directly. This innovative Q-function is used to derive the optimal tracking control policy pair that ensures the terminal tracking error asymptotically converges to zero, independent of the feedforward control input. In order to improve the convergence speed of the iterative process and reduce computational complexity, an accelerated factor is introduced. After collecting offline input-output data, a backpropagation neural network is employed to approximate the proposed Q-function, which enables model-free tracking control of zero-sum games through an off-policy learning mechanism. Furthermore, the theoretical properties of the developed algorithm are analyzed under specific preconditions. Finally, the effectiveness of the AQL algorithm is validated through a numerical simulation, which is implemented using a critic-only structure.

Keyword:

Zero-sum games Adaptive dynamic programming Neural networks Accelerated Q-learning Nonlinear tracking control

Author Community:

  • [ 1 ] [Wang, Ding]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Tang, Guohan]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Ren, Jin]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Zhao, Mingming]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 5 ] [Qiao, Junfei]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 6 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 7 ] [Tang, Guohan]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 8 ] [Ren, Jin]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 9 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 10 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 11 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 12 ] [Tang, Guohan]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 13 ] [Ren, Jin]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 14 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 15 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 16 ] [Wang, Ding]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 17 ] [Tang, Guohan]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 18 ] [Ren, Jin]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 19 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 20 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Wang, Ding]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China;;[Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China;;[Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China;;[Wang, Ding]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Source :

NONLINEAR DYNAMICS

ISSN: 0924-090X

Year: 2025

Issue: 13

Volume: 113

Page: 16679-16694

5 . 6 0 0

JCR@2022

Cited Count:

WoS CC Cited Count: 1

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Affiliated Colleges:

Online/Total:1897/10954688
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.