• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, Ding (Wang, Ding.) | Wang, Yuan (Wang, Yuan.) | Zhao, Mingming (Zhao, Mingming.) | Qiao, Junfei (Qiao, Junfei.)

Indexed by:

Scopus SCIE

Abstract:

This article aims to achieve data-based online evolving control for zero-sum games with unknown dynamics. First of all, the value-iteration-based Q-learning framework is established. Relevant properties of the iterative Q-learning framework are analyzed, including the convergence and monotonicity. Then, the stability property is investigated and the online data is employed for off-policy learning. More importantly, two effective algorithms are designed to achieve online evolving control. In one algorithm, the monotonically nondecreasing Q-learning sequence requires the admissible criterion to guarantee the stability with the simple Q-function initialization. In another algorithm, the monotonically nonincreasing Q-function sequence can ensure the stability without the admissible criterion, but it requires an elaborate initial Q-function. In the end, by including two examples of real physical backgrounds, the excellent performance of online evolving control is exhibited with the given algorithms.

Keyword:

Stability criteria zero-sum games Game theory Process control Optimal control Adaptive dynamic programming (ADP) Iterative methods Games Q-learning Trajectory tracking Convergence stability analysis online evolving control Cost function

Author Community:

  • [ 1 ] [Wang, Ding]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing Lab Smart Environm Protect, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 2 ] [Wang, Yuan]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing Lab Smart Environm Protect, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 3 ] [Zhao, Mingming]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing Lab Smart Environm Protect, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 4 ] [Qiao, Junfei]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing Lab Smart Environm Protect, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 5 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 6 ] [Wang, Yuan]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 7 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 8 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Wang, Ding]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing Lab Smart Environm Protect, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China;;[Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS

ISSN: 2168-2216

Year: 2025

8 . 7 0 0

JCR@2022

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Affiliated Colleges:

Online/Total:2878/10986392
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.