• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, D. (Wang, D..) | Wang, Y. (Wang, Y..) | Zhao, M. (Zhao, M..) | Qiao, J. (Qiao, J..)

Indexed by:

Scopus

Abstract:

This article aims to achieve data-based online evolving control for zero-sum games with unknown dynamics. First of all, the value-iteration-based Q-learning framework is established. Relevant properties of the iterative Q-learning framework are analyzed, including the convergence and monotonicity. Then, the stability property is investigated and the online data is employed for off-policy learning. More importantly, two effective algorithms are designed to achieve online evolving control. In one algorithm, the monotonically nondecreasing Q-learning sequence requires the admissible criterion to guarantee the stability with the simple Q-function initialization. In another algorithm, the monotonically nonincreasing Q-function sequence can ensure the stability without the admissible criterion, but it requires an elaborate initial Q-function. In the end, by including two examples of real physical backgrounds, the excellent performance of online evolving control is exhibited with the given algorithms.  © 2013 IEEE.

Keyword:

Adaptive dynamic programming (ADP) zero-sum games Q-learning stability analysis online evolving control

Author Community:

  • [ 1 ] [Wang D.]Beijing University of Technology, School of Information Science and Technology, The Beijing Key Laboratory of Computational Intelligence and Intelligent System, The Beijing Laboratory of Smart Environmental Protection, The Beijing Institute of Artificial Intelligence, Beijing, 100124, China
  • [ 2 ] [Wang Y.]Beijing University of Technology, School of Information Science and Technology, The Beijing Key Laboratory of Computational Intelligence and Intelligent System, The Beijing Laboratory of Smart Environmental Protection, The Beijing Institute of Artificial Intelligence, Beijing, 100124, China
  • [ 3 ] [Zhao M.]Beijing University of Technology, School of Information Science and Technology, The Beijing Key Laboratory of Computational Intelligence and Intelligent System, The Beijing Laboratory of Smart Environmental Protection, The Beijing Institute of Artificial Intelligence, Beijing, 100124, China
  • [ 4 ] [Qiao J.]Beijing University of Technology, School of Information Science and Technology, The Beijing Key Laboratory of Computational Intelligence and Intelligent System, The Beijing Laboratory of Smart Environmental Protection, The Beijing Institute of Artificial Intelligence, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

IEEE Transactions on Systems, Man, and Cybernetics: Systems

ISSN: 2168-2216

Year: 2025

8 . 7 0 0

JCR@2022

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 9

Affiliated Colleges:

Online/Total:591/10637951
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.