• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zhao, Mingming (Zhao, Mingming.) | Wang, Ding (Wang, Ding.) | Qiao, Junfei (Qiao, Junfei.)

Indexed by:

EI Scopus SCIE

Abstract:

In this paper, a new stabilizing value iteration Q-learning (SVIQL) algorithm is presented to achieve the online evolving control for unknown nonlinear systems. To achieve this, we aim to establish the data-driven evolving control framework, ensure the stability of iterative policies derived from SVIQL, and reduce the computational cost in online learning. First, by using an admissible policy for initialization, the monotonicity and stability properties of SVIQL are theoretically analyzed. It is highlighted that under this SVIQL framework, the iterative Q-function sequence is monotonically nonincreasing and all iterative policies are admissible. Second, we compare the convergence speed, computational complexity, and evolving control results of SVIQL and policy iteration Q-learning methods. The results shows that SVIQL has faster policy update speed with smaller computational complexity. Third, in the online evolving control stage, the offline and online data are combined to learn the new iterative policy, which is used to regulate the nonlinear system over time. For implementing the SVIQL algorithm, the critic and action networks are constructed to approximate the iterative Q-function and control policy, respectively. Moreover, the convergence of two network weights is analyzed. Finally, numerical simulations are conducted to confirm the effectiveness of the online evolving control under the SVIQL scheme.

Keyword:

Adaptive dynamic programming Online evolving control Stabilizing value iteration Q-learning Adaptive critic control

Author Community:

  • [ 1 ] [Zhao, Mingming]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Qiao, Junfei]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 5 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 6 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 7 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 8 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 9 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 10 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 11 ] [Wang, Ding]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 12 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Qiao, Junfei]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Source :

NONLINEAR DYNAMICS

ISSN: 0924-090X

Year: 2024

Issue: 11

Volume: 112

Page: 9137-9153

5 . 6 0 0

JCR@2022

Cited Count:

WoS CC Cited Count: 2

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 5

Affiliated Colleges:

Online/Total:498/10620797
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.