Author:

Qiao, Junfei | Zhao, Mingming | Wang, Ding (王鼎) | Ha, Mingming

Indexed by:

EI Scopus SCIE

Abstract:

This article focuses on the deterministic value-iteration-based Q-learning (VIQL) algorithm with adjustable convergence speed and verifies it on trajectory tracking for completely unknown nonaffine systems. Through the learning rates, the convergence speed can be adjusted, and a new convergence criterion for the VIQL framework is investigated. The merit of the adjustable VIQL scheme is that it can speed up learning and reduce the number of iterations, thereby reducing the computational burden. To carry out the model-free VIQL algorithm, offline data of system states and reference trajectories are collected to provide the reference control, the tracking error, and the tracking control, which drive the parameter updating of the adjustable VIQL algorithm via an off-policy learning scheme. With this updating operation, the convergent optimal tracking policy can guarantee that an arbitrary initial state tracks the desired trajectory and can completely eliminate the terminal tracking error. Finally, numerical simulations are conducted to demonstrate the validity of the designed tracking control algorithm.
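
Illustrative sketch (not the authors' algorithm): the abstract describes a value-iteration Q-learning update whose convergence speed depends on a learning rate. The Python code below shows one common way such an update can look, a relaxed Bellman backup Q_{k+1} = (1 - alpha) Q_k + alpha (U + gamma min_u' Q_k), on a toy tabular tracking-error problem. The function name relaxed_viql, the five-level error grid, the three control values, the quadratic cost, and the discount factor gamma are all assumptions made for illustration; the paper's actual update law, learning-rate range, and convergence criterion are not reproduced here.

```python
import numpy as np

def relaxed_viql(alpha, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Tabular value-iteration Q-learning with learning rate alpha (toy sketch)."""
    errors = np.arange(-2, 3)           # hypothetical tracking errors -2..2
    controls = np.array([-1, 0, 1])     # hypothetical tracking controls
    # Deterministic toy dynamics: e_next = clip(e + u), stored as row indices.
    nxt = np.clip(errors[:, None] + controls[None, :], -2, 2) + 2
    # Assumed quadratic utility U(e, u) = e^2 + 0.1 u^2.
    cost = errors[:, None] ** 2 + 0.1 * controls[None, :] ** 2
    q = np.zeros(cost.shape)            # Q_0 = 0
    for k in range(1, max_iter + 1):
        # Bellman target: U(e, u) + gamma * min over u' of Q_k(e_next, u').
        target = cost + gamma * q.min(axis=1)[nxt]
        # Learning rate alpha blends the old Q-function with the target.
        q_new = (1 - alpha) * q + alpha * target
        if np.max(np.abs(q_new - q)) < tol:
            return q_new, k
        q = q_new
    return q, max_iter

if __name__ == "__main__":
    for alpha in (0.5, 1.0):
        q, iters = relaxed_viql(alpha)
        print(f"alpha={alpha}: stopped after {iters} iterations")
```

In this toy setting both learning rates reach the same Q-function, and the smaller rate needs more iterations before the stopping test is met. Whether rates above 1 speed things up depends on the problem and on the convergence criterion derived in the article, which this sketch does not attempt to capture.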

Keyword:

Q-learning; convergence speed; adaptive critic control; optimal tracking; adaptive dynamic programming (ADP)

Author Community:

  • [ 1 ] [Qiao, Junfei]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 2 ] [Zhao, Mingming]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 3 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 4 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 5 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 6 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 7 ] [Ha, Mingming]Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

Reprint Author's Address:

  • [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS

ISSN: 2168-2216

Year: 2023

Issue: 2

Volume: 54

Page: 1202-1213

Impact Factor: 8.700 (JCR@2022)

Cited Count:

WoS CC Cited Count: 11

SCOPUS Cited Count: 13

ESI Highly Cited Papers on the List: 0
