• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Ha, Mingming (Ha, Mingming.) | Wang, Ding (Wang, Ding.) (Scholars:王鼎) | Liu, Derong (Liu, Derong.)

Indexed by:

EI Scopus SCIE

Abstract:

In this article, a novel value iteration scheme is developed with convergence and stability discussions. A relaxation factor is introduced to adjust the convergence rate of the value function sequence. The convergence conditions with respect to the relaxation factor are given. The stability of the closed-loop system using the control policies generated by the present VI algorithm is investigated. Moreover, an integrated VI approach is developed to accelerate and guarantee the convergence by combining the advantages of the present and traditional value iterations. Also, a relaxation function is designed to adaptively make the developed value iteration scheme possess fast convergence property. Finally, the theoretical results and the effectiveness of the present algorithm are validated by numerical examples.

Keyword:

Numerical stability reinforcement learning (RL) Stability criteria Adaptive dynamic programming (ADP) discrete-time nonlinear systems value iteration convergence rate Heuristic algorithms Approximation algorithms Optimal control Convergence admissible control policy Iterative algorithms

Author Community:

  • [ 1 ] [Ha, Mingming]Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
  • [ 2 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
  • [ 4 ] [Liu, Derong]Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA

Reprint Author's Address:

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

ISSN: 2162-237X

Year: 2022

Issue: 10

Volume: 34

Page: 7430-7442

1 0 . 4

JCR@2022

1 0 . 4 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:46

JCR Journal Grade:1

CAS Journal Grade:1

Cited Count:

WoS CC Cited Count: 38

SCOPUS Cited Count: 44

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Affiliated Colleges:

Online/Total:575/10595632
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.