Abstract:
This article investigates the stability of the closed-loop system under the iterative control policies generated by several iterative Q-learning algorithms. First, a new stability criterion is developed for the value-iteration-based Q-learning (VIQL) algorithm, which is initialized by a positive semi-definite function. In this way, VIQL can provide an initial admissible control policy for the policy-iteration-based Q-learning (PIQL) algorithm. It is emphasized that the successive control policies generated by PIQL can stabilize the controlled system. Numerical results are provided to verify the effectiveness of the presented algorithms. © 2023 IEEE.
Year: 2023
Page: 39-43
Language: English
WoS CC Cited Count: 0
ESI Highly Cited Papers on the List: 0