Indexed by:
Abstract:
In this article, in order to achieve optimal tracking control of unknown linear discrete sys-tems, a model-free scheme based on Q-learning is established online. First, we introduce an innovative performance index function, so as to eliminate the tracking error and avert the calculation for stable control policies of the reference trajectory. Taking value iteration and policy iteration into consideration, the corresponding model-based approaches are derived. Then, the Q-function is developed and the model-free algorithm utilizing Q-learning is given for the sake of dealing with the linear quadratic tracking (LQT) problem online with-out relying on system dynamics information. In addition, novel stability analysis based on Q-learning is provided for the discounted LQT control issue and the probing noise is demonstrated that it does not result in any excitation noise bias. Finally, by means of con-ducting numerical simulation, the proposed Q-learning algorithm is demonstrated to be effective and practicable.(c) 2023 Elsevier Inc. All rights reserved.
Keyword:
Reprint Author's Address:
Source :
INFORMATION SCIENCES
ISSN: 0020-0255
Year: 2023
Volume: 626
Page: 339-353
8 . 1 0 0
JCR@2022
ESI Discipline: COMPUTER SCIENCE;
ESI HC Threshold:19
Cited Count:
WoS CC Cited Count: 6
SCOPUS Cited Count: 8
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 2
Affiliated Colleges: