• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, D. (Wang, D..) | Wang, J. (Wang, J..) | Liu, D. (Liu, D..) | Qiao, J. (Qiao, J..)

Indexed by:

EI Scopus SCIE

Abstract:

Learning control methods have been widely enhanced by reinforcement learning, but it is challenging to analyze the effects of incorporating extra system information. This paper presents a novel multi-step framework that utilizes extra multi-step system information to solve optimal control problems. Within this framework, we establish and classify general multi-step value iteration (MsVI) algorithms based on the uniformity between policy evaluation and improvement stages. According to this uniformity concept, the convergence condition and the acceleration conclusion are analyzed for different kinds of MsVI algorithms. Besides, we introduce a swarm policy optimizer to relieve limitations of the traditional gradient optimizer. Specifically, we implement general MsVI using an actor–critic scheme, where the swarm optimizer and neural networks are employed for policy improvement and evaluation, respectively. Furthermore, the approximation error caused by the approximator is also considered to verify the advantage of using multi-step system information. Finally, we apply the proposed method to a nonlinear benchmark system, demonstrating superior learning ability and control performance compared to traditional methods. © 2025 The Authors

Keyword:

Multi-step learning Adaptive dynamic programming Optimal control Value iteration Approximation error Particle swarm optimization Neural networks

Author Community:

  • [ 1 ] [Wang D.]School of Information Science and Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 2 ] [Wang D.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 3 ] [Wang D.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
  • [ 4 ] [Wang D.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
  • [ 5 ] [Wang J.]School of Information Science and Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 6 ] [Wang J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 7 ] [Wang J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
  • [ 8 ] [Wang J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China
  • [ 9 ] [Liu D.]School of Automation and Intelligent Manufacturing, Southern University of Science and Technology, Shenzhen, 518055, China
  • [ 10 ] [Liu D.]Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, 60607, IL, United States
  • [ 11 ] [Qiao J.]School of Information Science and Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 12 ] [Qiao J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, 100124, China
  • [ 13 ] [Qiao J.]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, 100124, China
  • [ 14 ] [Qiao J.]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

Automatica

ISSN: 0005-1098

Year: 2025

Volume: 175

6 . 4 0 0

JCR@2022

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 9

Affiliated Colleges:

Online/Total:468/10634003
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.