• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, Ding (Wang, Ding.) | Wang, Jiangyu (Wang, Jiangyu.) | Hu, Lingzhi (Hu, Lingzhi.) | Zhang, Liguo (Zhang, Liguo.) (Scholars:张利国)

Indexed by:

EI Scopus SCIE

Abstract:

Parallelization is widely employed to improve the exploration ability of controllers. However, it is rare to provide a lightweight scheme for reducing homogeneous policies with theoretical guarantees. This article is concerned with a novel parallel scheme for solving optimal control problems. In brief, we design a novel global indicator that inherits the theoretical guarantees of a class of iterative reinforcement learning algorithms. By generating a tentative function, the global indicator can guide and communicate with parallel controllers to accelerate the learning process. Using two typical exploration policies, the novel parallel scheme can rapidly compress the neighborhood of the optimal cost function. Besides, two parallel algorithms based on value iteration and Q-learning are established to improve the data efficiency through different extensions. Finally, two benchmark problems are presented to demonstrate the learning effectiveness of the novel parallel scheme.

Keyword:

Adaptive critic nonlinear control parallel learning Q-learning reinforcement learning (RL) discrete-time systems value iteration (VI)

Author Community:

  • [ 1 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Wang, Jiangyu]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Hu, Lingzhi]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 5 ] [Wang, Jiangyu]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 6 ] [Hu, Lingzhi]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 7 ] [Zhang, Liguo]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Sch Informat Sci & Technol, Beijing 100124, Peoples R China;;[Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China;;

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS

ISSN: 2168-2216

Year: 2024

Issue: 10

Volume: 54

Page: 6320-6331

8 . 7 0 0

JCR@2022

Cited Count:

WoS CC Cited Count: 2

SCOPUS Cited Count: 3

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:656/10700473
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.