Online Value Iteration for Discrete-Time Nonlinear Optimal Regulation with Stability Guarantee - Details

Author：

Wang, Yuan (Wang, Yuan.) | Wang, Ding (Wang, Ding.) (Scholars：王鼎) | Wu, Junlong (Wu, Junlong.) | Zhao, Mingming (Zhao, Mingming.)

Indexed by：

CPCI-S EI Scopus

Abstract：

In　this　paper,　the　intelligent　and　online　value　iteration　(VI)　algorithms　are　developed　to　solve　the　optimal　control　problem　for　nonlinear　discrete-time　systems.　First,　the　intelligent　VI　algorithm　combines　the　advantages　of　traditional　VI　initialized　by　the　zero　cost　function　and　stabilizing　VI　initialized　by　the　admissible　control　policy.　The　traditional　VI　is　easy　to　implement　and　can　provide　the　initial　admissible　control　policy　for　the　stabilizing　VI.　Meanwhile,　stabilizing　VI　can　guarantee　all　control　policies　are　admissible.　Second,　based　on　the　concept　of　the　attraction　domain,　an　online　value　iteration　algorithm　is　proposed　to　regulate　the　closed-loop　system　by　using　immature　control　policies　rather　than　the　fixed　optimal　control　policy.　It　ensures　that　the　state　trajectory　converges　to　the　origin　of　the　attraction　domain.　Finally,　simulations　are　carried　out　and　the　results　show　the　effectiveness　of　the　two　new　VI　algorithms.

Keyword：

asymptotic stability Adaptive dynamic programming attraction domain online value iteration

Author Community：

[ 1 ] [Wang, Yuan]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 3 ] [Wu, Junlong]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
[ 4 ] [Zhao, Mingming]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

Reprint Author's Address：

Email：

wangyuan@emails.bjut.edu.cn |
dingwang@bjut.edu.cn |
wujunlong@emails.bjut.edu.cn |
zhaomm@emails.bjut.edu.cn

Show more details

Related Keywords：

Offline and Online Adaptive Critic Control Designs With Stability Guarantee Through Value Iteration
2021，IEEE TRANSACTIONS ON CYBERNETICS
Value-iteration-based affine nonlinear optimal control involving admissibility discussion
2022，INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL
Advanced Affine Optimal Tracking Control Through Online Value Iteration and Its Stability Proof
2022，
Advanced Affine Optimal Tracking Control Through Online Value Iteration and Its Stability Proof
2022，2022 41ST CHINESE CONTROL CONFERENCE (CCC)

Source ：

2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR

Year： 2022

Page： 262-268

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 4

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to