Details - Beijing University of Technology Institutional Repository (北京工业大学机构库)

Query:

Scholar name: Wang, Ding (王鼎)


Event-Triggered Adaptive Finite-Time Control for a Robotic Manipulator System With Global Prescribed Performance and Asymptotic Tracking SCIE

Journal article | 2025, 55 (3), 1045-1055 | IEEE TRANSACTIONS ON CYBERNETICS

Sui, Jihang | Niu, Ben | Ou, Yongsheng | Zhao, Xudong | Wang, Ding

Abstract ：

This article studies the dynamic event-triggered adaptive finite-time tracking control issue for a robotic manipulator (RM) system with disturbances. First, a new global prescribed performance function (PPF) is designed based on a scaling function such that the tracking error evolves within the constrained bounds and the restriction related to the initial conditions is removed. Then, the finite-time command filter (FTCF) is used to avoid the direct derivations of virtual controllers and the singularity issue of the conventional backstepping technique. Moreover, the filtering errors caused by the FTCF are removed by the designed error compensation mechanism. A novel dynamic event-triggered mechanism (DETM) using the dynamic auxiliary variable is designed to save communication resources. The proposed control scheme can guarantee that all signals of the RM are globally bounded within a finite time, and the tracking error can asymptotically reach zero. Finally, a simulation example and several comparative simulations show the validity of the proposed scheme.

Keyword ：

dynamic event-triggered control (DETC); robotic manipulator (RM); Backstepping; global prescribed performance; Adaptive systems; Convergence; finite-time control (FTC); Manipulator dynamics; Transient analysis; Event detection; Asymptotic stability; Vectors; Symmetric matrices; Heuristic algorithms; Adaptive asymptotic tracking control

Enhancing offline reinforcement learning for wastewater treatment via transition filter and prioritized approximation loss SCIE

Journal article | 2025, 636 | NEUROCOMPUTING

Yang, Ruyue | Wang, Ding | Li, Menghua | Cui, Chengyu | Qiao, Junfei

Abstract ：

Wastewater treatment plays a crucial role in urban society, requiring efficient control strategies to optimize its performance. In this paper, we propose an enhanced offline reinforcement learning (RL) approach for wastewater treatment. Our algorithm improves the learning process. It uses a transition filter to sort out low-performance transitions and employs prioritized approximation loss to achieve prioritized experience replay with uniformly sampled loss. Additionally, the variational autoencoder is introduced to address the problem of distribution shift in offline RL. The proposed approach is evaluated on a nonlinear system and wastewater treatment simulation platform, demonstrating its effectiveness in achieving optimal control. The contributions of this paper include the development of an improved offline RL algorithm for wastewater treatment and the integration of transition filtering and prioritized approximation loss. Evaluation results demonstrate that the proposed algorithm achieves lower tracking error and cost.

Keyword ：

Adaptive dynamic programming; Variational autoencoder; Offline reinforcement learning; Wastewater treatment

Swarm-intelligence-based value iteration for optimal regulation of continuous-time nonlinear systems SCIE

Journal article | 2025, 95 | SWARM AND EVOLUTIONARY COMPUTATION

Wang, Ding | Hu, Qinna | Liu, Ao | Qiao, Junfei

Abstract ：

In this article, a swarm-intelligence-based value iteration (VI) algorithm is constructed to resolve the optimal control issue for continuous-time (CT) nonlinear systems. By leveraging the evolutionary concept of particle swarm optimization (PSO), the challenge of gradient vanishing is effectively overcome compared to traditional adaptive dynamic programming (ADP). Specifically, a PSO-based action network is implemented to perform policy improvement, eliminating the reliance on gradient information. Furthermore, within the ADP framework, the swarm-intelligence-based VI algorithm for CT systems is developed to address the challenges associated with constraints of initial admissible conditions and the difficulty of selecting probing signals in the traditional policy iteration method. The theoretical analysis is provided to show the convergence of the developed VI algorithm and the stability of the closed-loop system, respectively. Finally, under affine and non-affine backgrounds, two simulations are conducted to demonstrate the effectiveness and optimality of the established swarm-intelligence-based VI scheme for CT systems.

Keyword ：

Intelligent optimal control; Reinforcement learning; Continuous-time nonlinear systems; Value iteration; Adaptive dynamic programming; Particle swarm optimization

Value-Iteration-Based Robust Adaptive Critic for Disturbed Non-Affine Continuous-Time Systems SCIE

Journal article | 2025 | INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL

Liu, Ao | Wang, Ding | He, Yingyun | Ye, Kai | Qiao, Junfei

Abstract ：

In this paper, a novel value-iteration-based adaptive critic scheme is developed to address the H-infinity control problem for non-linear non-affine continuous-time (CT) systems with disturbances. Recurrent neural networks are employed to model the non-linear non-affine systems, thereby covering the unknown system dynamics. Based on the transformation of the optimal-robust problem, the H-infinity control issue is established to deal with disturbances. By introducing the accelerated factor, the value-iteration-based adaptive dynamic programming approach is developed to design controllers for non-linear CT systems subject to input constraints. The need for an initial admissible control law, which is a difficult requirement in traditional policy iteration, is eliminated. In addition, the accelerated factor improves the speed of the learning process. The convergence of the established method and the stability of the closed-loop system are presented in corresponding theorems. Finally, the effectiveness of the novel value-iteration-based adaptive critic is validated through two examples.

Keyword ：

H-infinity control; value iteration; adaptive dynamic programming; optimal control

Self-triggered neural tracking control for discrete-time nonlinear systems via adaptive critic learning SCIE

Journal article | 2025, 186 | NEURAL NETWORKS

Hu, Lingzhi | Wang, Ding | Wang, Gongming | Qiao, Junfei

Abstract ：

In this paper, a novel self-triggered optimal tracking control method is developed based on the online action-critic technique for discrete-time nonlinear systems. First, an augmented plant is constructed by integrating the system state with the reference trajectory. This transformation redefines the optimal tracking control design as the optimal regulation issue of the reconstructed nonlinear error system. Subsequently, under the premise of ensuring the controlled system stability, a self-sampling function that depends solely on the sampling tracking error is devised, thereby determining the next triggering instant. This approach not only effectively reduces the computational burden but also eliminates the need for continuous evaluation of the triggering condition, as required in traditional event-based methods. Furthermore, the developed control method can be found to possess excellent triggering performance. The model, critic, and action neural networks are constructed to implement the online critic learning algorithm, enabling real-time adjustment of the tracking control policy to achieve optimal performance. Finally, an experimental plant with nonlinear characteristics is presented to illustrate the overall performance of the proposed online self-triggered tracking control strategy.

Keyword ：

Self-triggered mechanism; Adaptive critic control; Trajectory tracking; Discrete-time nonlinear systems; Neural networks; Stability analysis

A weighted knowledge extraction strategy for dynamic multi-objective optimization SCIE

Journal article | 2025, 92 | SWARM AND EVOLUTIONARY COMPUTATION

Xie, Yingbo | Qiao, Junfei | Wang, Ding

WoS CC Cited Count： 1

Abstract ：

Multi-objective evolutionary algorithms suffer from performance degradation when solving dynamic multi-objective optimization problems (DMOPs) with a new conditional configuration from scratch, which motivates the research on knowledge extraction. However, most knowledge extraction strategies only focus on obtaining effective information from a single knowledge source, while ignoring the useful information from other knowledge sources with similar properties. Motivated by this, a weighted multi-source knowledge extraction strategy-based dynamic multi-objective evolutionary algorithm is proposed. First, a similarity criterion based on angle information is constructed to quantify similarity between different source domains and the target domain. Second, a knowledge extraction technique is developed to select a specific number of individuals from each source domain using a distance metric. Third, a generation strategy based on a dynamic weighting mechanism is proposed, which generates a certain number of individuals and merges these individuals into the initial population within the new environment. Finally, comprehensive experiments are conducted on public DMOP benchmarks and demonstrate that the devised method significantly outperforms the state-of-the-art competing algorithms.

Keyword ：

Change response; Evolutionary environment; Dynamic multiobjective optimization; Evolutionary algorithms

Novel generalized policy iteration for efficient evolving control of nonlinear systems SCIE

Journal article | 2024, 608 | NEUROCOMPUTING

Huang, Haiming | Wang, Ding | Wang, Hua | Wu, Junlong | Zhao, Mingming

Abstract ：

In this article, we construct a novel generalized policy iteration framework to address optimal regulation problems for discrete-time nonlinear systems in a more efficient way. Relevant properties are investigated for the framework, including monotonicity and convergence of the iterative value function sequence as well as the admissibility of the iterative control policy. Additionally, an innovative approach is developed to seek an initial admissible control policy for the framework with an adjustable searching speed. Based on these, an evolving control algorithm is presented with stability guarantee. This algorithm employs iterative control policies for system control during the computation of the optimal control policy, as opposed to waiting for the generation of the optimal control policy before implementing control. Eventually, two simulation experiments are conducted with real-world physical backgrounds, in order to illustrate the performance of the proposed strategy.

Keyword ：

Admissible control policy; Optimal control; Adaptive dynamic programming; Adaptive critic designs; Evolving control; Generalized policy iteration

An advanced robust integral reinforcement learning scheme with the fuzzy inference system SCIE

Journal article | 2024, 34 (17), 11745-11759 | INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL

Liu, Ao | Wang, Ding | Qiao, Junfei

Abstract ：

In this paper, the model-free robust control problem is investigated for nonlinear systems with a relaxed condition of initial admissible control. An advanced integral reinforcement learning method is developed, which merges the adaptive network-based fuzzy inference system (ANFIS) and pre-training of the initial weights. To relax the condition for choosing the initial control law, pre-training of initial weights is established by utilizing the ANFIS to provide the information corresponding to the system model, which is applicable to the model-free issue. Based on the actor-critic structure, the approximate optimal control law is obtained by employing adaptive dynamic programming. By redesigning the obtained control law, the robust controller can be derived to stabilize the system with the uncertain term. Finally, two examples are utilized to verify the effectiveness of the constructed algorithm.

Keyword ：

adaptive dynamic programming; robust control; integral reinforcement learning; adaptive network-based fuzzy inference systems

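Data-Driven Policy Optimization for Stochastic Systems Involving Adaptive Critic (融合自适应评判的随机系统数据驱动策略优化)

Journal article | 2024, 50 (5), 980-990 | Acta Automatica Sinica (自动化学报)

Wang, Ding (王鼎) | Wang, Jiangyu (王将宇) | Qiao, Junfei (乔俊飞)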

Abstract ：

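Adaptive critic techniques have been widely applied to optimal control problems for complex nonlinear systems, but they still have limitations when used to solve the infinite-horizon optimal control problem of discrete-time nonlinear stochastic systems. By incorporating adaptive critic techniques, this paper establishes a data-driven discounted optimal regulation method for discrete-time stochastic systems. First, for nonlinear stochastic systems under relaxed assumptions, the infinite-horizon optimal control problem with a discount factor is studied. The proposed stochastic-system Q-learning algorithm optimizes an initial admissible policy to the optimal policy in a monotonically nonincreasing manner. Following the data-driven idea, the stochastic-system Q-learning algorithm performs policy optimization directly from data without building a model. Second, the stochastic-system Q-learning algorithm is implemented with an actor-critic neural network scheme. Finally, two benchmark systems are used to verify the effectiveness of the proposed stochastic-system Q-learning algorithm.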

Keyword ：

Q-learning; stochastic optimal control; discrete-time systems; data-driven; adaptive critic designs; neural networks

Evolution-Guided Adaptive Dynamic Programming for Nonlinear Optimal Control SCIE

Journal article | 2024, 54 (10), 6043-6054 | IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS

Wang, Ding | Huang, Haiming | Liu, Derong | Zhao, Mingming | Qiao, Junfei

WoS CC Cited Count： 1

Abstract ：

In this article, an evolution-guided adaptive dynamic programming (EGADP) algorithm is developed to address the optimal regulation problems for nonlinear systems. In traditional adaptive dynamic programming algorithms, policy improvement typically relies on gradient information, according to the first-order necessary condition. However, these methods encounter limitations when calculating the gradient information becomes infeasible or the system dynamics are not differentiable. In response to this challenge, EGADP harnesses evolutionary computation to search for a superior policy during policy improvement. Therefore, compared with the traditional methods, EGADP can effectively handle scenarios in which gradient information is unavailable. Additionally, the convergence of the algorithm is proven to enhance the rigor of the developed method. Finally, three simulation experiments with realistic physical backgrounds are conducted to comprehensively demonstrate the effectiveness of the established method from different perspectives.

Keyword ：

intelligent control; adaptive dynamic programming (ADP); optimal control; evolutionary computation (EC); reinforcement learning (RL); Adaptive critic designs
