Abstract:
The nonlinear, time-varying, and lagging nature of the municipal solid waste incineration (MSWI) process presents challenges for ensuring controller safety. While offline reinforcement learning (RL) can ensure safety in furnace temperature (FT) control, its performance is hindered by extrapolation errors, making it unsuitable for direct application in the incineration environment. To address this, we propose a conservative Q-learning-based furnace temperature control strategy (CQL-FTC). The strategy comprises two stages: online sampling and offline training. During the online sampling stage, the agent interacts with the environment to collect samples, build an experience replay buffer (ERB), and perform pretraining. In the offline training stage, we introduce the CQL method, which adds a constraint term to the traditional Bellman equation to minimize extrapolation errors. After offline training, the agent is applied directly to FT setpoint control in the incineration process. Simulation results on a dataset from an actual MSWI process demonstrate the effectiveness of the proposed method in complex industrial environments. © 2024 IEEE.
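The constraint term the abstract refers to is, in the standard CQL formulation (Kumar et al., NeurIPS 2020), a regularizer that pushes down a soft maximum of Q over all actions while pushing up Q on actions logged in the replay buffer, which curbs extrapolation error on out-of-distribution actions. Below is a minimal PyTorch sketch of that idea, not the paper's implementation: it assumes a discrete action space for brevity, and all names and dimensions (QNet, cql_loss, cql_alpha, obs_dim) are hypothetical.

import torch
import torch.nn as nn

class QNet(nn.Module):
    """Small Q-network; architecture is illustrative, not from the paper."""
    def __init__(self, obs_dim: int, n_actions: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 128), nn.ReLU(),
            nn.Linear(128, n_actions),
        )

    def forward(self, obs):
        return self.net(obs)

def cql_loss(q_net, target_net, batch, gamma=0.99, cql_alpha=1.0):
    # batch: (obs, act, rew, next_obs, done) sampled from the offline ERB;
    # act is int64 of shape [B], done is float in {0, 1}.
    obs, act, rew, next_obs, done = batch
    q_all = q_net(obs)                                     # Q(s, .) for every action
    q_data = q_all.gather(1, act.unsqueeze(1)).squeeze(1)  # Q(s, a) on dataset actions

    # Standard Bellman error against a frozen target network.
    with torch.no_grad():
        target = rew + gamma * (1.0 - done) * target_net(next_obs).max(dim=1).values
    bellman = nn.functional.mse_loss(q_data, target)

    # CQL regularizer: logsumexp over actions is a soft maximum of Q(s, .);
    # minimizing it while keeping Q high on logged actions makes the critic
    # conservative on actions the dataset never took.
    conservative = (torch.logsumexp(q_all, dim=1) - q_data).mean()

    return bellman + cql_alpha * conservative

# Example call with random tensors (hypothetical dimensions):
q, tgt = QNet(8, 5), QNet(8, 5)
batch = (torch.randn(32, 8), torch.randint(0, 5, (32,)),
         torch.randn(32), torch.randn(32, 8), torch.zeros(32))
cql_loss(q, tgt, batch).backward()

For a continuous setpoint action, the logsumexp is typically approximated by sampling actions from the current policy and a uniform distribution, as in the original CQL paper; the weight cql_alpha trades off conservatism against Bellman accuracy.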
Year: 2024
Language: English