• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Ruan, Xiaogang (Ruan, Xiaogang.) | Liu, Pengfei (Liu, Pengfei.) | Zhu, Xiaoqing (Zhu, Xiaoqing.)

Indexed by:

EI Scopus

Abstract:

Q-learning is a model-free iterative reinforcement learning algorithm that is widely used for navigating mobile robots in unstructured environments. However, the exploration and utilization of the environmental data limits the Q-learning convergence speed for mobile robot navigation. This study used the Q-learning algorithm and the fact that rodents use olfactory cues for spatial orientation and navigation to develop a Q-learning environmental cognitive strategy based on odor-reward shaping. This algorithm reduces useless exploration of the environment by improving the Q-learning action selection strategy. Environmental odor information is integrated into the algorithm with the olfactory factor used to weight the Q-learning and the odor-reward shaping in the action selection strategy. The algorithm effectiveness is evaluated in a simulation of movement in the labyrinth environment used in the Tolman mouse experiment. The results show that the Q-learning algorithm with odor-reward shaping reduces useless exploration of the environment, enhances cognitive learning of the environment, and improves the algorithm convergence speed. © 2021, Tsinghua University Press. All right reserved.

Keyword:

Electronic nose Learning systems Mammals Computer aided instruction Mobile robots Iterative methods Reinforcement learning Learning algorithms

Author Community:

  • [ 1 ] [Ruan, Xiaogang]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Ruan, Xiaogang]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing; 100124, China
  • [ 3 ] [Liu, Pengfei]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Liu, Pengfei]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing; 100124, China
  • [ 5 ] [Zhu, Xiaoqing]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 6 ] [Zhu, Xiaoqing]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing; 100124, China

Reprint Author's Address:

  • [zhu, xiaoqing]beijing key laboratory of computational intelligence and intelligent system, beijing; 100124, china;;[zhu, xiaoqing]faculty of information technology, beijing university of technology, beijing; 100124, china

Show more details

Related Keywords:

Related Article:

Source :

Journal of Tsinghua University

ISSN: 1000-0054

Year: 2021

Issue: 3

Volume: 61

Page: 254-260

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 3

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 10

Online/Total:693/10645629
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.