TOCIM: An improved operant conditioning model with task-oriented curiosity - Details

Author：

Li, Yufan (Li, Yufan.) | Huang, Jing (Huang, Jing.) | Lin, Chenliang (Lin, Chenliang.) | Lu, Yazhou (Lu, Yazhou.)

Indexed by：

EI Scopus

Abstract：

As　an　important　type　of　associative　learning,　operation　conditioning　and　its　mathematical　models　have　been　studied　a　lot.　The　recent　trend　is　to　introduce　intrinsic　motivation　in　operant　conditioning　to　expand　the　search　space.　However,　traditional　curiosity-based　intrinsic　motivation　models　have　a　strong　preference　for　those　states　seldomly　visited.　As　a　result,　they　intend　to　ignore　the　states　most　possibly　leading　to　target,　which　may　decrease　the　efficiency.　Aiming　to　solve　the　problem,　we　propose　a　task-oriented　curiosity　based　intrinsic　motivation　model　(TOCIM).　The　model　is　described　as　a　tuple　consisting　of　8　elements,　including　state　space　S,　action　space　A,　orientation　matrix　O,　orientation　function　V,　access　number　matrix　N,　curiosity　matrix　C,　orientation　update　mechanism　e,　and　action　selection　strategy　G.　Here,　the　intrinsic　motivation　is　measured　not　only　by　the　novelty　of　the　sates,　but　also　by　the　correlation　between　the　states　and　the　target　in　order　to　trade　off　exploration　and　exploitation　in　navigation.　Simulation　experiments　have　been　carried　out　to　testify　the　validation　of　TOCIM,　and　some　other　similar　models　have　been　compared.　The　experiment　results　show　that　our　model　has　advantage　in　training　time　of　navigation.　©　2022　ACM.

Keyword：

Economic and social effects Matrix algebra Motivation Navigation Mobile robots

Author Community：

[ 1 ] [Li, Yufan]Faculty of Information Technology, Beijing University of Technology, China
[ 2 ] [Huang, Jing]Faculty of Information Technology, Beijing University of Technology, China
[ 3 ] [Lin, Chenliang]Faculty of Information Technology, Beijing University of Technology, China
[ 4 ] [Lu, Yazhou]Faculty of Information Technology, Beijing University of Technology, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Improved Monte Carlo Localization Based on Probability Density Maps for Autonomous Mobile Robots
2023，2023 IEEE International Conference on Unmanned Systems, ICUS 2023
A Collaborative Filtering Recommendation Method Based on Differential Privacy
2017，Computer Research and Development
How non-economic motivations affect electronic word-of-mouth: Evidence from Chinese social media
2018，International Journal of Information Systems and Change Management
RETRACTED ARTICLE: The R&D internationalization of multinationals from the perspective of developing countries
2009，International Conference on Management and Service Science, MASS 2009

Source ：

Year： 2022

Page： 457-463

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 12

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to