Indexed by:
Abstract:
Operant conditioning is one of the fundamental mechanisms of animal learning, which suggests that the behavior of all animals, from protists to humans, is guided by its consequences. We present a new stochastic learning automaton called a Skinner automaton that is a psychological model for formalizing the theory of operant conditioning. We identify animal operant learning with a thermodynamic process, and derive a so-called Skinner algorithm from Monte Carlo method as well as Metropolis algorithm and simulated annealing. Under certain conditions, we prove that the Skinner automaton is expedient, E >-optimal, optimal, and that the operant probabilities converge to the set of stable roots with probability of 1. The Skinner automaton enables machines to autonomously learn in an animal-like way.
Keyword:
Reprint Author's Address:
Email:
Source :
SCIENCE CHINA-TECHNOLOGICAL SCIENCES
ISSN: 1674-7321
Year: 2013
Issue: 11
Volume: 56
Page: 2745-2761
4 . 6 0 0
JCR@2022
ESI Discipline: ENGINEERING;
JCR Journal Grade:2
CAS Journal Grade:3
Cited Count:
WoS CC Cited Count: 12
SCOPUS Cited Count: 17
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 3
Affiliated Colleges: