Indexed by:
Abstract:
On the basis of trajectory imitation learning, aiming at the problem of the poor learning effect caused by the fluctuations of the single demonstration data, this paper presents a kind of imitation learning method based on multiple demonstrations. The multi constraint optimization idea is introduced for trajectory optimization. Multiple demonstrations data are coded by GMM separately, and the probability interval parameters are set as the optimization constraints. The intersection of the constraint conditions is to obtain the interval overlap probability. The probability of superior data is higher, and the probability of inferior data is lower. Parameter's set for multi constrained optimal trajectory are solved based on Gaussian distribution product form, and then obtain a multi constrained optimal trajectory through GMR. This method reduces the probability of bad data in the demonstration. It effectively avoids the fluctuation of the reproduction trajectory caused by the inferior demonstration data. The simulation results show the effectiveness of the method. © 2017 IEEE.
Keyword:
Reprint Author's Address:
Email:
Source :
Year: 2017
Page: 169-174
Language: English
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count: 1
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 8
Affiliated Colleges: