Indexed by:
Abstract:
Coherent beam combination (CBC) is an effective method to break the limiting power of a single fiber laser. The Q-learning algorithm is one of the reinforcement learning algorithms. We use the Q-learning algorithm to do phase compensation in the field of CBC. The performance difference between the Q-learning algorithm and the stochastic parallel gradient descent optimization algorithm (SPGD) is analyzed by simulating time-domain coherent synthesis. The results show that the Q-learning algorithm is easier to debug and has better stability. © 2021 Elsevier B.V.
Keyword:
Reprint Author's Address:
Email:
Source :
Optics Communications
ISSN: 0030-4018
Year: 2021
Volume: 490
2 . 4 0 0
JCR@2022
ESI Discipline: PHYSICS;
ESI HC Threshold:72
JCR Journal Grade:3
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count: 25
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 8
Affiliated Colleges: