Sign Up to like & get
recommendations!
1
Published in 2022 at "Entropy"
DOI: 10.3390/e24040440
Abstract: In the field of reinforcement learning, we propose a Correct Proximal Policy Optimization (CPPO) algorithm based on the modified penalty factor β and relative entropy in order to solve the robustness and stationarity of traditional…
read more here.
Keywords:
policy optimization;
proximal policy;
relative entropy;
entropy ... See more keywords