Articles with "policy safe" as a keyword



Photo by derstudi from unsplash

Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value At Risk

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Robotics and Automation Letters"

DOI: 10.1109/lra.2022.3184793

Abstract: This letter aims to solve a safe reinforcement learning (RL) problem with risk measure-based constraints. As risk measures, such as conditional value at risk (CVaR), focus on the tail distribution of cost signals, constraining risk… read more here.

Keywords: safe reinforcement; policy safe; risk; policy ... See more keywords