Sign Up to like & get
recommendations!
2
Published in 2023 at "IEEE Transactions on Automatic Control"
DOI: 10.1109/tac.2022.3194040
Abstract: This article provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for control-affine continuous-time nonlinear systems with uncertain drift dynamics. A model-based approximate dynamic programming (ADP) approach, which is facilitated using a…
read more here.
Keywords:
error extrapolation;
state space;
bellman error;
extrapolation ... See more keywords
Sign Up to like & get
recommendations!
2
Published in 2022 at "IEEE Transactions on Pattern Analysis and Machine Intelligence"
DOI: 10.1109/tpami.2022.3213503
Abstract: Most value function learning algorithms in reinforcement learning are based on the mean squared (projected) Bellman error. However, squared errors are known to be sensitive to outliers, both skewing the solution of the objective and…
read more here.
Keywords:
learning value;
losses learning;
bellman error;
robust losses ... See more keywords