Articles with "online sparse" as a keyword



Photo by szolkin from unsplash

Online Sparse Temporal Difference Learning Based on Nested Optimization and Regularized Dual Averaging

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Systems, Man, and Cybernetics: Systems"

DOI: 10.1109/tsmc.2020.3043584

Abstract: In policy evaluation of reinforcement learning tasks, the temporal difference (TD) learning with value function approximation has been widely studied. However, feature representation has a decisive influence on both accuracy of value function approximation and… read more here.

Keywords: temporal difference; tex math; online sparse; inline formula ... See more keywords