LAUSR.org creates dashboard-style pages of related content for over 1.5 million academic articles. Sign Up to like articles & get recommendations!

Bellman residuals minimization using online support vector machines

Photo by finleydesign from unsplash

In this paper we present and theoretically study an Approximate Policy Iteration (API) method called API − BRM???? using a very effective implementation of incremental Support Vector Regression (SVR) to… Click to show full abstract

In this paper we present and theoretically study an Approximate Policy Iteration (API) method called API − BRM???? using a very effective implementation of incremental Support Vector Regression (SVR) to approximate the value function able to generalize Reinforcement Learning (RL) problems with continuous (or large) state space. API − BRM???? is presented as a non-parametric regularization method based on an outcome of the Bellman Residual Minimization (BRM) able to minimize the variance of the problem. The proposed method can be cast as incremental and may be applied to the on-line agent interaction framework of RL. Being also based on SVR which are based on convex optimization, is able to find the global solution of the problem. API − BRM???? using SVR can be seen as a regularization problem using ????−insensitive loss. Compared to standard squared loss also used in regularization, this allows to naturally build a sparse solution for the approximation function. We extensively analyze the statistical properties of API − BRM???? founding a bound which controls the performance loss of the algorithm under some assumptions on the kernel and assuming that the collected samples are not-i.i.d. following a β−mixing process. Some experimental evidence and performance for well known RL benchmarks are also presented.

Keywords: minimization; api brm; support vector

Journal Title: Applied Intelligence
Year Published: 2017

Link to full text (if available)


Share on Social Media:                               Sign Up to like & get
recommendations!

Related content

More Information              News              Social Media              Video              Recommended



                Click one of the above tabs to view related content.