Photo from wikipedia
Sign Up to like & get
recommendations!
1
Published in 2021 at "IEEE Access"
DOI: 10.1109/access.2021.3051984
Abstract: Adaptive dynamic programming (ADP) is generally implemented using three neural networks: model network, action network, and critic network. In the conventional works of the value iteration ADP, the model network is initialized randomly and trained…
read more here.
Keywords:
value iteration;
time;
network;
iteration adp ... See more keywords
Sign Up to like & get
recommendations!
2
Published in 2022 at "IEEE Control Systems Letters"
DOI: 10.1109/lcsys.2020.3046373
Abstract: Partially observable Markov decision processes have been widely adopted in the automatic planning literature since it elegantly captures both execution and observation uncertainties. In our previous paper, we proposed a model called vector autoregressive partially…
read more here.
Keywords:
based value;
point based;
value;
value iteration ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2021 at "IEEE Control Systems Letters"
DOI: 10.1109/lcsys.2020.3047729
Abstract: The problem of synthesizing an optimal sensor selection policy is pertinent to a variety of engineering applications ranging from event detection to autonomous navigation. We consider such a synthesis problem in the context of linear-Gaussian…
read more here.
Keywords:
sensor selection;
value iteration;
selection;
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Transactions on Circuits and Systems II: Express Briefs"
DOI: 10.1109/tcsii.2022.3172700
Abstract: In this brief, in order to solve the discounted optimal tracking control problem for discrete-time systems with control constraints, an advanced online value iteration (VI) algorithm is developed. First, we revisit discounted general value iteration…
read more here.
Keywords:
control;
tracking control;
value iteration;
online value ... See more keywords
Photo from wikipedia
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE transactions on cybernetics"
DOI: 10.1109/tcyb.2022.3224485
Abstract: A reinforcement learning-based boundary optimal control algorithm for parabolic distributed parameter systems is developed in this article. First, a spatial Riccati-like equation and an integral optimal controller are derived in infinite-time horizon based on the…
read more here.
Keywords:
optimal control;
boundary optimal;
value;
value iteration ... See more keywords
Photo by goian from unsplash
Sign Up to like & get
recommendations!
2
Published in 2022 at "IEEE transactions on neural networks and learning systems"
DOI: 10.1109/tnnls.2022.3143527
Abstract: In this article, a novel value iteration scheme is developed with convergence and stability discussions. A relaxation factor is introduced to adjust the convergence rate of the value function sequence. The convergence conditions with respect…
read more here.
Keywords:
convergence;
value;
value iteration;
iteration scheme ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2018 at "IEEE Transactions on Systems, Man, and Cybernetics: Systems"
DOI: 10.1109/tsmc.2016.2623766
Abstract: In this paper, convergence properties are established for the newly developed discrete-time local value iteration adaptive dynamic programming (ADP) algorithm. The present local iterative ADP algorithm permits an arbitrary positive semidefinite function to initialize the…
read more here.
Keywords:
time;
value;
function;
local value ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2020 at "IEEE Transactions on Systems, Man, and Cybernetics: Systems"
DOI: 10.1109/tsmc.2019.2898389
Abstract: The optimal control problem of discrete-time nonlinear systems depends on the solution of the Bellman equation. In this paper, an adaptive reinforcement learning (RL) method is developed to solve the complex Bellman equation, which balances…
read more here.
Keywords:
discrete time;
iteration policy;
control;
iteration ... See more keywords