Sign Up to like & get
recommendations!
0
Published in 2021 at "Applied Intelligence"
DOI: 10.1007/s10489-021-02374-7
Abstract: In the paper, we propose a variant of Variational Autoencoder (VAE) for sequence generation task, called SeqVAE, which is a combination of recurrent VAE and policy gradient in reinforcement learning. The goal of SeqVAE is…
read more here.
Keywords:
policy gradient;
policy;
seqvae;
variational autoencoder ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Access"
DOI: 10.1109/access.2021.3137821
Abstract: Reinforcement learning can achieve excellent performance in the field of robotic grasping if the grasping target is stable. However, during applications in the real world, robot needs to overcome the effects of a complex working…
read more here.
Keywords:
deep attentive;
deterministic policy;
policy gradient;
attentive deterministic ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Access"
DOI: 10.1109/access.2022.3202918
Abstract: Autonomous maneuvering decisions of unmanned aerial vehicle (UAV) in short-range air combat remain a challenging research topic, and a decision method based on an improved deep deterministic policy gradient (DDPG) is proposed. First, the problem…
read more here.
Keywords:
improved deep;
deep deterministic;
based improved;
deterministic policy ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Internet of Things Journal"
DOI: 10.1109/jiot.2021.3135977
Abstract: In this article, we consider a parking lot that manages the charging processes of its parked electric vehicles (EVs). Upon arrival, each EV requests a certain amount of energy. This request should be fulfilled before…
read more here.
Keywords:
charging rates;
deep policy;
policy gradient;
policy ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Control Systems Letters"
DOI: 10.1109/lcsys.2022.3188180
Abstract: In spite of the lack of convexity, convergence and sample complexity properties were recently established for the random search method applied to the linear quadratic regulator (LQR) problem. Since policy gradient approaches require an initial…
read more here.
Keywords:
feedback gains;
policy gradient;
policy;
method ... See more keywords
Sign Up to like & get
recommendations!
2
Published in 2023 at "IEEE Control Systems Letters"
DOI: 10.1109/lcsys.2023.3271594
Abstract: We revisit in this letter the discrete-time linear quadratic regulator (LQR) problem from the perspective of receding-horizon policy gradient (RHPG), a newly developed model-free learning framework for control applications. We provide a fine-grained sample complexity…
read more here.
Keywords:
horizon policy;
receding horizon;
control;
policy gradient ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2024 at "IEEE Control Systems Letters"
DOI: 10.1109/lcsys.2024.3513896
Abstract: This letter studies the synthesis of an active perception policy that maximizes the information leakage of the initial state in a stochastic system modeled as a hidden Markov model (HMM). Specifically, the emission function of…
read more here.
Keywords:
policy gradient;
policy;
perception;
initial state ... See more keywords
Photo from wikipedia
Sign Up to like & get
recommendations!
0
Published in 2021 at "IEEE Transactions on Automatic Control"
DOI: 10.1109/tac.2020.3037046
Abstract: The linear quadratic regulator (LQR) problem has reemerged as an important theoretical benchmark for reinforcement learning-based control of complex dynamical systems with continuous state and action spaces. In contrast with nearly all recent work in…
read more here.
Keywords:
optimal controllers;
policy;
learning optimal;
multiplicative noise ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Transactions on Automatic Control"
DOI: 10.1109/tac.2022.3163085
Abstract: Reinforcement learning aims to find policies that maximize an expected cumulative reward in Markov decision processes with unknown transition probabilities. Policy gradient (PG)-algorithms use stochastic gradients of the value function to update the policy. A…
read more here.
Keywords:
markov decision;
decision processes;
policy gradient;
policy ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2024 at "IEEE Transactions on Automatic Control"
DOI: 10.1109/tac.2025.3543128
Abstract: Learning policies in an asynchronous parallel way is essential to numerous successes of reinforcement learning for solving complex problems. However, their convergence has not been rigorously evaluated. To improve the theoretical understanding, we adopt the…
read more here.
Keywords:
asynchronous parallel;
policy gradient;
policy;
linear quadratic ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2025 at "IEEE Transactions on Computational Social Systems"
DOI: 10.1109/tcss.2025.3540263
Abstract: To reduce resource consumption and environmental impact, the manufacturing industry increasingly leans towards repurposing, repairing, or updating products. In a multifactory environment, considering the disassembly line balancing problem helps enterprises improve production efficiency and reduce…
read more here.
Keywords:
deterministic policy;
policy gradient;
heterogeneous multifactory;
problem ... See more keywords