Articles with "actor critic" as a keyword




Actor-Critic-Based Optimal Adaptive Control Design for Morphing Aircraft

Published in 2020 at "IFAC-PapersOnLine"

DOI: 10.1016/j.ifacol.2020.12.1943

Abstract: An online actor-critic-based control design strategy is proposed for a variable span and sweep morphing wing aircraft considering the morphing parameters as control effectors, which makes the system non-affine in control. By adopting the…

Keywords: system; control; critic based; actor critic ...

Four actor-critic structures and algorithms for nonlinear multi-input multi-output system

Published in 2019 at "Neurocomputing"

DOI: 10.1016/j.neucom.2018.10.072

Abstract: The action-dependent heuristic approximate dynamic (ADHDP) for nonlinear multi-input multi-output (MIMO) system needs different forms to adapt to variable practical objects. Due to some inappropriate network structure or training algorithm, unsuccessful designs or undesirable…

Keywords: four actor; multi; actor critic; nonlinear multi ...

Discrete soft actor-critic with auto-encoder on vascular robotic system

Published in 2022 at "Robotica"

DOI: 10.1017/s0263574722001527

Abstract: Instrument delivery is a critical part of vascular intervention surgery. Due to the soft-body structure of instruments, the relationship between manipulation commands and instrument motion is non-linear, making instrument delivery challenging and time-consuming. Reinforcement learning…

Keywords: critic auto; discrete soft; auto encoder; vascular robotic ...

A Sample‐Efficient Actor‐Critic Algorithm for Recommendation Diversification

Published in 2020 at "Chinese Journal of Electronics"

DOI: 10.1049/cje.2019.10.004

Abstract: Diversifying recommendation results gains benefits from satisfying user's existing interests as well as exploring novel information needs. Recently proposed Monte-Carlo based reinforcement learning method suffers from sample inefficiency, large variance, and even failing to perform…

Keywords: algorithm recommendation; recommendation; actor; actor critic ...

Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes

Published in 2021 at "IEEE Access"

DOI: 10.1109/access.2021.3131772

Abstract: Most real-world problems are essentially partially observable, and the environmental model is unknown. Therefore, there is a significant need for reinforcement learning approaches to solve them, where the agent perceives the state of the environment…

Keywords: reinforcement learning; partially observable; actor critic; approach ...

Optimization of Apparel Supply Chain Using Deep Reinforcement Learning

Published in 2022 at "IEEE Access"

DOI: 10.1109/access.2022.3205720

Abstract: An effective supply chain management system is indispensable for an enterprise with a supply chain network in several aspects. Especially, organized control over the production and transportation of its products is a key success factor…

Keywords: supply; policy; optimization; actor critic ...

Distillation and Ordinary Federated Learning Actor-Critic Algorithms in Heterogeneous UAV-Aided Networks

Published in 2023 at "IEEE Access"

DOI: 10.1109/access.2023.3273123

Abstract: In recent years, there has been growing enthusiasm for employing Unmanned Aerial Vehicles (UAVs) as an innovative technology with significant potential for the next generation of wireless networks. Hence, the Quality of Service (QoS) of…

Keywords: distillation; federated learning; actor critic; heterogeneous uav ...

CACTO: Continuous Actor-Critic With Trajectory Optimization—Towards Global Optimality

Published in 2022 at "IEEE Robotics and Automation Letters"

DOI: 10.1109/lra.2023.3266985

Abstract: This letter presents a novel algorithm for the continuous control of dynamical systems that combines Trajectory Optimization (TO) and Reinforcement Learning (RL) in a single framework. The motivations behind this algorithm are the two main…

Keywords: cacto continuous; trajectory optimization; policy; actor critic ...

Radio Resource Management for C-V2X Using Graph Matching and Actor–Critic Learning

Published in 2022 at "IEEE Wireless Communications Letters"

DOI: 10.1109/lwc.2022.3213176

Abstract: We propose a hybrid centralized-distributed radio resource management (RRM) scheme for cellular vehicle-to-everything (C-V2X), which is to mitigate the interference caused by radio resource sharing between vehicle-to-infrastructure (V2I) links and vehicle-to-vehicle (V2V) links. Specifically, it…

Keywords: resource management; critic learning; resource; graph matching ...

Proactive Content Caching Based on Actor–Critic Reinforcement Learning for Mobile Edge Networks

Published in 2022 at "IEEE Transactions on Cognitive Communications and Networking"

DOI: 10.1109/tccn.2021.3130995

Abstract: Mobile edge caching/computing (MEC) has emerged as a promising approach for addressing the drastically increasing mobile data traffic by bringing high caching and computing capabilities to the edge of networks. Under MEC architecture, content providers…

Keywords: mobile edge; edge networks; critic reinforcement; caching ...

A Deep Reinforcement Learning Algorithm Suitable for Autonomous Vehicles: Double Bootstrapped Soft-Actor-Critic-Discrete

Published in 2021 at "IEEE Transactions on Cognitive and Developmental Systems"

DOI: 10.1109/tcds.2021.3092715

Abstract: With the rapid advancement of modern society, autonomous systems have been broadly applied in people’s daily lives. Under the guidance of this trend, autonomous vehicles have gradually become popular. However, due to some adverse factors (such…

Keywords: actor critic; critic discrete; soft actor; reinforcement learning ...