Articles with "actor critic" as a keyword



Actor-Critic-Based Optimal Adaptive Control Design for Morphing Aircraft

Published in 2020 at "IFAC-PapersOnLine"

DOI: 10.1016/j.ifacol.2020.12.1943

Abstract: An online actor-critic-based control design strategy is proposed for a variable span and sweep morphing wing aircraft considering the morphing parameters as control effectors, which makes the system non-affine in control. By adopting the…

Keywords: system; control; critic based; actor critic ...

Four actor-critic structures and algorithms for nonlinear multi-input multi-output system

Published in 2019 at "Neurocomputing"

DOI: 10.1016/j.neucom.2018.10.072

Abstract: The action-dependent heuristic approximate dynamic (ADHDP) for nonlinear multi-input multi-output (MIMO) system needs different forms to adapt to variable practical objects. Due to some inappropriate network structure or training algorithm, unsuccessful designs or undesirable…

Keywords: four actor; multi; actor critic; nonlinear multi ...

Discrete soft actor-critic with auto-encoder on vascular robotic system

Published in 2022 at "Robotica"

DOI: 10.1017/s0263574722001527

Abstract: Instrument delivery is a critical part in vascular intervention surgery. Due to the soft-body structure of instruments, the relationship between manipulation commands and instrument motion is non-linear, making instrument delivery challenging and time-consuming. Reinforcement learning…

Keywords: critic auto; discrete soft; auto encoder; vascular robotic ...

A Sample‐Efficient Actor‐Critic Algorithm for Recommendation Diversification

Published in 2020 at "Chinese Journal of Electronics"

DOI: 10.1049/cje.2019.10.004

Abstract: Diversifying recommendation results gains benefits from satisfying user's existing interests as well as exploring novel information needs. Recently proposed Monte-Carlo based reinforcement learning method suffers from sample inefficiency, large variance, and even failing to perform…

Keywords: algorithm recommendation; recommendation; actor; actor critic ...

Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes

Published in 2021 at "IEEE Access"

DOI: 10.1109/access.2021.3131772

Abstract: Most real-world problems are essentially partially observable, and the environmental model is unknown. Therefore, there is a significant need for reinforcement learning approaches to solve them, where the agent perceives the state of the environment…

Keywords: reinforcement learning; partially observable; actor critic; approach ...

Optimization of Apparel Supply Chain Using Deep Reinforcement Learning

Published in 2022 at "IEEE Access"

DOI: 10.1109/access.2022.3205720

Abstract: An effective supply chain management system is indispensable for an enterprise with a supply chain network in several aspects. Especially, organized control over the production and transportation of its products is a key success factor…

Keywords: supply; policy; optimization; actor critic ...

Distillation and Ordinary Federated Learning Actor-Critic Algorithms in Heterogeneous UAV-Aided Networks

Published in 2023 at "IEEE Access"

DOI: 10.1109/access.2023.3273123

Abstract: In recent years, there has been growing enthusiasm for employing Unmanned Aerial Vehicles (UAVs) as an innovative technology with significant potential for the next generation of wireless networks. Hence, the Quality of Service (QoS) of…

Keywords: distillation; federated learning; actor critic; heterogeneous uav ...

Satellite Communication Resource Scheduling Using a Dynamic Weight-Based Soft Actor Critic Reinforcement Learning

Published in 2024 at "IEEE Access"

DOI: 10.1109/access.2024.3438930

Abstract: One of the key challenges faced by space-based networks is how to maximize the demand for on-board resources for ground communication tasks, given the limited availability of satellite resources. For this challenge, firstly, we propose…

Keywords: soft actor; based soft; actor critic; dynamic weight ...

Evaluation of Efficient and Flexible Hardware–Software Co-Design of Advantage Actor–Critic Reinforcement Learning for Edge Deployment

Published in 2025 at "IEEE Access"

DOI: 10.1109/access.2025.3637842

Abstract: Reinforcement learning (RL) has shown remarkable success in solving sequential decision-making problems, yet deploying these algorithms on resource-constrained edge devices remains a significant challenge due to limited power and area budgets. While prior FPGA-based research…

Keywords: hardware; actor critic ...

Automatic Delineation of the 3D Left Atrium From LGE-MRI: Actor-Critic Based Detection and Semi-Supervised Segmentation

Published in 2024 at "IEEE Journal of Biomedical and Health Informatics"

DOI: 10.1109/jbhi.2024.3373127

Abstract: Accurate and automatic delineation of the left atrium (LA) is crucial for computer-aided diagnosis of atrial fibrillation-related diseases. However, effective model training typically requires a large amount of labeled data, which is time-consuming and labor-intensive.…

Keywords: delineation; left atrium; delineation left; semi supervised ...

Combining Lyapunov Optimization With Actor–Critic Networks for Privacy-Aware IIoT Computation Offloading

Published in 2024 at "IEEE Internet of Things Journal"

DOI: 10.1109/jiot.2024.3357110

Abstract: Opportunistic computation offloading is an effective way to improve the computing performance of Industrial Internet of Things (IIoT) devices. However, as more and more computing tasks are being offloaded to mobile-edge computing (MEC) servers for…

Keywords: lyapunov optimization; privacy; critic networks; actor critic ...