Articles with "policy optimization" as a keyword



Optimal Policy Characterization Enhanced Proximal Policy Optimization for Multitask Scheduling in Cloud Computing

Published in 2022 at "IEEE Internet of Things Journal"

DOI: 10.1109/jiot.2021.3111414

Abstract: For a serving system with multiple servers and a public queue, we study the scheduling of multiple tasks with deadlines, under random task arrivals and renewable energy generation. To minimize the weighted sum of the…

Keywords: policy optimization; proximal policy; policy; optimal policy ...

Compactly Restrictable Metric Policy Optimization Problems

Published in 2022 at "IEEE Transactions on Automatic Control"

DOI: 10.1109/tac.2022.3217269

Abstract: We study policy optimization problems for deterministic Markov decision processes (MDPs) with metric state and action spaces, which we refer to as metric policy optimization problems (MPOPs). Our goal is to establish theoretical results on…

Keywords: metric policy; compactly restrictable; policy optimization; optimization problems ...

Anti-Martingale Proximal Policy Optimization

Published in 2022 at "IEEE Transactions on Cybernetics"

DOI: 10.1109/tcyb.2022.3170355

Abstract: Since the sample data after one exploration process can only be used to update network parameters once in on-policy deep reinforcement learning (DRL), a high sample efficiency is necessary to accelerate the training process of…

Keywords: policy optimization; martingale proximal; proximal policy; policy ...

PPOAccel: A High-Throughput Acceleration Framework for Proximal Policy Optimization

Published in 2022 at "IEEE Transactions on Parallel and Distributed Systems"

DOI: 10.1109/tpds.2021.3134709

Abstract: Reinforcement Learning (RL) is a major branch of AI that enables agents to learn optimal decision making via interaction with the environment. Proximal Policy Optimization (PPO) is the state-of-the-art policy optimization based RL algorithm which…

Keywords: policy optimization; high throughput; proximal policy; policy ...

Learning to Drive in the NGSIM Simulator Using Proximal Policy Optimization

Published in 2023 at "Journal of Advanced Transportation"

DOI: 10.1155/2023/4127486

Abstract: As a popular research field, autonomous driving may offer great benefits for human society. To achieve that, current studies often applied machine learning methods like reinforcement learning to enable an agent to interact and learn…

Keywords: ngsim simulator; policy optimization; using proximal; proximal policy ...

Intelligent TCP Congestion Control Policy Optimization

Published in 2023 at "Applied Sciences"

DOI: 10.3390/app13116644

Abstract: Network congestion control is an important means to improve network throughput and reduce data transmission delay. To further optimize the network data transmission capability, this research suggests a proximal policy optimization-based intelligent TCP congestion management…

Keywords: network; policy optimization; congestion; congestion control ...

Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment

Published in 2022 at "Entropy"

DOI: 10.3390/e24040440

Abstract: In the field of reinforcement learning, we propose a Correct Proximal Policy Optimization (CPPO) algorithm based on the modified penalty factor β and relative entropy in order to solve the robustness and stationarity of traditional…

Keywords: policy optimization; proximal policy; relative entropy; entropy ...