Articles with "proximal policy" as a keyword

Optimal Policy Characterization Enhanced Proximal Policy Optimization for Multitask Scheduling in Cloud Computing

Published in 2022 at "IEEE Internet of Things Journal"

DOI: 10.1109/jiot.2021.3111414

Abstract: For a serving system with multiple servers and a public queue, we study the scheduling of multiple tasks with deadlines, under random task arrivals and renewable energy generation. To minimize the weighted sum of the…

Keywords: policy optimization; proximal policy; policy; optimal policy ...

Mobile Communications, Computing, and Caching Resources Allocation for Diverse Services via Multi-Objective Proximal Policy Optimization

Published in 2022 at "IEEE Transactions on Communications"

DOI: 10.1109/tcomm.2022.3173005

Abstract: Mobile services are becoming more diverse, making them have different demands on communications, computing, and caching (3C) resources in mobile systems. Unlike the traditional work that considers only one type of service, this paper designs…

Keywords: communications computing; proximal policy; computing caching; diverse services ...

Anti-Martingale Proximal Policy Optimization

Published in 2022 at "IEEE Transactions on Cybernetics"

DOI: 10.1109/tcyb.2022.3170355

Abstract: Since the sample data after one exploration process can only be used to update network parameters once in on-policy deep reinforcement learning (DRL), a high sample efficiency is necessary to accelerate the training process of…

Keywords: policy optimization; martingale proximal; proximal policy; policy ...

PPOAccel: A High-Throughput Acceleration Framework for Proximal Policy Optimization

Published in 2022 at "IEEE Transactions on Parallel and Distributed Systems"

DOI: 10.1109/tpds.2021.3134709

Abstract: Reinforcement Learning (RL) is a major branch of AI that enables agents to learn optimal decision making via interaction with the environment. Proximal Policy Optimization (PPO) is the state-of-the-art policy optimization based RL algorithm which…

Keywords: policy optimization; high throughput; proximal policy; policy ...

Learning to Drive in the NGSIM Simulator Using Proximal Policy Optimization

Published in 2023 at "Journal of Advanced Transportation"

DOI: 10.1155/2023/4127486

Abstract: As a popular research field, autonomous driving may offer great benefits for human society. To achieve that, current studies often applied machine learning methods like reinforcement learning to enable an agent to interact and learn…

Keywords: ngsim simulator; policy optimization; using proximal; proximal policy ...

Autonomous Driving Decision Control Based on Improved Proximal Policy Optimization Algorithm

Published in 2023 at "Applied Sciences"

DOI: 10.3390/app13116400

Abstract: The decision-making control of autonomous driving in complex urban road environments is a difficult problem in the research of autonomous driving. In order to solve the problem of high dimensional state space and sparse reward…

Keywords: proximal policy; control; driving decision; decision control ...

Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment

Published in 2022 at "Entropy"

DOI: 10.3390/e24040440

Abstract: In the field of reinforcement learning, we propose a Correct Proximal Policy Optimization (CPPO) algorithm based on the modified penalty factor β and relative entropy in order to solve the robustness and stationarity of traditional…

Keywords: policy optimization; proximal policy; relative entropy; entropy ...