Articles with "temporal difference" as a keyword



Photo from archive.org

Dopamine signals as temporal difference errors: recent advances

Sign Up to like & get
recommendations!
Published in 2021 at "Current Opinion in Neurobiology"

DOI: 10.1016/j.conb.2020.08.014

Abstract: In the brain, dopamine is thought to drive reward-based learning by signaling temporal difference reward prediction errors (TD errors), a 'teaching signal' used to train computers. Recent studies using optogenetic manipulations have provided multiple pieces… read more here.

Keywords: dopamine signals; signals temporal; difference errors; temporal difference ... See more keywords
Photo from archive.org

A temporal difference method for multi-objective reinforcement learning

Sign Up to like & get
recommendations!
Published in 2017 at "Neurocomputing"

DOI: 10.1016/j.neucom.2016.10.100

Abstract: Abstract This work describes MPQ-learning, an algorithm that approximates the set of all deterministic non-dominated policies in multi-objective Markov decision problems, where rewards are vectors and each component stands for an objective to maximize. MPQ-learning… read more here.

Keywords: method multi; temporal difference; multi objective; mpq learning ... See more keywords
Photo by hajjidirir from unsplash

Correlation minimizing replay memory in temporal-difference reinforcement learning

Sign Up to like & get
recommendations!
Published in 2020 at "Neurocomputing"

DOI: 10.1016/j.neucom.2020.02.004

Abstract: Abstract Online reinforcement learning agents are now able to process an increasing amount of data which makes their approximation and compression into value functions a more demanding task. To improve approximation, thus the learning process… read more here.

Keywords: replay memory; reinforcement learning; correlation minimizing; memory ... See more keywords
Photo by alinnnaaaa from unsplash

Reinforcement Learning: Computing the Temporal Difference of Values via Distinct Corticostriatal Pathways (Trends in Neurosciences 35, 457–467; 2012)

Sign Up to like & get
recommendations!
Published in 2017 at "Trends in Neurosciences"

DOI: 10.1016/j.tins.2017.05.006

Abstract: The authors regret that they mischaracterized the composition of two cell populations in the ventral tegmental area (VTA) in the discussion within Box 1 of the original article. We would like to correct the following… read more here.

Keywords: computing temporal; reinforcement learning; difference values; learning computing ... See more keywords
Photo from wikipedia

The serial blocking effect: a testbed for the neural mechanisms of temporal-difference learning

Sign Up to like & get
recommendations!
Published in 2019 at "Scientific Reports"

DOI: 10.1038/s41598-019-42244-4

Abstract: Temporal-difference (TD) learning models afford the neuroscientist a theory-driven roadmap in the quest for the neural mechanisms of reinforcement learning. The application of these models to understanding the role of phasic midbrain dopaminergic responses in… read more here.

Keywords: serial blocking; difference learning; neural mechanisms; blocking effect ... See more keywords
Photo from wikipedia

Learning temporal difference embeddings for biomedical hypothesis generation

Sign Up to like & get
recommendations!
Published in 2022 at "Bioinformatics"

DOI: 10.1093/bioinformatics/btac660

Abstract: MOTIVATION Hypothesis Generation (HG) refers to the discovery of meaningful implicit connections be-tween disjoint scientific terms, which is of great significance for drug discovery, prediction of drug side effects and precision treatment. More recently, a… read more here.

Keywords: term; temporal difference; hypothesis generation;
Photo by reganography from unsplash

Deterministic limit of temporal difference reinforcement learning for stochastic games

Sign Up to like & get
recommendations!
Published in 2019 at "Physical review. E"

DOI: 10.1103/physreve.99.043305

Abstract: Reinforcement learning in multiagent systems has been studied in the fields of economic game theory, artificial intelligence, and statistical physics by developing an analytical understanding of the learning dynamics (often in relation to the replicator… read more here.

Keywords: temporal difference; reinforcement; deterministic limit; reinforcement learning ... See more keywords
Photo by reganography from unsplash

Distributed Off-Policy Temporal Difference Learning Using Primal-Dual Method

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Access"

DOI: 10.1109/access.2022.3211395

Abstract: The goal of this paper is to provide theoretical analysis and additional insights on a distributed temporal-difference (TD)-learning algorithm for the multi-agent Markov decision processes (MDPs) via saddle-point viewpoints. The (single-agent) TD-learning is a reinforcement… read more here.

Keywords: temporal difference; policy temporal; policy; distributed policy ... See more keywords
Photo by bladeoftree from unsplash

CTD: Cascaded Temporal Difference Learning for the Mean-Standard Deviation Shortest Path Problem

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Intelligent Transportation Systems"

DOI: 10.1109/tits.2021.3096829

Abstract: This paper investigates the reliable shortest path (RSP) planning problem from the reinforcement learning perspective. Different from canonical path planning methods, which require at least the first- order statistic (mean) and second-order statistic (variance) information… read more here.

Keywords: problem; temporal difference; path; shortest path ... See more keywords
Photo by szolkin from unsplash

Online Sparse Temporal Difference Learning Based on Nested Optimization and Regularized Dual Averaging

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Systems, Man, and Cybernetics: Systems"

DOI: 10.1109/tsmc.2020.3043584

Abstract: In policy evaluation of reinforcement learning tasks, the temporal difference (TD) learning with value function approximation has been widely studied. However, feature representation has a decisive influence on both accuracy of value function approximation and… read more here.

Keywords: temporal difference; tex math; online sparse; inline formula ... See more keywords