Photo from archive.org
Sign Up to like & get
recommendations!
0
Published in 2021 at "Current Opinion in Neurobiology"
DOI: 10.1016/j.conb.2020.08.014
Abstract: In the brain, dopamine is thought to drive reward-based learning by signaling temporal difference reward prediction errors (TD errors), a 'teaching signal' used to train computers. Recent studies using optogenetic manipulations have provided multiple pieces…
read more here.
Keywords:
dopamine signals;
signals temporal;
difference errors;
temporal difference ... See more keywords
Photo from archive.org
Sign Up to like & get
recommendations!
0
Published in 2017 at "Neurocomputing"
DOI: 10.1016/j.neucom.2016.10.100
Abstract: Abstract This work describes MPQ-learning, an algorithm that approximates the set of all deterministic non-dominated policies in multi-objective Markov decision problems, where rewards are vectors and each component stands for an objective to maximize. MPQ-learning…
read more here.
Keywords:
method multi;
temporal difference;
multi objective;
mpq learning ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2020 at "Neurocomputing"
DOI: 10.1016/j.neucom.2020.02.004
Abstract: Abstract Online reinforcement learning agents are now able to process an increasing amount of data which makes their approximation and compression into value functions a more demanding task. To improve approximation, thus the learning process…
read more here.
Keywords:
replay memory;
reinforcement learning;
correlation minimizing;
memory ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2017 at "Trends in Neurosciences"
DOI: 10.1016/j.tins.2017.05.006
Abstract: The authors regret that they mischaracterized the composition of two cell populations in the ventral tegmental area (VTA) in the discussion within Box 1 of the original article. We would like to correct the following…
read more here.
Keywords:
computing temporal;
reinforcement learning;
difference values;
learning computing ... See more keywords
Photo from wikipedia
Sign Up to like & get
recommendations!
1
Published in 2019 at "Scientific Reports"
DOI: 10.1038/s41598-019-42244-4
Abstract: Temporal-difference (TD) learning models afford the neuroscientist a theory-driven roadmap in the quest for the neural mechanisms of reinforcement learning. The application of these models to understanding the role of phasic midbrain dopaminergic responses in…
read more here.
Keywords:
serial blocking;
difference learning;
neural mechanisms;
blocking effect ... See more keywords
Photo from wikipedia
Sign Up to like & get
recommendations!
1
Published in 2022 at "Bioinformatics"
DOI: 10.1093/bioinformatics/btac660
Abstract: MOTIVATION Hypothesis Generation (HG) refers to the discovery of meaningful implicit connections be-tween disjoint scientific terms, which is of great significance for drug discovery, prediction of drug side effects and precision treatment. More recently, a…
read more here.
Keywords:
term;
temporal difference;
hypothesis generation;
Sign Up to like & get
recommendations!
0
Published in 2019 at "Physical review. E"
DOI: 10.1103/physreve.99.043305
Abstract: Reinforcement learning in multiagent systems has been studied in the fields of economic game theory, artificial intelligence, and statistical physics by developing an analytical understanding of the learning dynamics (often in relation to the replicator…
read more here.
Keywords:
temporal difference;
reinforcement;
deterministic limit;
reinforcement learning ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Access"
DOI: 10.1109/access.2022.3211395
Abstract: The goal of this paper is to provide theoretical analysis and additional insights on a distributed temporal-difference (TD)-learning algorithm for the multi-agent Markov decision processes (MDPs) via saddle-point viewpoints. The (single-agent) TD-learning is a reinforcement…
read more here.
Keywords:
temporal difference;
policy temporal;
policy;
distributed policy ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Transactions on Intelligent Transportation Systems"
DOI: 10.1109/tits.2021.3096829
Abstract: This paper investigates the reliable shortest path (RSP) planning problem from the reinforcement learning perspective. Different from canonical path planning methods, which require at least the first- order statistic (mean) and second-order statistic (variance) information…
read more here.
Keywords:
problem;
temporal difference;
path;
shortest path ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Transactions on Systems, Man, and Cybernetics: Systems"
DOI: 10.1109/tsmc.2020.3043584
Abstract: In policy evaluation of reinforcement learning tasks, the temporal difference (TD) learning with value function approximation has been widely studied. However, feature representation has a decisive influence on both accuracy of value function approximation and…
read more here.
Keywords:
temporal difference;
tex math;
online sparse;
inline formula ... See more keywords