Photo by garri from unsplash
Sign Up to like & get
recommendations!
0
Published in 2020 at "Finance and Stochastics"
DOI: 10.1007/s00780-020-00443-2
Abstract: We study Markov decision processes with Borel state spaces under quasi-hyperbolic discounting. This type of discounting nicely models human behaviour, which is time-inconsistent in the long run. The decision maker has preferences changing in time.…
read more here.
Keywords:
decision;
markov decision;
decision processes;
hyperbolic discounting ... See more keywords
Sign Up to like & get
recommendations!
0
Published in 2021 at "Dynamic Games and Applications"
DOI: 10.1007/s13235-021-00391-2
Abstract: In this paper, we study two-person zero-sum stochastic games for controlled continuous time Markov decision processes with risk-sensitive discounted cost criterion. The transition and cost rates are possibly unbounded. For the zero-sum stochastic game, we…
read more here.
Keywords:
markov decision;
zero sum;
continuous time;
sum ... See more keywords
Photo from wikipedia
Sign Up to like & get
recommendations!
0
Published in 2017 at "Neurocomputing"
DOI: 10.1016/j.neucom.2017.02.020
Abstract: In the field of text mining, topic modeling and detection are fundamental problems in public opinion monitoring, information retrieval, social media analysis, and other activities. Document clustering has been used for topic detection at the…
read more here.
Keywords:
detection;
using markov;
markov decision;
decision processes ... See more keywords
Photo by garri from unsplash
Sign Up to like & get
recommendations!
0
Published in 2018 at "Advances in Applied Probability"
DOI: 10.1017/apr.2018.36
Abstract: Abstract In this paper we investigate risk-sensitive semi-Markov decision processes with a Borel state space, unbounded cost rates, and general utility functions. The performance criteria are several expected utilities of the total cost in a…
read more here.
Keywords:
sensitive semi;
risk sensitive;
markov decision;
decision processes ... See more keywords
Photo by garri from unsplash
Sign Up to like & get
recommendations!
0
Published in 2019 at "IEEE Control Systems"
DOI: 10.1109/mcs.2019.2913493
Abstract: Optimal decision making under uncertainty is of increasing importance in artificial intelligence, machine learning, signal processing, and control. Partially observed Markov decision processes (POMDPs) are a significant paradigm in real-world sequential decision making. The framework…
read more here.
Keywords:
markov decision;
partially observed;
observed markov;
controlled sensing ... See more keywords
Photo from wikipedia
Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE Transactions on Automatic Control"
DOI: 10.1109/tac.2022.3163085
Abstract: Reinforcement learning aims to find policies that maximize an expected cumulative reward in Markov decision processes with unknown transition probabilities. Policy gradient (PG)-algorithms use stochastic gradients of the value function to update the policy. A…
read more here.
Keywords:
markov decision;
decision processes;
policy gradient;
policy ... See more keywords
Photo by garri from unsplash
Sign Up to like & get
recommendations!
0
Published in 2021 at "IEEE Transactions on Cognitive and Developmental Systems"
DOI: 10.1109/tcds.2019.2948025
Abstract: The multiobjective Markov decision processes (MOMDPs) are sequential decision-making problems that involve multiple conflicting reward functions that cannot be optimized simultaneously without a compromise. This type of problem cannot be solved by a single optimal…
read more here.
Keywords:
markov decision;
intrinsically motivated;
multiobjective markov;
policy ... See more keywords
Sign Up to like & get
recommendations!
1
Published in 2022 at "Frontiers in Neurorobotics"
DOI: 10.3389/fnbot.2022.1012427
Abstract: The atypical Markov decision processes (MDPs) are decision-making for maximizing the immediate returns in only one state transition. Many complex dynamic problems can be regarded as the atypical MDPs, e.g., football trajectory control, approximations of…
read more here.
Keywords:
decision processes;
reinforcement learning;
markov decision;
atypical mdps ... See more keywords