Published in 2020 at "IEEE Transactions on Cybernetics"
DOI: 10.1109/tcyb.2020.2983923
Abstract: Path integral policy improvement (PI²) is known to be an efficient reinforcement learning algorithm, particularly if the target system is a high-dimensional dynamical system. However, PI² and its existing extensions have adjustable parameters, on which…
Keywords:
integral policy;
path integral;
policy improvement;
improvement population ...
Published in 2022 at "IEEE Transactions on Cybernetics"
DOI: 10.1109/tcyb.2022.3192049
Abstract: Robot learning through kinesthetic teaching is a promising way of cloning human behaviors, but it has its limits in the performance of complex tasks with small amounts of data, due to compounding errors. In order…
Keywords:
natural evolution;
system;
dynamical system;
policy improvement ...
Published in 2022 at "IEEE Transactions on Neural Networks and Learning Systems"
DOI: 10.1109/tnnls.2022.3202192
Abstract: In this article, a novel coupled policy improvement mechanism is developed for improving policy iteration (PI) algorithms. In contrast to the common PI, the developed dual parallel policy iteration (DPPI) with coupled policy improvement mechanism…
Keywords:
policy iteration;
parallel policy;
policy;
policy improvement ...
Published in 2018 at "Journal of Global Oncology"
DOI: 10.1200/jgo.18.51100
Abstract: Background and context: Breast cancer takes the first place among the cancer diseases in the Kyrgyz Republic. Almost 40% of breast cancer cases are detected in the advanced III and IV stages. Specialized oncology services…
Keywords:
improvement abc;
planning policy;
national planning;
policy improvement ...