Sign Up to like & get
recommendations!
1
Published in 2017 at "Machine Learning"
DOI: 10.1007/s10994-017-5650-8
Abstract: In this work, we build upon the observation that offline reinforcement learning (RL) is synergistic with task hierarchies that decompose large Markov decision processes (MDPs). Task hierarchies can allow more efficient sample collection from large…
read more here.
Keywords:
offline reinforcement;
reinforcement learning;
task;
task hierarchies ... See more keywords