Policy rollout is a method for the online computation of future costs in approximate dynamic programming and has been utilized for various problems, including sensor management. In previous work, it… Click to show full abstract
Policy rollout is a method for the online computation of future costs in approximate dynamic programming and has been utilized for various problems, including sensor management. In previous work, it has predominately been applied to the selection of actions from discrete sets. In this article, we present methods for action selection from continuous sets and analyze their tradeoffs. The methods are evaluated on the problem of sensor path planning, with the intent of minimizing the time to localize an emitter using bearing measurements.
               
Click one of the above tabs to view related content.