This paper studies the optimal control policy learning for underactuated vertical take-off and landing (VTOL) aerial vehicles subject to unknown mass and inertia matrix. A novel off-policy integral reinforcement learning… Click to show full abstract
This paper studies the optimal control policy learning for underactuated vertical take-off and landing (VTOL) aerial vehicles subject to unknown mass and inertia matrix. A novel off-policy integral reinforcement learning (IRL) scheme is presented for simultaneously unknown parameter identification and optimal trajectory tracking. In the outer loop of the VTOL vehicles, a novel off-policy IRL scheme is proposed, where the fixed control policy for data generation is chosen to be different from the iterated control policy and the feedforward term with unknown mass can be learned along with the optimal control policy. In the inner loop, a hybrid off-policy IRL algorithm is developed to tackle the optimal attitude control policy learning and inertia matrix identification under the hybrid control scheme introduced by the employed inner-outer loop control strategy. A simulation study is finally provided to demonstrate the effectiveness of the proposed algorithm.
               
Click one of the above tabs to view related content.