With the development of learning-based autonomous underwater exploration method of robotic fish, how to improve data quality and sampling efficiency, so as to achieve better control performance becomes a challenging… Click to show full abstract
With the development of learning-based autonomous underwater exploration method of robotic fish, how to improve data quality and sampling efficiency, so as to achieve better control performance becomes a challenging research subject. To address this issue, a feasible real-world deep reinforcement learning framework for autonomous underwater exploration of robotic fish is proposed, which ably avoids model discrepancy generated in virtual training. The designed framework consists of three phases: teaching initialization, regular update of reinforcement learning, and phased consolidation training. Specially, reasonable teaching initialization improves the data sampling efficiency and stabilizes the real-world early training. The consolidation training ensures the reproducibility of good controllers by interim imitation learning in the middle training phase. Extensive underwater experiments on a novel self-developed biomimetic robotic shark show that the proposed real-world learning method significantly improves the safety and efficiency of autonomous exploration based on local sensor information, providing a promising solution for exploring in uncharted wild waters.
               
Click one of the above tabs to view related content.