Continuous Viewpoint Planning in Conjunction with Dynamic Exploration for Active Object Recognition |
| |
Authors: | Haibo Sun Feng Zhu Yanzi Kong Jianyu Wang Pengfei Zhao |
| |
Institution: | 1.Faculty of Robot Science and Engineering, Northeastern University, Shenyang 110169, China; (H.S.); (J.W.);2.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China; (Y.K.); (P.Z.);3.Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang 110016, China;4.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China;5.University of Chinese Academy of Sciences, Beijing 100049, China |
| |
Abstract: | Active object recognition (AOR) aims at collecting additional information to improve recognition performance by purposefully adjusting the viewpoint of an agent. How to determine the next best viewpoint of the agent, i.e., viewpoint planning (VP), is a research focus. Most existing VP methods perform viewpoint exploration in the discrete viewpoint space, which have to sample viewpoint space and may bring in significant quantization error. To address this challenge, a continuous VP approach for AOR based on reinforcement learning is proposed. Specifically, we use two separate neural networks to model the VP policy as a parameterized Gaussian distribution and resort the proximal policy optimization framework to learn the policy. Furthermore, an adaptive entropy regularization based dynamic exploration scheme is presented to automatically adjust the viewpoint exploration ability in the learning process. To the end, experimental results on the public dataset GERMS well demonstrate the superiority of our proposed VP method. |
| |
Keywords: | active object recognition continuous viewpoint planning adaptive entropy regularization dynamic exploration proximal policy optimization |
|
|