Some recent visual-based relocalization algorithms rely on deep learning methods to perform camera pose regression from image data. This letter focuses on the loss functions that embed the error between… Click to show full abstract
Some recent visual-based relocalization algorithms rely on deep learning methods to perform camera pose regression from image data. This letter focuses on the loss functions that embed the error between two poses to perform deep learning based camera pose regression. Existing loss functions are either difficult-to-tune multi-objective functions or present unstable reprojection errors that rely on ground truth 3D scene points and require a two-step training. To deal with these issues, we introduce a novel loss function which is based on a multiplane homography integration. This new function does not require prior initialization and only depends on physically interpretable hyperparameters. Furthermore, the experiments carried out on well established relocalization datasets show that it minimizes best the mean square reprojection error during training when compared with existing loss functions.
               
Click one of the above tabs to view related content.