Skip to main content

Research Repository

Advanced Search

E2EK: End-to-end regression network based on keypoint for 6d pose estimation

Lin, Shifeng; Wang, Zunran; Ling, Yonggen; Tao, Yidan; Yang, Chenguang

E2EK: End-to-end regression network based on keypoint for 6d pose estimation Thumbnail


Authors

Shifeng Lin

Zunran Wang

Yonggen Ling

Yidan Tao



Abstract

The methods based on deep learning are the mainstream of 6D object pose estimation, which mainly include direct regression and two-stage pipelines. The former are keen by many scholars at first due to their simplicity and differentiability to poses, but they usually lack in accuracy when compared with the latter that estimate the intermediate variables relating to geometries such as object keypoints or 2D-3D correspondence before PnP/RANSAC algorithm. However, the loss function of the two-stage method is non-differentiable to the 6D pose, which is hard to apply in the tasks requiring the differentiable poses. To overcome the disadvantages of the above methods, we propose an end-to-end regression network based on keypoints for 6D pose estimation. Specifically, we supervise the point-wise keypoint offsets that help the network to learn the geometric information and directly regress the 6D pose through aggregating keypoints to achieve differentiability to the pose. Furthermore, we improve the sampling method by sampling points around objects that benefits the small object and design a unit loss function that helps the learning of the keypoints. Experimental results show that our approach outperforms most methods on LM, LM-O and YCB-V datasets.

Journal Article Type Article
Acceptance Date Apr 20, 2022
Publication Date 2022-06
Deposit Date Jun 18, 2022
Publicly Available Date Jun 23, 2022
Journal IEEE Robotics and Automation Letters
Print ISSN 2377-3766
Publisher Institute of Electrical and Electronics Engineers
Peer Reviewed Peer Reviewed
Volume 7
Issue 3
Pages 6526-6533
DOI https://doi.org/10.1109/LRA.2022.3174261
Keywords Deep learning for visual perception, pose estimation, RGB-D perception, Pose estimation, Feature extraction, Three-dimensional displays, Semantics, Task analysis, Robots, Point cloud compression
Public URL https://uwe-repository.worktribe.com/output/9646861
Publisher URL https://ieeexplore.ieee.org/document/9772945

Files

E2EK: End-to-end regression network based on keypoint for 6d pose estimation (964 Kb)
PDF

Licence
http://www.rioxx.net/licenses/all-rights-reserved

Publisher Licence URL
http://www.rioxx.net/licenses/all-rights-reserved

Copyright Statement
This is the author’s accepted manuscript. The final published version is available here: https://ieeexplore.ieee.org/document/9772945
© 2022 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.





You might also like



Downloadable Citations