Shifeng Lin
E2EK: End-to-end regression network based on keypoint for 6d pose estimation
Lin, Shifeng; Wang, Zunran; Ling, Yonggen; Tao, Yidan; Yang, Chenguang
Authors
Abstract
The methods based on deep learning are the mainstream of 6D object pose estimation, which mainly include direct regression and two-stage pipelines. The former are keen by many scholars at first due to their simplicity and differentiability to poses, but they usually lack in accuracy when compared with the latter that estimate the intermediate variables relating to geometries such as object keypoints or 2D-3D correspondence before PnP/RANSAC algorithm. However, the loss function of the two-stage method is non-differentiable to the 6D pose, which is hard to apply in the tasks requiring the differentiable poses. To overcome the disadvantages of the above methods, we propose an end-to-end regression network based on keypoints for 6D pose estimation. Specifically, we supervise the point-wise keypoint offsets that help the network to learn the geometric information and directly regress the 6D pose through aggregating keypoints to achieve differentiability to the pose. Furthermore, we improve the sampling method by sampling points around objects that benefits the small object and design a unit loss function that helps the learning of the keypoints. Experimental results show that our approach outperforms most methods on LM, LM-O and YCB-V datasets.
Journal Article Type | Article |
---|---|
Acceptance Date | Apr 20, 2022 |
Publication Date | 2022-06 |
Deposit Date | Jun 18, 2022 |
Publicly Available Date | Jun 23, 2022 |
Journal | IEEE Robotics and Automation Letters |
Print ISSN | 2377-3766 |
Publisher | Institute of Electrical and Electronics Engineers |
Peer Reviewed | Peer Reviewed |
Volume | 7 |
Issue | 3 |
Pages | 6526-6533 |
DOI | https://doi.org/10.1109/LRA.2022.3174261 |
Keywords | Deep learning for visual perception, pose estimation, RGB-D perception, Pose estimation, Feature extraction, Three-dimensional displays, Semantics, Task analysis, Robots, Point cloud compression |
Public URL | https://uwe-repository.worktribe.com/output/9646861 |
Publisher URL | https://ieeexplore.ieee.org/document/9772945 |
Files
E2EK: End-to-end regression network based on keypoint for 6d pose estimation
(964 Kb)
PDF
Licence
http://www.rioxx.net/licenses/all-rights-reserved
Publisher Licence URL
http://www.rioxx.net/licenses/all-rights-reserved
Copyright Statement
This is the author’s accepted manuscript. The final published version is available here: https://ieeexplore.ieee.org/document/9772945
© 2022 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
You might also like
Head-raising of snake robots based on a predefined spiral curve method
(2018)
Journal Article
Enhanced teleoperation performance using hybrid control and virtual fixture
(2019)
Journal Article
Efficient 3D object recognition via geometric information preservation
(2019)
Journal Article
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search