TY - GEN
T1 - Shape Descriptor Guided Learning for Category-Level Object Pose Estimation
AU - Liu, Yun
AU - Wang, Weiming
AU - Wang, Fu Lee
AU - Xie, Haoran
AU - Chen, Honghua
AU - Wei, Mingqiang
AU - Qin, Jing
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - Category-level object pose estimation plays a crucial role in a wide range of practical applications by accurately predicting the poses and sizes of unseen objects within a specific category. However, accurately estimating object poses remains a significant challenge due to substantial shape variations within the same category. To address this issue, this paper introduces a novel learning network for object pose estimation that is guided by a shape descriptor. By capturing the geometric information of an object’s shape, the shape descriptor provides valuable input for subsequent feature learning, effectively handling shape variations. Moreover, our framework incorporates a confidence-based pose estimator, which assigns confidence scores to each pose prediction. This integration allows for the acquisition of more accurate poses with higher confidence by penalizing poses with low confidence. Experimental results on the CAMERA25 and REAL275 datasets demonstrate the superiority of our approach over state-of-the-art methods.
AB - Category-level object pose estimation plays a crucial role in a wide range of practical applications by accurately predicting the poses and sizes of unseen objects within a specific category. However, accurately estimating object poses remains a significant challenge due to substantial shape variations within the same category. To address this issue, this paper introduces a novel learning network for object pose estimation that is guided by a shape descriptor. By capturing the geometric information of an object’s shape, the shape descriptor provides valuable input for subsequent feature learning, effectively handling shape variations. Moreover, our framework incorporates a confidence-based pose estimator, which assigns confidence scores to each pose prediction. This integration allows for the acquisition of more accurate poses with higher confidence by penalizing poses with low confidence. Experimental results on the CAMERA25 and REAL275 datasets demonstrate the superiority of our approach over state-of-the-art methods.
KW - Category-level
KW - Object pose estimation
KW - Shape descriptor
UR - http://www.scopus.com/inward/record.url?scp=86000474562&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-82024-3_4
DO - 10.1007/978-3-031-82024-3_4
M3 - Conference contribution
AN - SCOPUS:86000474562
SN - 9783031820236
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 42
EP - 54
BT - Advances in Computer Graphics - 41st Computer Graphics International Conference, CGI 2024, Proceedings
A2 - Magnenat-Thalmann, Nadia
A2 - Kim, Jinman
A2 - Sheng, Bin
A2 - Deng, Zhigang
A2 - Thalmann, Daniel
A2 - Li, Ping
T2 - 41st Computer Graphics International Conference, CGI 2024
Y2 - 1 July 2024 through 5 July 2024
ER -