TY - JOUR
T1 - Complement decoded point cloud with coordinate adjustment for video-based point cloud compression
AU - Li, Zeliang
AU - Bao, Jingwei
AU - Liu, Yu
AU - Au Yeung, Siu Kei
AU - Zhu, Shuyuan
AU - Hung, King Fai Kevin
N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
PY - 2025/1
Y1 - 2025/1
N2 - A dynamic point cloud (DPC) represents a realistic 3D scene in motion and has a wide range of applications. Compressing point clouds has become crucial for storing and transmitting such data. Video-based point cloud compression (V-PCC), developed by the Moving Picture Experts Group, achieves remarkable performance in DPC compression. However, it also introduces point reduction and coordinate distortion in the decoded DPC. In this paper, we present a 3D-based framework as a post-processing tool for the V-PCC decoder, which complements the decoded DPC and performs coordinate adjustment. In particular, we propose a neighbor-based interpolation method to recover the missing points based on the coordinates in the decoded DPC. Then, to minimize the coordinate distortion in interpolation, we design a sparse fully convolutional network, 3D Minkowski Unet, to perform coordinate adjustment. Considering the variation in data size across DPCs, we propose a cube-based patch generation method to enable the scalability of the proposed framework. The experimental results demonstrate that the proposed framework achieves strong performance in complementing reduced coordinates in both objective and subjective evaluations.
AB - A dynamic point cloud (DPC) represents a realistic 3D scene in motion and has a wide range of applications. Compressing point clouds has become crucial for storing and transmitting such data. Video-based point cloud compression (V-PCC), developed by the Moving Picture Experts Group, achieves remarkable performance in DPC compression. However, it also introduces point reduction and coordinate distortion in the decoded DPC. In this paper, we present a 3D-based framework as a post-processing tool for the V-PCC decoder, which complements the decoded DPC and performs coordinate adjustment. In particular, we propose a neighbor-based interpolation method to recover the missing points based on the coordinates in the decoded DPC. Then, to minimize the coordinate distortion in interpolation, we design a sparse fully convolutional network, 3D Minkowski Unet, to perform coordinate adjustment. Considering the variation in data size across DPCs, we propose a cube-based patch generation method to enable the scalability of the proposed framework. The experimental results demonstrate that the proposed framework achieves strong performance in complementing reduced coordinates in both objective and subjective evaluations.
KW - Geometry quality enhancement
KW - Interpolation
KW - Minkowski engine
KW - Video-based point cloud compression
UR - http://www.scopus.com/inward/record.url?scp=85211378170&partnerID=8YFLogxK
U2 - 10.1007/s11760-024-03602-6
DO - 10.1007/s11760-024-03602-6
M3 - Article
AN - SCOPUS:85211378170
SN - 1863-1703
VL - 19
JO - Signal, Image and Video Processing
JF - Signal, Image and Video Processing
IS - 1
M1 - 48
ER -