TY - GEN
T1 - Sparse Fully Convolutional Network for Video-based Point Cloud Compression Color Enhancement
AU - Li, Zeliang
AU - Bao, Jingwei
AU - Liu, Yu
AU - Au Yeung, Siu Kei
AU - Zhu, Shuyuan
AU - Hung, Kevin
AU - Khan, Muazzam A.
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/12/16
Y1 - 2023/12/16
N2 - Dynamic point cloud enables objects or scenes to have a realistic 3D representation in motion. Storage and transmission of dynamic point cloud efficiently is an essential precondition for its application. Video-based point cloud compression (V-PCC) developed by the MPEG standardization group can achieve remarkable performance in compressing dynamic point clouds. However, it also introduces compression noise in decoded dynamic point clouds, which can significantly affect subsequent applications. In this paper, we propose a quality enhancement architecture that focuses on improving color attributes on V-PCC compressed point cloud. The architecture designs a sparse fully convolution networks using Minkowski Engine to maintain the sparsity nature of point cloud data and speed up the learning process with less memory usage. Additionally, we applied a feature extraction unit that takes into account the information across channels. Considering the influence of coordinates compression noise on our architecture and the limitation of GPU memory capacity, coordinates optimization and patch generation methods are applied to input data as a pre-processing step. To the best of our knowledge, this is the first implementation of the Minkowski Engine for enhancing color attributes of compressed point clouds in the V-PCC field. The experiment results demonstrate that the proposed architecture can improve the quality of color attributes in the reconstructed point cloud with different quantization parameters.
AB - Dynamic point cloud enables objects or scenes to have a realistic 3D representation in motion. Storage and transmission of dynamic point cloud efficiently is an essential precondition for its application. Video-based point cloud compression (V-PCC) developed by the MPEG standardization group can achieve remarkable performance in compressing dynamic point clouds. However, it also introduces compression noise in decoded dynamic point clouds, which can significantly affect subsequent applications. In this paper, we propose a quality enhancement architecture that focuses on improving color attributes on V-PCC compressed point cloud. The architecture designs a sparse fully convolution networks using Minkowski Engine to maintain the sparsity nature of point cloud data and speed up the learning process with less memory usage. Additionally, we applied a feature extraction unit that takes into account the information across channels. Considering the influence of coordinates compression noise on our architecture and the limitation of GPU memory capacity, coordinates optimization and patch generation methods are applied to input data as a pre-processing step. To the best of our knowledge, this is the first implementation of the Minkowski Engine for enhancing color attributes of compressed point clouds in the V-PCC field. The experiment results demonstrate that the proposed architecture can improve the quality of color attributes in the reconstructed point cloud with different quantization parameters.
KW - Point Cloud Attributes Denoising
KW - Sparse Convolution
KW - V-PCC
UR - http://www.scopus.com/inward/record.url?scp=85191000400&partnerID=8YFLogxK
U2 - 10.1145/3639592.3639602
DO - 10.1145/3639592.3639602
M3 - Conference contribution
AN - SCOPUS:85191000400
T3 - ACM International Conference Proceeding Series
SP - 66
EP - 73
BT - AICCC 2023 - 2023 6th Artificial Intelligence and Cloud Computing Conference
T2 - 6th Artificial Intelligence and Cloud Computing Conference, AICCC 2023
Y2 - 16 December 2023 through 18 December 2023
ER -