Point Transformer-Based Salient Object Detection Network for 3-D Measurement Point Clouds

Zeyong Wei, Baian Chen, Weiming Wang, Honghua Chen, Mingqiang Wei, Jonathan Li

Research output: Contribution to journal > Article > peer-review

4 Citations (Scopus)

Abstract

While salient object detection (SOD) on 2-D images has been extensively studied, there is very little SOD work on 3-D measurement surfaces. We propose an effective point transformer-based SOD network for 3-D measurement point clouds, termed PSOD-Net. PSOD-Net is an encoder-decoder network that takes full advantage of transformers to model the contextual information in both multiscale point- and scenewise manners. In the encoder, we develop a point context transformer (PCT) module to capture region contextual features at the point level; PCT contains two different transformers to excavate the relationship among points. In the decoder, we develop a scene context transformer (SCT) module to learn context representations at the scene level; SCT contains both upsampling-and-transformer (UT) blocks and multicontext aggregation (MCA) units to integrate the global semantic and multilevel features from the encoder into the global scene context. Experiments show clear improvements of PSOD-Net over its competitors and validate that PSOD-Net is more robust to challenging cases such as small objects, multiple objects, and objects with complex structures. Code is available at: https://github.com/ZeyongWei/PSOD-Net.
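The abstract describes transformers that model relationships among points. As a rough illustration only, the sketch below applies generic scaled dot-product self-attention to a set of per-point feature vectors using NumPy. It is not the PCT or SCT module of PSOD-Net; the function name `self_attention`, the projection matrices, and the sizes are all hypothetical choices made for this example, and the authors' actual implementation is in the linked repository.

```python
import numpy as np


def self_attention(features, w_q, w_k, w_v):
    """Generic scaled dot-product self-attention over point features.

    features: (N, C) array, one feature vector per point (assumed layout).
    w_q, w_k, w_v: (C, D) projection matrices (hypothetical, random here).
    Returns the (N, D) context features and the (N, N) attention weights.
    """
    q = features @ w_q
    k = features @ w_k
    v = features @ w_v

    # Pairwise similarity between every pair of points, scaled by sqrt(D).
    scores = q @ k.T / np.sqrt(q.shape[-1])

    # Numerically stable row-wise softmax.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)

    # Each point's output is a weighted sum of all points' value vectors.
    return weights @ v, weights


# Toy input: 128 points with 16-dimensional features (arbitrary sizes).
rng = np.random.default_rng(0)
n_points, channels = 128, 16
features = rng.standard_normal((n_points, channels))
w_q = rng.standard_normal((channels, channels)) * 0.1
w_k = rng.standard_normal((channels, channels)) * 0.1
w_v = rng.standard_normal((channels, channels)) * 0.1

context, weights = self_attention(features, w_q, w_k, w_v)
```

Each row of `weights` sums to one, so every point's context feature is a convex combination of the projected features of all points. A point-cloud network would typically restrict or augment this with local neighborhoods and positional encodings, which this sketch omits.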

Original language: English
Article number: 5701511
Pages (from-to): 1-11
Number of pages: 11
Journal: IEEE Transactions on Geoscience and Remote Sensing
Volume: 62
Publication status: Published - 2024
Externally published: Yes

Keywords

  • 3-D measurement point cloud
  • 3-D salient object detection (SOD)
  • PSOD-Net
  • point transformer
