Abstract
While salient object detection (SOD) on 2-D images has been extensively studied, there is very little SOD work on 3-D measurement surfaces. We propose an effective point transformer-based SOD network for 3-D measurement point clouds, termed PSOD-Net. PSOD-Net is an encoder-decoder network that takes full advantage of transformers to model the contextual information in both multiscale point- and scenewise manners. In the encoder, we develop a point context transformer (PCT) module to capture region contextual features at the point level; PCT contains two different transformers to excavate the relationship among points. In the decoder, we develop a scene context transformer (SCT) module to learn context representations at the scene level; SCT contains both upsampling-and-transformer (UT) blocks and multicontext aggregation (MCA) units to integrate the global semantic and multilevel features from the encoder into the global scene context. Experiments show clear improvements of PSOD-Net over its competitors and validate that PSOD-Net is more robust to challenging cases such as small objects, multiple objects, and objects with complex structures. Code is available at: https://github.com/ZeyongWei/PSOD-Net.
| Original language | English |
|---|---|
| Article number | 5701511 |
| Pages (from-to) | 1-11 |
| Number of pages | 11 |
| Journal | IEEE Transactions on Geoscience and Remote Sensing |
| Volume | 62 |
| DOIs | |
| Publication status | Published - 2024 |
Keywords
- 3-D measurement point cloud
- 3-D salient object detection (SOD)
- PSOD-Net
- point transformer
Fingerprint
Dive into the research topics of 'Point Transformer-Based Salient Object Detection Network for 3-D Measurement Point Clouds'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver