Mask-Based and Multi-Modal Bird's Eye View Feature-Guided Point Cloud Panoptic Segmentation

Jiaming Zhang; Yuhui Peng; Gan Zhang; Baozhe Sun; Shenyang Lin

doi:10.3788/LOP242258

Laser & Optoelectronics Progress, Volume. 62, Issue 12, 1215001(2025)

Mask-Based and Multi-Modal Bird's Eye View Feature-Guided Point Cloud Panoptic Segmentation

Jiaming Zhang, Yuhui Peng^*, Gan Zhang, Baozhe Sun, and Shenyang Lin

Author Affiliations

School of Mechanical Engineering and Automation, Fuzhou University, Fuzhou 350108, Fujian , China

show less

Abstract Get PDF(in Chinese)

To improve the efficiency and accuracy of attention mask prediction for panoptic segmentation on point clouds, an end-to-end point cloud panoptic segmentation network model guided by multi-modal bird's eye view (BEV) features is proposed. First, object queries are generated from BEV features decoded by the Transformer, and feature enhancement is achieved through confidence ranking and positional encoding embedding. Second, a cross-attention mechanism module is constructed to fuse object queries with learnable query features and the query features of fused object instance information are used to improve the accuracy of attention mask prediction. Finally, the dimensionality of the input features to the masked attention mechanism network is reduced to enhance detection speed. Experimental results based on the nuScenes dataset indicate that, compared with the baseline method, BEVGuide-PS improves panoptic segmentation metrics PQ, PQ^?, RQ, and SQ by 17.7%, 17.0%, 18.3%, and 20.9%, respectively, reduces inference time by 58.4%, and significantly enhances training efficiency.

Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.

Keywords

cross attention end to end masked attention multi-modal panoptic segmentation

Tools

Get Citation

Copy Citation Text

Jiaming Zhang, Yuhui Peng, Gan Zhang, Baozhe Sun, Shenyang Lin. Mask-Based and Multi-Modal Bird's Eye View Feature-Guided Point Cloud Panoptic Segmentation[J]. Laser & Optoelectronics Progress, 2025, 62(12): 1215001

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites