Infrared and Laser Engineering, Volume. 53, Issue 5, 20240026(2024)

Multi-modal-fusion-based 3D semantic segmentation algorithm

Qi Chao, Yandong Zhao, and Shengbo Liu
Author Affiliations
  • School of Engineering, Beijing Forestry University, Beijing 100080, China
  • show less
    References(24)

    [1] [1] Qi C R, Su H, Mo K, et al. Point: deep learning on point sets f 3D classification segmentation[C]Proceedings of the IEEE conference on computer vision pattern recognition(CVPR). 2017: 652660.

    [2] [2] Qi C R, Yi L, Su H, et al. Point++: deep hierarchical feature learning on point sets in a metric space[C]Advances in Neural Infmation Processing Systems (NIPS 2017). New Yk: Curran Assosciates, Inc, 2017: 51055114.

    [3] [3] Li Y, Bu R, Sun M, et al. PointCNN: convolution on Χtransfmed points[C]Conference Wkshop on Neural Infmation Processing Systems(NIPS), 2018: 820830.

    [4] [4] Thomas H, Qi C R, Deschaud JE, et al. KPConv: flexible defmable convolution f point clouds[C]Proceedings of the IEEECVF International Conference on Computer Vision(ICCV), 2019: 64116420.

    [5] [5] Zhou Y, Tuzel O. Voxel: endtoend learning f point cloud based 3D object detection[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition (CVPR), 2018: 44904499.

    [6] [6] Lang A H, Va S, Caesar H, et al. Pointpillars: fast encoders f object detection from point clouds[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition(CVPR), 2019: 1269712705.

    [7] X Zhu, H Zhou, T Wang, et al. Cylindrical and asymmetrical 3D convolution networks for lidar segmentation.

    [8] [8] Tang H, Liu Z, Zhao S, et al. Searching efficient 3D architectures with sparse pointvoxel convolution[C]European Conference on Computer Vision(ECCV), 2020: 685702.

    [10] [10] a S, Lang A H, Helou B, et al. Pointpainting: sequential fusion f 3d object detection[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2020: 46044612.

    [11] [11] Wang C, Ma C, Zhu M, et. al. PointAugmenting: crossmodal augmentation f 3D object detection[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition (CVPR), 2021: 1179411803.

    [12] [12] Liang M, Yang B, Wang S, et al. Deep continuous fusion f multisens 3D object detection[C]European Conference on Computer Vision(ECCV), 2018: 663678.

    [14] [14] Cheng R, Razani R, Taghavi E, et al. (AF)2S3: attentive feature fusion with adaptive feature ion f sparse semantic segmentation wk[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2021: 1254712556.

    [15] [15] He K, Zhang X. Ren S, et al. Deep residual learning f image recognition[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2016: 770778.

    [16] [16] Liu Z, Lin Y T, Cao Y, et al. Swin Transfmer: Hierarchical vision transfmer using shifted windows [C]Proceedings of the IEEECVF International Conference on Computer Vision(ICCV), 2021: 999210002.

    [17] [17] Philion J, LiftFidler S. Lift, splat, shoot: encoding images from arbitrary camera rigs by implicitly unprojecting to 3D [C]European Conference on Computer Vision(ECCV), 2020: 194210.

    [18] [18] Hu J, Shen L, Sun G. Squeezeexcitation wks[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2018: 71327141.

    [20] [20] Zhang H Y, Cisse M, Dauphin Y N, et al. Mixup: beyond empirical risk minimization [C]Proceedings of the International Conference on Learning Representations(ICLR), 2018: 113.

    [21] [21] Hong M, Choi J, Kim G. StyleMix: separating content style f enhanced data augmentation[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition (CVPR), 2021: 1486214870.

    [22] [22] Kim J H, Choo W, Ssong H. Puzzle mix: exploiting saliency local statistics f optimal mixup[C]International Conference on Machine Learning(ICML), 2020: 52755285.

    [23] [23] Yun S, Han D, Chun S, et al. CutMix: regularization strategy to train strong classifiers with localizable features[C]2019 IEEECVF International Conference on Computer Vision (ICCV), 2019: 60226031.

    [24] [24] Xu Chenfeng, Wu Bichen, Wang Zining, et. al. Squeezesegv3: spatiallyadaptive convolution f efficient pointcloud segmentation [C]European Conference on Computer Vision(ECCV), 2020: 119.

    Tools

    Get Citation

    Copy Citation Text

    Qi Chao, Yandong Zhao, Shengbo Liu. Multi-modal-fusion-based 3D semantic segmentation algorithm[J]. Infrared and Laser Engineering, 2024, 53(5): 20240026

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Jan. 16, 2024

    Accepted: --

    Published Online: Jun. 21, 2024

    The Author Email:

    DOI:10.3788/IRLA20240026

    Topics