Laser & Optoelectronics Progress, Volume. 62, Issue 16, 1615007(2025)

Multi-Modal 3D Object Detection Algorithm Based on Kolmogorov-Arnold Network

Yanwu Ling1,2, Junmin Rao2, Yan Li2, and Fanming Li2、*
Author Affiliations
  • 1School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
  • 2Shanghai Institute of Technical Physics, Chinese Academy of Sciences, Shanghai 200083, China
  • show less
    Figures & Tables(12)
    Flow chart of SECOND network
    Flow chart of the proposed network
    Process of colored point cloud generation. (a) RGB image; (b) original point cloud; (c) colored point cloud
    KANDyVFE layer processing flow chart
    Flow chart of attention image-colored point cloud fusion
    Visualization of 3D object detection results by different methods on KITTI test set. (a) SECOND[15]; (b) Part-A2[20]; (c) PVRCNN[24]; (d) proposed method
    Visualization of 3D object detection results in BEV space of colored point clouds
    • Table 1. 3D object detection results for the car category based on the AP metric on the KITTI test set

      View table

      Table 1. 3D object detection results for the car category based on the AP metric on the KITTI test set

      MethodSpeed /HzmAP of BEV /%APBEVRIoU=0.7) /%mAP of 3D /%AP3DRIoU=0.7) /%
      EasyModerateHardEasyModerateHard
      MV3D102.880.1086.3577.5076.4563.8671.6563.0256.90
      VoxelNet144.484.4889.7184.9378.8271.3282.1565.7563.10
      F-PointNet265.983.0188.4084.1576.5573.0584.0171.2563.91
      SECOND1520.085.6890.1087.2079.7577.9787.6576.8269.45
      P-RCNN1910.085.3390.0387.5278.4380.5888.3277.9275.51
      MVXNet1713.184.6289.6585.0279.2076.0685.5174.2368.43
      Part-A22012.588.0490.3487.7586.0281.2588.6578.4376.68
      IA-SSD3883.088.5390.3488.7386.5280.5688.7679.5273.40
      SeSame3917.987.1690.7387.5383.2277.8885.2676.8671.55
      Proposed14.389.4694.4788.1285.7881.7289.8578.9376.37
    • Table 2. Detection results for the car category based on the AP metric on the nuScenes data set

      View table

      Table 2. Detection results for the car category based on the AP metric on the nuScenes data set

      MethodAP@0.5AP@1.0AP@2.0AP@4.0
      SECOND1562.4773.8776.8378.92
      PointPillars4066.6578.5883.4185.52
      Proposed73.2583.3285.3488.59
    • Table 3. Ablation experiment results for different modules on the KITTI validation set

      View table

      Table 3. Ablation experiment results for different modules on the KITTI validation set

      ModuleCar(RIoU=0.7)Cyclist(RIoU=0.5)Pedestrian(RIoU=0.5)
      EasyModerateHardEasyModerateHardEasyModerateHard
      None87.2677.1873.5672.1454.1850.3558.1953.4350.20
      M187.8277.8173.7775.7556.8653.1262.3856.2952.93
      M1+M289.7478.7976.3279.9758.3455.9264.5259.1455.06
    • Table 4. Comparison results for original point clouds and colored point clouds on the KITTI validation set

      View table

      Table 4. Comparison results for original point clouds and colored point clouds on the KITTI validation set

      Point cloudmAP of BEVAPBEVRIoU=0.7)mAP of 3DAP3DRIoU=0.7)
      EasyModerateHardEasyModerateHard
      OPC87.5291.9785.5985.0179.2387.7676.3173.62
      CPC89.4594.5188.1585.8181.6289.7478.7976.32
    • Table 5. Comparison of 2D projection results for car category on the KITTI test set based on the AP metric with other advanced methods

      View table

      Table 5. Comparison of 2D projection results for car category on the KITTI test set based on the AP metric with other advanced methods

      MethodmAP of 2DAP2DRIoU=0.7)
      EasyModerateHard
      SECOND1589.8290.8489.8188.80
      P-RCNN1991.3695.6289.3689.10
      MVXNet1793.2396.3392.9590.40
      Part-A22089.7290.6989.3989.07
      Proposed94.2898.1192.6992.03
    Tools

    Get Citation

    Copy Citation Text

    Yanwu Ling, Junmin Rao, Yan Li, Fanming Li. Multi-Modal 3D Object Detection Algorithm Based on Kolmogorov-Arnold Network[J]. Laser & Optoelectronics Progress, 2025, 62(16): 1615007

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Jan. 20, 2025

    Accepted: Mar. 14, 2025

    Published Online: Aug. 11, 2025

    The Author Email: Fanming Li (lfmjws@163.com)

    DOI:10.3788/LOP250553

    CSTR:32186.14.LOP250553

    Topics