Chinese Optics Letters, Volume. 23, Issue 5, 051102(2025)

Tactile-assisted point cloud super-resolution

Haoran Shen1,2, Puzheng Wang1,2, Ming Lu1,2, Chi Zhang1,2, Jian Li1,2、**, and Qin Wang1,2、*
Author Affiliations
  • 1Institute of Quantum Information and Technology, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
  • 2Broadband Wireless Communication and Sensor Network Technology, Key Lab of Ministry of Education, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
  • show less
    Figures & Tables(12)
    An illustration of the tactile-assisted framework. Given sparse point cloud P with N points and touch point cloud T with N points, the feature extraction block extracts feature maps Fp and Ft from the input, then feeds them into the feature fusion block, where Ft and Fp are merged to produce the fused feature map Ff. Next, the transformer encoder consumes both P and the fused feature Ff to refine the feature map, then a high-resolution point cloud Q is obtained through coordinate reconstruction.
    The architecture of the feature extraction block (FE block).
    The architecture of the feature fusion block (FF block). In this module, we iteratively fuse tactile features Ft into visual features Fp, ultimately obtaining the fused features Ff. Specifically, during the first fusion of tactile features, the input features are the initial point cloud features Fp.
    An object from the TSR-PD, where (a) represents the high-resolution point cloud (GT), (b) corresponds to the low-resolution point cloud (blue) and tactile information (red) for 5 touches, and (c) depicts the point cloud from one tactile interaction.
    Comparing point set upsampling (16×) results from sparse inputs with and without tactile information using 512 input points. Among them are (a) joint, (b) arch, and (c) lamp post. The first row is the input low-resolution point cloud, the second row is the reconstructed point cloud without tactile information, the third row is the reconstructed point cloud with tactile information, and the fourth row is GT.
    Visualization results of different algorithms for upsampling on the same objects (a). We show the 16× upsampled results of (b) input point clouds (512 points) when processed by different upsampling methods: (c) PU-GCN[28], (d) Grad-PU[43], (e) PU-Transformer[29], and (f) TAPSR.
    • Table 1. Feature Fusion Pipeline

      View table
      View in Article

      Table 1. Feature Fusion Pipeline

      Require: a low-resolution point cloud feature fp touch point cloud features fti, i={1,2,…,α}
      Ensure: a fused feature ff
      1: fori{1,2,…,α}do
      2:  ifi==1then
      3:   fi=fp
      4:  end if
      5:  fi+1=FeaFus(fp,fti,fi)
      6: end for
      7: ff=fi+1
      8: returnff
    • Table 1. Quantitative Comparisons for Different Numbers of Tactile Iterations by Our Methoda

      View table
      View in Article

      Table 1. Quantitative Comparisons for Different Numbers of Tactile Iterations by Our Methoda

      Number of touches012345
      CD1.1620.9530.7910.7160.6710.778
      HD3.7243.4843.3133.3123.2913.314
      EMD5.4215.0795.0645.0995.0995.092
    • Table 2. Quantitative Comparisons to Other Methods on the TSR-PDa

      View table
      View in Article

      Table 2. Quantitative Comparisons to Other Methods on the TSR-PDa

       CDHDEMD
      PU-GAN[27]4.6349.21913.672
      PU-GCN[28]3.0098.75110.576
      Grad-PU[43]2.4646.3089.582
      PU-Transformer[29]1.1623.7245.421
      Ours (Number of touch = 4)0.6713.2915.099
    • Table 3. Quantitative Comparisons Under Different Upsampling Rates Between the State-of-the-Art Work and Our Present Worka

      View table
      View in Article

      Table 3. Quantitative Comparisons Under Different Upsampling Rates Between the State-of-the-Art Work and Our Present Worka

      RatePU-Transformer[29]Ours
      1.0960.832
      16×1.1620.671
      32×1.1840.895
    • Table 4. Comparing the Upsampling Performance of Our Full Pipeline with Various Cases in the Ablation Study (r = 16)a

      View table
      View in Article

      Table 4. Comparing the Upsampling Performance of Our Full Pipeline with Various Cases in the Ablation Study (r = 16)a

      FE blockFF blockNumber of touches
      012345
      ××1.1621.2421.2711.2201.2531.256
      ×1.0180.8581.1231.0941.117
      0.9530.7910.7160.6710.778
    • Table 5. Training Speed and Inference Time for the Model With Different Numbers of Touches (r = 16)

      View table
      View in Article

      Table 5. Training Speed and Inference Time for the Model With Different Numbers of Touches (r = 16)

      Number of touchesTraining speed (per epoch)Inference time (per sample)
      076.68 s15.8 ms
      178.35 s23.9 ms
      279.39 s24.0 ms
      380.35 s24.1 ms
      482.18 s24.1 ms
      583.45 s24.2 ms
    Tools

    Get Citation

    Copy Citation Text

    Haoran Shen, Puzheng Wang, Ming Lu, Chi Zhang, Jian Li, Qin Wang, "Tactile-assisted point cloud super-resolution," Chin. Opt. Lett. 23, 051102 (2025)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Imaging Systems and Image Processing

    Received: Jul. 5, 2024

    Accepted: Nov. 14, 2024

    Published Online: May. 14, 2025

    The Author Email: Jian Li (jianli@njupt.edu.cn), Qin Wang (qinw@njupt.edu.cn)

    DOI:10.3788/COL202523.051102

    CSTR:32184.14.COL202523.051102

    Topics