Chinese Optics Letters, Volume. 23, Issue 9, 091101(2025)

Image-free cross-species pose estimation via an ultra-low sampling rate single-pixel camera

Xin Wu1, Cheng Zhou1、*, Binyu Li2, Jipeng Huang1、**, Yanli Meng1, Lijun Song3、***, and Shensheng Han4
Author Affiliations
  • 1School of Physics, Northeast Normal University, Changchun 130024, China
  • 2Beijing Institute of Space Mechanics and Electricity, Beijing 100094, China
  • 3Changchun Institute of Technology, Changchun 130012, China
  • 4Shanghai Institute of Optics and Fine Mechanics, Chinese Academy of Sciences, Shanghai 201800, China
  • show less
    Figures & Tables(9)
    Overview of plane-array-camera-based and SPC-based pose estimation models. (a) The architecture of plane-array-camera-based (image-based) pose estimation model. (b) The architecture of SPC-based (image-free) pose estimation model that performs reconstruction before pose estimation, the two-stage architecture of image-free model with implicit reconstruction, and the architecture of our proposed image-free and (implicit) reconstruction-free model.
    Overview of the SPCPose. (a) SPC structure schematic. (b) The architecture of our proposed image-free and (implicit) reconstruction-free cross-species pose estimation model.
    Visualization of SPCPose on the public dataset. (a) Visualization of SPCPose on Tri-Mouse. (b) Visualization of SPCPose on Horse10. M1 means natural order, M2 means reverse order, and M3 means random order. The numbers on the picture mean the sampling rate.
    Visualization of SPCPose on the indoor Human dataset. The numbers on the picture mean the sampling rate.
    Real-world experimental results. (a) Schematic of our customized SPC and captured scenes. (b) Results of the same object captured with an RGB camera perform different actions in three real-world scenes. (c) Reconstructed scenes using our single-pixel detection system and pose estimation results using an image-based approach. (d) Results of the natural-order single-pixel detection value map representation and the action of SPCPose. (e) Inverse-order single-pixel detection value map representation and results from the action of SPCPose. (f) Random-order single-pixel probe-value map representation and results of the action of SPCPose.
    Effect of extraction methods on SPCPose’s performance in parsing object poses at 256 sample points. (a) Visualization of SPCPose processing results with different extraction methods. The numbers on the picture mean the sampling rate. (b) Objective evaluation of SPCPose processing performance with different extraction methods.
    • Table 1. Performance Comparison of Image-Based and Image-Free Pose Estimation Regarding AP, PCK, AUC, EPE, Params, and GFLOPs on Tri-Mouse[6,51]a

      View table
      View in Article

      Table 1. Performance Comparison of Image-Based and Image-Free Pose Estimation Regarding AP, PCK, AUC, EPE, Params, and GFLOPs on Tri-Mouse[6,51]a

      MethodSample rateBackboneAP@50-95PCK@0.05AUCEPEParams (M)GFLOPs
      Heatmap[52]1.000HRNet-W3299.999.891.41.9128.510.2
      1.000HRNet-W48100.099.892.21.6763.621.0
      RLE[48]1.000ResNet5098.495.886.43.4323.75.4
      1.000ResNet10199.397.487.73.0142.710.2
      1.000ResNet15298.996.487.53.0858.315.1
      SPCPose-M15.333 × 10−2ViT-S99.995.991.41.9024.35.9
      1.333 × 10−2ViT-S96.599.988.53.5624.35.9
      3.333 × 10−3ViT-S53.054.854.525.3924.35.9
      8.333 × 10−4ViT-S78.479.777.614.5924.35.9
      SPCPose-M21.333 × 10−2ViT-S100.0100.093.91.1324.35.9
      3.333 × 10−3ViT-S100.0100.092.81.4924.35.9
      8.333 × 10−4ViT-S100.0100.093.41.3024.35.9
      SPCPose-M31.333 × 10−2ViT-S100.099.791.51.8724.35.9
      3.333 × 10−3ViT-S100.099.891.31.9324.35.9
      8.333 × 10−4ViT-S100.0100.093.41.2824.35.9
    • Table 2. Performance Comparison of Image-Based and Image-Free Pose Estimation Regarding AP, PCK, AUC, EPE, Params, and GFLOPs on Horse10[51]a

      View table
      View in Article

      Table 2. Performance Comparison of Image-Based and Image-Free Pose Estimation Regarding AP, PCK, AUC, EPE, Params, and GFLOPs on Horse10[51]a

      MethodSample rateBackboneAP@50-95PCK@0.05AUCEPEParams (M)GFLOPs
      Heatmap[52]1.000HRNet-W3298.399.993.61.1428.510.2
      1.000HRNet-W4898.399.993.71.1063.621.0
      RLE[48]1.000ResNet5095.199.693.81.4423.75.4
      1.000ResNet10196.199.793.91.3642.710.2
      1.000ResNet15296.599.794.01.3058.315.1
      SPCPose-M13.512 × 10−1ViT-S87.694.088.12.9624.35.9
      8.779 × 10−2ViT-S76.086.683.04.8524.35.9
      2.195 × 10−2ViT-S67.481.580.25.8324.35.9
      5.487 × 10−3ViT-S72.683.281.05.6524.35.9
      SPCPose-M28.779 × 10−2ViT-S92.797.791.31.8724.35.9
      2.195 × 10−2ViT-S89.494.988.52.8324.35.9
      5.487 × 10−3ViT-S83.390.385.04.1024.35.9
      SPCPose-M38.779 × 10−2ViT-S94.798.193.41.5224.35.9
      2.195 × 10−2ViT-S91.196.590.32.2024.35.9
      5.487 × 10−3ViT-S89.695.889.62.4524.35.9
    • Table 3. Performance of Image-Free Pose Estimation on Our Self-Created Human Dataset with Different Extraction Methods including SPCPose-M1, SPCPose-M2, and SPCPose-M3, Evaluated in Terms of AP, PCK, AUC, EPE, Params, and GFLOPs

      View table
      View in Article

      Table 3. Performance of Image-Free Pose Estimation on Our Self-Created Human Dataset with Different Extraction Methods including SPCPose-M1, SPCPose-M2, and SPCPose-M3, Evaluated in Terms of AP, PCK, AUC, EPE, Params, and GFLOPs

      MethodSample rateBackboneAP@50-95PCK@0.05AUCEPEParams (M)GFLOPs
      SPCPose-M14.006 × 10−2ViT-S58.072.956.115.9724.35.9
      1.002 × 10−2ViT-S37.255.743.024.0424.35.9
      2.504 × 10−3ViT-S27.347.737.027.9224.35.9
      6.260 × 10−4ViT-S47.364.350.119.0524.35.9
      SPCPose-M21.002 × 10−2ViT-S68.478.562.712.9624.35.9
      2.504 × 10−3ViT-S63.573.759.014.0624.35.9
      6.260 × 10−4ViT-S41.148.745.422.1824.35.9
      SPCPose-M31.002 × 10−2ViT-S70.480.163.212.5224.35.9
      2.504 × 10−3ViT-S69.279.562.612.5024.35.9
      6.260 × 10−4ViT-S56.369.754.515.8724.35.9
    Tools

    Get Citation

    Copy Citation Text

    Xin Wu, Cheng Zhou, Binyu Li, Jipeng Huang, Yanli Meng, Lijun Song, Shensheng Han, "Image-free cross-species pose estimation via an ultra-low sampling rate single-pixel camera," Chin. Opt. Lett. 23, 091101 (2025)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Imaging Systems and Image Processing

    Received: Apr. 1, 2025

    Accepted: May. 9, 2025

    Published Online: Aug. 22, 2025

    The Author Email: Cheng Zhou (zhoucheng91210@163.com), Jipeng Huang (huangjp848@nenu.edu.cn), Lijun Song (ccdxslj@126.com)

    DOI:10.3788/COL202523.091101

    CSTR:32184.14.COL202523.091101

    Topics