Laser & Optoelectronics Progress, Volume 59, Issue 16, 1610004 (2022)

Human Instance Segmentation Based on Two-Stream Convolutional Neural Network

Zitong Ma and Guodong Wang*
Author Affiliations
  • College of Computer Science & Technology, Qingdao University, Qingdao 266071, Shandong, China
    Figures & Tables(7)
    Overall network structure
    Comparison of ablation study results on COCOPersons dataset
    Comparison of segmentation results on OCHuman dataset
    Comparison of segmentation results on COCOPersons dataset
    • Table 1. Results of ablation study on COCOPersons validation set

      Training method           AP      AP_M    AP_L
      One Stream                0.342   0.333   0.385
      Two Streams+Concatenate   0.588   0.542   0.690
      +Max Pooling              0.591   0.545   0.689
      +Avg Pooling              0.584   0.542   0.684
      +SENet [23]               0.593   0.545   0.693
      +CA [29]                  0.592   0.548   0.690
      +FFB (Ours)               0.595   0.548   0.697
    • Table 2. Segmentation results of different algorithms on OCHuman dataset

      Method                   Dataset    Backbone       AP      AP_H
      Mask R-CNN [1]           OCH val    ResNet50-fpn   0.163   0.113
      Mask R-CNN [1]           OCH test   ResNet50-fpn   0.169   0.128
      Pose2Seg (GT Kpt) [16]   OCH val    ResNet50-fpn   0.544   0.491
      Pose2Seg (GT Kpt) [16]   OCH test   ResNet50-fpn   0.552   0.495
      Pose2Seg                 OCH val    ResNet50-fpn   0.222   0.150
      Pose2Seg                 OCH test   ResNet50-fpn   0.238   0.175
      Ours                     OCH val    ResNet50-fpn   0.573   0.576
      Ours                     OCH test   ResNet50-fpn   0.567   0.570
    • Table 3. Segmentation results of different algorithms on COCOPersons dataset

      Method                   Backbone               AP      AP_M    AP_L
      Mask R-CNN [1]           ResNet50-fpn           0.532   0.433   0.648
      PersonLab [18]           ResNet101              -       0.476   0.592
      PersonLab                ResNet101 (ms scale)   -       0.492   0.621
      PersonLab                ResNet152              -       0.483   0.595
      PersonLab                ResNet152 (ms scale)   -       0.497   0.621
      Pose2Seg (GT Kpt) [16]   ResNet50-fpn           0.582   0.539   0.679
      Pose2Seg                 ResNet50-fpn           0.555   0.498   0.670
      Ours                     ResNet50-fpn           0.595   0.548   0.697

    Zitong Ma, Guodong Wang. Human Instance Segmentation Based on Two-Stream Convolutional Neural Network[J]. Laser & Optoelectronics Progress, 2022, 59(16): 1610004

    Paper Information

    Category: Image Processing

    Received: May 18, 2021

    Accepted: Jun. 27, 2021

    Published Online: Jul. 22, 2022

    Corresponding Author Email: Guodong Wang (doctorwgd@gmail.com)

    DOI:10.3788/LOP202259.1610004
