Laser & Optoelectronics Progress, Volume. 56, Issue 22, 221003(2019)

Light-Weight Multi-Object Detection Network Based on Inverted Residual Structure

Wanjun Liu, Mingyue Gao, Haicheng Qu*, and Lamei Liu
Author Affiliations
  • College of Software, Liaoning Technical University, Huludao, Liaoning 125105, China
  • show less
    Figures & Tables(13)
    Decoupling process of the depth separable convolution. (a) Standard convolution; (b) depth separable convolution
    Residual block and inverted residual block. (a) Residual block; (b) inverted residual block when stride is 1
    IR-YOLO network architecture
    Train loss curves
    Class detection accuracy histogram
    Comparison of detection results. (a)(d) Original input images ; (b)(e) detection results with YOLOv3-Tiny Model; (c)(f) detection results with IR-YOLO Model
    • Table 1. Parameters of inverted residual block

      View table

      Table 1. Parameters of inverted residual block

      InputOperationOutput
      h×w×k1×1 pointconv, ReLUh×w×2k
      h×w×2k3×3/sdepth conv, ReLUhs×ws ×2k
      hs×ws ×2k1×1 pointconv, linearhs×ws ×2k
    • Table 2. VOC dataset

      View table

      Table 2. VOC dataset

      CategoryTrain setTest set
      Aeroplane1171285
      Bicycle1064337
      Bird1605459
      Boat1140263
      Bottle1764469
      Bus822213
      Car32671201
      Cat1593358
      Chair3152756
      Cow847244
      Dining table824206
      Dog2025489
      Horse1072348
      Motor bike1052325
      Person132564528
      Potted plant1487480
      Sheep1070242
      Sofa814239
      Train925282
      TV monitor1108308
      Total4005812032
    • Table 3. Hyper parameters

      View table

      Table 3. Hyper parameters

      Parameters nameValue
      Batch64
      Momentum0.9
      Weight decay0.0005
      Learning rate0.001
    • Table 4. Comparison on number of floating point operations

      View table

      Table 4. Comparison on number of floating point operations

      InputOutputNumber of floatingpoint operations instandard conv /109Number of floating pointoperations in inverted residual block /109
      Expand point convDepth convSqueeze point conv
      208×208×16208×208×320.3990.0440.0250.089
      104×104×32104×104×640.3990.0440.0120.089
      52×52×6452×52×1280.3990.0440.0060.089
      26×26×12826×26×2560.3990.0440.0030.089
      13×13×25613×13×5120.3990.0440.0020.089
      13×13×51213×13×10241.5950.1770.0030.354
    • Table 5. Comparison detection speed of IR-YOLO model and YOLOv3-Tiny model

      View table

      Table 5. Comparison detection speed of IR-YOLO model and YOLOv3-Tiny model

      ModelCPU speed /(frame·s-1)GPU speed /(frame·s-1)
      YOLOv3-Tiny1.231.3
      IR-YOLO1.731.2
    • Table 6. Comparison mAP of different training numbers

      View table

      Table 6. Comparison mAP of different training numbers

      TrainingnumberYOLOv3-TinymAP /%IR-YOLOmAP /%
      6500045.1543.33
      7500045.6044.37
      8500045.1745.23
      9000042.7544.20
      9500042.7646.07
    • Table 7. Comparison of detection results of IR-YOLO and YOLOv3-Tiny on VOC dataset%

      View table

      Table 7. Comparison of detection results of IR-YOLO and YOLOv3-Tiny on VOC dataset%

      CategoryYOLOv3-TinyIR-YOLO
      Aeroplane54.7856.38
      Bicycle60.7957.86
      Bird27.2428.19
      Boat27.928.92
      Bottle14.817.58
      Bus56.9858.48
      Car63.864.05
      Cat50.3953.57
      Chair25.7723.25
      Cow46.4345.48
      Dining table39.6645.48
      Dog46.0945.68
      Horse66.6262.45
      Motor bike64.0962.85
      Person59.2359.4
      Potted plant18.2217.22
      Sheep47.5744.68
      Sofa39.3943.11
      Train54.0258.25
      TV monitor50.3448.62
      mAP45.6046.07
    Tools

    Get Citation

    Copy Citation Text

    Wanjun Liu, Mingyue Gao, Haicheng Qu, Lamei Liu. Light-Weight Multi-Object Detection Network Based on Inverted Residual Structure[J]. Laser & Optoelectronics Progress, 2019, 56(22): 221003

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Mar. 27, 2019

    Accepted: May. 17, 2019

    Published Online: Nov. 2, 2019

    The Author Email: Qu Haicheng (quhaicheng@lntu.edu.cn)

    DOI:10.3788/LOP56.221003

    Topics