Opto-Electronic Engineering, Volume. 52, Issue 2, 240269-1(2025)

An instrument detection method for complex retinal microsurgery

Yuhao He1,2, Yiwei Chen2, Jinyu Fan2, Yi He1,2, and Guohua Shi1,2、*
Author Affiliations
  • 1Department of Biomedical Engineering, University of Science and Technology of China, Hefei, Anhui 230026, China
  • 2Jiangsu Key Laboratory of Medical Optics, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, Jiangsu 215163, China
  • show less
    Figures & Tables(16)
    Three main surgical instruments in RET1 dataset. (a) Vitrectomy cutter; (b) Internal limiting membrane peeling forceps; (c) Light pipe
    Overall network architecture of the proposed RM-YOLO model
    Cascading of three attention mechanisms in the detection head
    Integrated structure of the proposed dynamic head and decoupled head
    Diagram of the SGConv convolution layer
    RSGCSP feature extraction modules
    Transformation process of the reparameterization part
    Schematic diagram of proposed DeltaIoU
    Results of the loss function experiments. (a) mAP at different values of α; (b) Bounding box regression loss corresponding to different IoUs; (c) mAP corresponding to different IoUs
    The results of three main types of surgical instruments in the experiment
    Detection results of different algorithms for a single surgical instrument in complex scenarios. (a) Severe instrument reflection; (b) Severe distortion in microscope imaging
    Detection results of different algorithms for multiple surgical instruments in complex scenarios. (a) High instrument similarity; (b) Microscope out of focus; (c) Occlusion between instruments
    Detection of surgical instruments during vitrectomy under different lighting conditions by the proposed algorithm
    • Table 1. Experimental environment configuration

      View table
      View in Article

      Table 1. Experimental environment configuration

      ConfigurationConfiguration parameters
      Operating systemWindows 11
      GPUNvidia Geforce RTX 4070 Super
      Programming languagePython 3.11
      FrameworkPytorch 2.1
      GPU computing frameworkCuda 12.1
      GPU acceleration libraryCudnn 8.0
      Learning rate0.001
      Momentum0.9
      Weight rate decay0.0005
      Batch size32
      Epochs300
    • Table 2. Ablation experiments results

      View table
      View in Article

      Table 2. Ablation experiments results

      ModelDynamic headRSGCSP(SGConv)DeltaIoU lossPRmAP50-95FPS
      A0.9600.9250.686206
      B0.9660.9230.706128
      C0.9770.9290.691210
      D0.9700.9280.680216
      E0.9600.9310.702136
      F0.9800.9260.707198
      G0.9850.9300.711115
      H0.9750.9410.724143
    • Table 3. Comparison experiments results

      View table
      View in Article

      Table 3. Comparison experiments results

      ModelPRmAP50-95Parameters/MGFLOPsFPS
      Faster R-CNN0.9610.9190.652//85
      YOLOv3s[24]0.9810.9310.68715.3243.8147
      YOLOv5s0.9650.9270.6839.1123.8194
      YOLOv6s[25]0.9600.9140.68116.344.0192
      RT-DETR0.9640.8950.62310.5623.9131
      YOLOv9m[26]0.9520.9270.68520.0276.589
      YOLOv10s[27]0.9470.8760.6848.0424.5182
      DBH-YOLO0.9750.9180.64320.8647.9128
      RM-YOLO(ours)0.9750.9410.7247.420.7143
    Tools

    Get Citation

    Copy Citation Text

    Yuhao He, Yiwei Chen, Jinyu Fan, Yi He, Guohua Shi. An instrument detection method for complex retinal microsurgery[J]. Opto-Electronic Engineering, 2025, 52(2): 240269-1

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Article

    Received: Nov. 20, 2024

    Accepted: Jan. 15, 2025

    Published Online: Apr. 27, 2025

    The Author Email:

    DOI:10.12086/oee.2025.240269

    Topics