Chinese Journal of Ship Research, Volume. 19, Issue 5, 188(2024)

Lightweight and robust ship detection method driven by self-attention mechanism

Feng MA1,2, Zihui SHI1,2, Jie SUN3, Chen CHEN3,4, Xianbin MAO5, and Xinping YAN2
Author Affiliations
  • 1School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan 430063, China
  • 2Intelligent Transportation Systems Research Center, Wuhan University of Technology, Wuhan 430063, China
  • 3Nanjing Smart Water Transportation Technology Co., Ltd, Nanjing 210028, China
  • 4School of Computer Science and Technology, Wuhan Institute of Technology, Wuhan 430205, China
  • 5Zhoushan Haihua Passenger Transport Co., Ltd, Zhoushan 316111, China
  • show less
    References(29)

    [2] [2] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, realtime object detection[C]2016 IEEE Conference on Computer Vision Pattern Recognition (CVPR). Las Vegas, NV, USA: IEEE, 2016: 779−788.

    [3] [3] REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]IEEE Conference on Computer Vision Pattern Recognition (CVPR). Honolulu, HI, USA: IEEE, 2017: 6517−6525.

    [4] [4] REDMON J, FARHADI A. Yolov3: an incremental improvement[R]. Washington: University of Washington, 2018.

    [5] [5] BOCHKOVSKIY A, WANG C Y, LIAO H Y M. YOLOv4: optimal speed accuracy of object detection[EBOL]. (20200423)[20230526]. https:doi.g10.48550arXiv.2004.10934.

    [6] [6] KALCHBRENNER N, GREFENSTETTE E, BLUNSOM P, et al. A convolutional neural wk f modelling sentences[C]Proceedings of the 52nd Annual Meeting of the Association f Computational Linguistics (Volume 1: Long Papers). Baltime, Maryl: Association f Computational Linguistics, 2014.

    [7] [7] GIRSHICK R. Fast RCNN[C]Proceedings of the 2015 IEEE International Conference on Computer Vision. Washington: IEEE, 2015: 1440−1448.

    [9] [9] HE K M, GKIOXARI G, DOLLÁR P, et al. Mask RCNN[C]2017 IEEE International Conference on Computer Vision (ICCV). Venice, Italy: IEEE, 2017: 2980−2988.

    [10] [10] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot MultiBox detect[M]LEIBE B, MATAS J, SEBE N, et al. 14th European Conference on Computer VisionECCV 2016. Amsterdam, The herls: Springer, 2016, 9905: 21−37.

    [13] [13] FU K, LI J, MA L, et al. Intrinsic relationship reasoning f small object detection[EBOL]. (20200902)[20230526]. https:doi.g10.48550arXiv.2009.00833.

    [16] [16] LIU Z M, GAO G Y, SUN L, et al. HRD: Highresolution detection wk f small objects[C]2021 IEEE International Conference on Multimedia Expo (ICME). Shenzhen, China: IEEE, 2021: 1−6.

    [17] [17] ZHU X K, LÜ S C, WANG X, et al. TPHYOLOv5: Improved YOLOv5 based on transfmer prediction head f object detection on dronecaptured scenarios[C]2021 IEEECVF International Conference on Computer Vision Wkshops (ICCVW). Montreal, Canada: IEEE, 2021: 2778−2788.

    [19] [19] LIU Z, LIN Y T, CAO Y, et al. Swin transfmer: hierarchical vision transfmer using shifted windows[C]2021 IEEECVF International Conference on Computer Vision (ICCV). Montreal, QC, Canada: IEEE, 2021.

    [21] [21] CHANG J Y, OH H, LEE S J, et al. Ship detection f KOMPSAT3A optical images using binary features adaboost classification[C]IGARSS 20202020 IEEE International Geoscience Remote Sensing Symposium. Waikoloa, HI, USA: IEEE, 2020.

    [25] [25] LIU T, ZHOU B J, ZHAO Y S, et al. Ship detection algithm based on improved YOLO V5[C]2021 6th International Conference on Automation, Control Robotics Engineering (CACRE). Dalian, China: IEEE, 2021.

    [27] [27] YUAN L, CHEN Y P, WANG T, et al. Tokenstotoken ViT: Training vision transfmers from scratch on Image[C]2021 IEEECVF International Conference on Computer Vision. Montreal, QC, Canada: IEEE, 2021: 558−567.

    [28] [28] REZATOFIGHI H, TSOI N, GWAK J, et al. Generalized intersection over union: a metric a loss f bounding box regression[C]2019 IEEECVF Conference on Computer Vision Pattern Recognition (CVPR). Long Beach, CA, USA: IEEE, 2019.

    [29] [29] ZHENG Z H, WANG P, LIU W, et al. DistanceIoU loss: faster better learning f bounding box regression[C]34th AAAI Conference on Artificial Intelligence. New Yk: IEEE, 2020.

    [30] [30] DENG J, DONG W, SOCHER R, et al. Image: a largescale hierarchical image database[C]2009 IEEE Conference on Computer Vision Pattern Recognition. Miami, FL, USA: IEEE, 2009.

    [31] [31] LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: common objects in context[M]FLEET D, PAJDLA T, SCHIELE B, et al. 13rd European Conference on Computer VisionECCV 2014. Zurich, Switzerl: Springer, 2014, 8693: 740−755.

    Tools

    Get Citation

    Copy Citation Text

    Feng MA, Zihui SHI, Jie SUN, Chen CHEN, Xianbin MAO, Xinping YAN. Lightweight and robust ship detection method driven by self-attention mechanism[J]. Chinese Journal of Ship Research, 2024, 19(5): 188

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Weapon, Electronic and Information System

    Received: May. 30, 2023

    Accepted: --

    Published Online: Mar. 14, 2025

    The Author Email:

    DOI:10.19693/j.issn.1673-3185.03389

    Topics