Laser & Optoelectronics Progress, Volume. 61, Issue 24, 2437005(2024)

Underwater Object Detection Using a Multiscale and Cross-Spatial Information Aggregation Network

Jihai Yang1 and Xiaofang Pei1,2、*
Author Affiliations
  • 1School of Electronics and Information Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, Jiangsu , China
  • 2School of Electronic and Information Engineering, Wuxi University, Wuxi 214105, Jiangsu , China
  • show less

    An underwater target detection algorithm that uses a multiscale and cross-spatial information aggregation network is proposed. First, a deformable layer aggregation module is used within the backbone network to extract features, enhancing the network's positioning accuracy. Second, the Conv2former module is used to enhance the neck's global information extraction capability and reduce missing detections caused by mutual occlusion among underwater targets. Finally, a multiscale attention parallel enhancement module that uses parallel convolution blocks to extract deeper features is proposed. This module integrates an efficient multiscale attention module to filter out interference from background and image distortion and introduces multiple cross-level connections to effectively integrate low-level local features with high-level strong semantic information, thereby improving model detection accuracy. The ablation experiment is conducted on the URPC dataset. Compared with the original model, the accuracy rate, recall rate, mean average precision (mAP)@0.5, and mAP@0.5∶0.95 of the improved model increase by 3.6 percentage points, 2.6 percentage points, 3.5 percentage points, and 3.3 percentage points, respectively. Tests on the RUOD dataset under different scenarios indicate that the proposed model offers notable advantages over several current mainstream models.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Jihai Yang, Xiaofang Pei. Underwater Object Detection Using a Multiscale and Cross-Spatial Information Aggregation Network[J]. Laser & Optoelectronics Progress, 2024, 61(24): 2437005

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Digital Image Processing

    Received: Mar. 25, 2024

    Accepted: May. 8, 2024

    Published Online: Dec. 12, 2024

    The Author Email: Xiaofang Pei (xiaofangpei@163.com)

    DOI:10.3788/LOP240958

    CSTR:32186.14.LOP240958

    Topics