Cross-Modal Multilevel Feature Fusion-Based Algorithm for Power-Equipment Detection

Shanfeng LIU; Wandeng MAO; Miaomiao LI; Qiankai ZHOU; Wenjie ZOU; Hua BAO

Infrared Technology, Volume. 47, Issue 7, 884(2025)

Shanfeng LIU¹, Wandeng MAO¹, Miaomiao LI¹, Qiankai ZHOU², Wenjie ZOU³, and Hua BAO^3、*

Author Affiliations

¹Electric Power Research Institute of State Grid Henan Electric Power Company, Zhengzhou 450018, China

²State Grid Zhumadian Queshan Electric Power Compony, Zhumadian 463200, China

³School of Electrical Engineering and Automation, Anhui University, Hefei 230601, China

show less

Abstract Get PDF(in Chinese)

A novel cross-modal multilevel feature fusion algorithm based on adaptive fusion and self-attention enhancement is proposed to address the low robustness of power-equipment detection algorithms and inaccurate small-target detection in complex environments. The algorithm begins by constructing a dual-stream feature-extraction network to extract multilevel target representations from visible-light and infrared images. An adaptive fusion module is introduced to capture complementary features from both the visible-light and infrared branches. Furthermore, a self-attention mechanism based on a Transformer is employed to enhance the semantic spatial information of the complementary features. Finally, precise target localization is achieved by utilizing deep features at different scales. Experimental evaluations were conducted on a custom-developed power-equipment dataset, and the results show that the proposed algorithm achieved an average precision mean value of 91.7%. Compared with using only the visible-light or infrared branch separately, the algorithm shows improvements of 3.5% and 3.9%, respectively, thus effectively achieving cross-modal information fusion. Compared with current mainstream object-detection algorithms, it exhibits superior robustness.

Keywords

adaptive fusion cross-modal object detection power equipment self-attention mechanism

Tools

Get Citation

Copy Citation Text

LIU Shanfeng, MAO Wandeng, LI Miaomiao, ZHOU Qiankai, ZOU Wenjie, BAO Hua. Cross-Modal Multilevel Feature Fusion-Based Algorithm for Power-Equipment Detection[J]. Infrared Technology, 2025, 47(7): 884

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Dec. 13, 2023

Accepted: Aug. 12, 2025

Published Online: Aug. 12, 2025

The Author Email: BAO Hua (baohua@ahu.edu.cn)

DOI:

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology