Infrared and Laser Engineering, Volume. 52, Issue 9, 20220876(2023)
Infrared time-sensitive target detection technology based on cross-modal data augmentation
[1] [1] Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, realtime object detection[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2016: 779788.
[2] X Yu, S Hong, J Yu, et al. Research on a ship target data augmentation method of visible remote sensing image. Chinese Journal of Scientific Instrument, 41, 261-269(2020).
[5] [5] Tayl L, Nitschke G. Improving deep learning with generic data augmentation[C]2018 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE, 2018, 15421547.
[6] [6] Zhong Z, Zheng L, Kang G, et al. Rom erasing data augmentation[C]Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 1300113008.
[8] [8] Gulrajani I, Ahmed F, Arjovsky M, et al. Improved training of wasserstein gans[EBOL]. (20171225) [20221206]. https:arxiv.gabs1704.00028.
[9] [9] Zheng Z, Zheng L, Yang Y. Unlabeled samples generated by gan improve the person reidentification baseline in vitro[C]Proceedings of the IEEE International Conference on Computer Vision, 2017: 37543762.
[10] [10] Zhong Z, Zheng L, Zheng Z, et al. Camera style adaptation f person reidentification[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2018: 51575166.
[11] [11] Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detect[C]Proceedings of the IEEE European Conference on Computer Vision, 2016: 2137.
[12] [12] Redmon J, Divvala S, Girshick R, et al. You only look once: unified, realtime object detection[C]2016 IEEE Conference on Computer Vision Pattern Recognition, 2016: 779788.
[13] [13] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies f accurate object detection semantic segmentation[C] 2014 IEEE Comference on Computer Vision Pattern Recognition, 2014: 580587.
[14] [14] Girshick R. Fast rcnn[C]Proceedings of the IEEE International Conference on Computer Vision, 2015: 14401448.
[19] [19] Owens A, Wu J, McDermott J H, et al. Ambient sound provides supervision f visual learning[C]European conference on computer vision. Springer, Cham, 2016: 801816.
[21] [21] Hou Q, Zhou D, Feng J. Codinate attention f efficient mobile wk design[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition. 2021: 1371313722.
[22] [22] Hu J, Shen L, Sun G. Squeezeexcitation wks[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2018: 71327141.
[23] [23] Woo S, Park J, Lee J Y, et al. Cbam: Convolutional block attention module[C]Proceedings of the European Conference on Computer Vision (ECCV), 2018: 319.
[25] [25] Xia G S, Bai X, Ding J, et al. DOTA: A largescale dataset f object detection in aerial images[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2018: 39743983.
Get Citation
Copy Citation Text
Siyu Wang, Xiaogang Yang, Ruitao Lu, Qingge Li, Jiwei Fan, Zhengjie Zhu. Infrared time-sensitive target detection technology based on cross-modal data augmentation[J]. Infrared and Laser Engineering, 2023, 52(9): 20220876
Category: Image processing
Received: Dec. 6, 2022
Accepted: --
Published Online: Oct. 23, 2023
The Author Email: