Remote Sensing Technology and Application, Volume 40, Issue 4, 864 (2025)
Cross-modal Feature Decoupling and Focalizing Network for Robust UAV-based Road Traffic Scenes Semantic Segmentation
[6] YANG X H, LI H Q, ZHU W, et al. RSHRNet: Improved HRNet-based semantic segmentation for UAV rice seedling images in mechanical transplanting quality assessment[J]. Computers and Electronics in Agriculture, 2025, 234: 110273. DOI: 10.1016/j.compag.2025.110273
[7] HA Q S, WATANABE K, KARASAWA T, et al. MF-Net: Towards real-time semantic segmentation for autonomous vehicles with multi-spectral scenes[C]//Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2017: 5108-5115. DOI: 10.1109/IROS.2017.8206396
[8] SUN Y X, ZUO W X, LIU M. RTFNet: RGB-thermal fusion network for semantic segmentation of urban scenes[J]. IEEE Robotics and Automation Letters, 2019, 4(3): 2576-2583. DOI: 10.1109/LRA.2019.2904733
[9] SUN Y X, ZUO W X, YUN P, et al. FuseSeg: Semantic segmentation of urban scenes based on RGB and thermal data fusion[J]. IEEE Transactions on Automation Science and Engineering, 2021, 18(3): 1000-1011. DOI: 10.1109/TASE.2020.2993143
[10] ZHOU W J, LIN X Y, LEI J S, et al. MFFENet: Multiscale feature fusion and enhancement network for RGB-thermal urban road scene parsing[J]. IEEE Transactions on Multimedia, 2021, 24: 2526-2538. DOI: 10.1109/TMM.2021.3086618
[11] ZHOU W J, LIU J F, LEI J S, et al. GMNet: Graded-feature multilabel-learning network for RGB-thermal urban scene semantic segmentation[J]. IEEE Transactions on Image Processing, 2021, 30: 7790-7802. DOI: 10.1109/TIP.2021.3109518
[12] HOU Y L, JIA Y, HOU Z J, et al. IAFFNet: Illumination-aware feature fusion network for all-day RGB-thermal semantic segmentation of road scenes[J]. IEEE Access, 2022, 10: 129702-129711.
[13] CHEN Y, ZHAN W D, JIANG Y C, et al. LASNet: A light-weight asymmetric spatial feature network for real-time semantic segmentation[J]. Electronics, 2022, 11(19): 3238. DOI: 10.3390/electronics11193238
[14] WANG Q W, YIN C, SONG H H, et al. UTFNet: Uncertainty-guided trustworthy fusion network for RGB-thermal semantic segmentation[J]. IEEE Geoscience and Remote Sensing Letters, 2023, 20: 1-5. DOI: 10.1109/LGRS.2023.3322452
[15] ZHANG Q, ZHAO S L, LUO Y J, et al. ABMDRNet: Adaptive-weighted bi-directional modality difference reduction network for RGB-T semantic segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2021: 2633-2642. DOI: 10.1109/cvpr46437.2021.00266
[16] ZHAO S L, LIU Y C, JIAO Q, et al. Mitigating modality discrepancies for RGB-T semantic segmentation[J]. IEEE Transactions on Neural Networks and Learning Systems, 2024, 35(7): 9380-9394. DOI: 10.1109/TNNLS.2022.3233089
[17] ZHOU H, TIAN C H, ZHANG Z X, et al. Multispectral fusion transformer network for RGB-thermal urban scene semantic segmentation[J]. IEEE Geoscience and Remote Sensing Letters, 2022, 19: 1-5. DOI: 10.1109/LGRS.2022.3179721
[18] ZHANG J M, LIU H Y, YANG K L, et al. CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers[J]. IEEE Transactions on Intelligent Transportation Systems, 2023, 24(12): 14679-14694. DOI: 10.1109/TITS.2023.3300537
[19] WAN Z F, ZHANG P P, WANG Y H, et al. Sigma: Siamese mamba network for multi-modal semantic segmentation[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). IEEE, 2025. DOI: 10.1109/WACV61041.2025.00176
[20] GUO X D, LIN Z A, HU L W, et al. Cross-modal state space modeling for real-time RGB-thermal wild scene semantic segmentation[J]. arXiv Preprint, 2025. DOI: 10.48550/arXiv.2506.17869
[21] OUYANG Junlin, WANG Qingwang, SHEN Tao. Kust4K: An RGB-TIR dataset from UAV platform for robust urban traffic scenes semantic segmentation[DB/OL]. Figshare, 2025. DOI: 10.6084/m9.figshare.29476610.v3
[22] CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers[C]//Proceedings of the European Conference on Computer Vision (ECCV), 2020. arXiv: 2005.12872
[23] CHENG B W, SCHWING A, KIRILLOV A. Per-pixel classification is not all you need for semantic segmentation[J]. Advances in Neural Information Processing Systems, 2021, 34: 17864-17875. DOI: 10.5555/3540261.3541628
[24] LI F, ZHANG H, XU H Z, et al. Mask DINO: Towards a unified transformer-based framework for object detection and segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2023: 3041-3050. DOI: 10.1109/CVPR52729.2023.00297
[25] LIANG M J, HU J J, BAO C Y, et al. Explicit attention-enhanced fusion for RGB-thermal perception tasks[J]. IEEE Robotics and Automation Letters, 2023, 8(7): 4060-4067. DOI: 10.1109/LRA.2023.3272269
[26] DENG F Q, FENG H, LIANG M J, et al. FEANet: Feature-enhanced attention network for RGB-thermal real-time semantic segmentation[C]//Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2021: 4467-4473. DOI: 10.1109/iros51168.2021.9636084
[27] RONNEBERGER O, FISCHER P, BROX T. U-Net: Convolutional networks for biomedical image segmentation[M]//Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015. Cham: Springer International Publishing, 2015: 234-241. DOI: 10.1007/978-3-319-24574-4_28
[28] XIAO T T, LIU Y C, ZHOU B L, et al. Unified perceptual parsing for scene understanding[C]//Proceedings of the European Conference on Computer Vision (ECCV), 2018. arXiv: 1807.10221
[29] ZHANG J M, LIU R P, SHI H, et al. Delivering arbitrary-modal semantic segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2023: 1136-1147. DOI: 10.1109/CVPR52729.2023.00116
WANG Qingwang, OUYANG Junlin, JIN Pengcheng, SHEN Tao. Cross-modal Feature Decoupling and Focalizing Network for Robust UAV-based Road Traffic Scenes Semantic Segmentation[J]. Remote Sensing Technology and Application, 2025, 40(4): 864
Received: May. 11, 2025
Accepted: Aug. 26, 2025
Published Online: Aug. 26, 2025
The Author Email: SHEN Tao (shentao@kust.edu.cn)