Optoelectronics Letters, Volume. 21, Issue 5, 298(2025)
BEDiff: denoising diffusion probabilistic models for building extraction
[1] [1] QIU W Y, GU L J, GAO F, et al. Building extraction from very high-resolution remote sensing images using refine-UNet[J]. IEEE geoscience and remote sensing letters, 2023, 20: 1-5.
[2] [2] LI X, XU F, LIU F, et al. Semantic segmentation of remote sensing images by interactive representation refinement and geometric prior-guided inference[J]. IEEE transactions on geoscience and remote sensing, 2024, 62: 1-18.
[3] [3] LI G C, XI B B, HE Y F, et al. Diamond-UNet: a novel semantic segmentation network based on UNet network and transformer for deep space rock images[J]. IEEE geoscience and remote sensing letters, 2024, 21: 1-5.
[4] [4] XIA L G, MI S L, ZHANG J X, et al. Dual-stream feature extraction network based on CNN and transformer for building extraction[J]. Remote sensing, 2023, 15(10): 2689.
[5] [5] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[EB/OL]. (2020-10-22)[2024-01-23]. https://arxiv.org/abs/2010.11929.
[6] [6] LIU Z, LINY, CAO Y, et al. Swin transformer: hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, October 11-17, 2021, Montreal, Canada. New York: IEEE, 2021: 10012-10022.
[7] [7] YUAN W, RAN W H, SHI X D, et al. Multi-constraint transformer based automatic building extraction from high resolution remote sensing images[J]. IEEE journal of selected topics in applied earth observations and remote sensing, 2023, 16: 9164-9174.
[8] [8] HO J, JAIN A, ABBEEL P. Denoising diffusion probabilistic models[J]. Advances in neural information processing systems, 2020, 33: 6840-6851.
[9] [9] WU J, FU R, FANG H H, et al. MedSegDiff: medical image segmentation with diffusion probabilistic model[EB/OL]. (2022-11-01) [2024-01-23]. https://arxiv.org/abs/2211.00611.
[10] [10] YU M M, CHAN S X, ZHOU X L, et al. Small object detection on highways via balance feature fusion and task-specific encoding network[J]. Optoelectronics letters, 2024, 20(7): 424-429.
[11] [11] BICAKCI Y S, SARICA B. ATTransUNet: semantic segmentation model for building segmentation from aerial image and laser data[J]. Nordic machine intelligence, 2022, 2(3).
[12] [12] LI M L, RUI J, YANG S K, et al. Method of building detection in optical remote sensing images based on segformer[J]. Sensors, 2023, 23(3): 1258.
[13] [13] CHAN S X, WANG Y, LEI Y J, et al. Asymmetric cascade fusion network for building extraction[J]. IEEE transactions on geoscience and remote sensing, 2023, 61: 1-18.
[14] [14] SAHARIA C, HO J, CHAN W, et al. Image super-resolution via iterative refinement[J]. IEEE transactions on pattern analysis and machine intelligence, 2022, 45(4): 4713-4726.
[15] [15] WHANG J, DELBRACIO M, TALEBI H, et al. Deblurring via stochastic refinement[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 21-24, 2022, New Orleans, Louisiana, USA. New York: IEEE, 2022: 16293-16303.
[16] [16] GUO X T, YANG Y W, YE C F, et al. Accelerating diffusion models via pre-segmentation diffusion sampling for medical image segmentation[C]//2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), April 18-21, 2023, Cartagena de Indias, Colombia. New York: IEEE, 2023: 1-5.
[17] [17] WU J, JI W, FU H Z, et al. Medsegdiff-v2: diffusion-based medical image segmentation with transformer[C]//Proceedings of the AAAI Conference on Artificial Intelligence, February 20-27, 2024, Vancouver, Canada. Washington: AAAI, 2024, 38(6): 6030-6038.
[18] [18] XIAO T, LIU Y C, ZHOU B L, et al. Unified perceptual parsing for scene understanding[C]//Proceedings of the European Conference on Computer Vision (ECCV), September 8-14, 2018, Munich, Germany. Berlin, Heidelberg: Springer, 2018: 418-434.
[19] [19] XIA L G, ZHANG X B, ZHANG J X, et al. Building extraction from very-high-resolution remote sensing images using semi-supervised semantic edge detection[J]. Remote sensing, 2021, 13(11): 2187.
[20] [20] CHEN L C, ZHU Y K, PAPANDREOU G, et al. Encoderdecoder with atrous separable convolution for semantic image segmentation[C]//Proceedings of the European Conference on Computer Vision (ECCV), September 8-14, 2018, Munich, Germany. Berlin, Heidelberg: Springer, 2018: 801-818.
[21] [21] CHU X, TIAN Z, WANG Y, et al. Twins: revisiting the design of spatial attention in vision transformers[J]. Advances in neural information processing systems, 2021, 34: 9355-9366.
[22] [22] LIU Z, MAO H Z, WU C Y, et al. A convnet for the 2020s[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 21-24, 2022, New Orleans, Louisiana, USA. New York: IEEE, 2022: 11976-11986.
[23] [23] GUO M H, LU C Z, HOU Q B, et al. Segnext: rethinking convolutional attention design for semantic segmentation[J]. Advances in neural information processing systems, 2022, 35: 1140-1156.
Get Citation
Copy Citation Text
LEI Yanjing, WANG Yuan, CHAN Sixian, HU Jie, ZHOU Xiaolong, ZHANG Hongkai. BEDiff: denoising diffusion probabilistic models for building extraction[J]. Optoelectronics Letters, 2025, 21(5): 298
Received: Mar. 19, 2024
Accepted: Apr. 11, 2025
Published Online: Apr. 11, 2025
The Author Email: