Optoelectronics Letters, Volume. 21, Issue 5, 298(2025)

BEDiff: denoising diffusion probabilistic models for building extraction

Yanjing LEI, Yuan WANG, Sixian CHAN, Jie HU, Xiaolong ZHOU, and Hongkai ZHANG
References(23)

[1] [1] QIU W Y, GU L J, GAO F, et al. Building extraction from very high-resolution remote sensing images using refine-UNet[J]. IEEE geoscience and remote sensing letters, 2023, 20: 1-5.

[2] [2] LI X, XU F, LIU F, et al. Semantic segmentation of remote sensing images by interactive representation refinement and geometric prior-guided inference[J]. IEEE transactions on geoscience and remote sensing, 2024, 62: 1-18.

[3] [3] LI G C, XI B B, HE Y F, et al. Diamond-UNet: a novel semantic segmentation network based on UNet network and transformer for deep space rock images[J]. IEEE geoscience and remote sensing letters, 2024, 21: 1-5.

[4] [4] XIA L G, MI S L, ZHANG J X, et al. Dual-stream feature extraction network based on CNN and transformer for building extraction[J]. Remote sensing, 2023, 15(10): 2689.

[5] [5] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[EB/OL]. (2020-10-22)[2024-01-23]. https://arxiv.org/abs/2010.11929.

[6] [6] LIU Z, LINY, CAO Y, et al. Swin transformer: hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, October 11-17, 2021, Montreal, Canada. New York: IEEE, 2021: 10012-10022.

[7] [7] YUAN W, RAN W H, SHI X D, et al. Multi-constraint transformer based automatic building extraction from high resolution remote sensing images[J]. IEEE journal of selected topics in applied earth observations and remote sensing, 2023, 16: 9164-9174.

[8] [8] HO J, JAIN A, ABBEEL P. Denoising diffusion probabilistic models[J]. Advances in neural information processing systems, 2020, 33: 6840-6851.

[9] [9] WU J, FU R, FANG H H, et al. MedSegDiff: medical image segmentation with diffusion probabilistic model[EB/OL]. (2022-11-01) [2024-01-23]. https://arxiv.org/abs/2211.00611.

[10] [10] YU M M, CHAN S X, ZHOU X L, et al. Small object detection on highways via balance feature fusion and task-specific encoding network[J]. Optoelectronics letters, 2024, 20(7): 424-429.

[11] [11] BICAKCI Y S, SARICA B. ATTransUNet: semantic segmentation model for building segmentation from aerial image and laser data[J]. Nordic machine intelligence, 2022, 2(3).

[12] [12] LI M L, RUI J, YANG S K, et al. Method of building detection in optical remote sensing images based on segformer[J]. Sensors, 2023, 23(3): 1258.

[13] [13] CHAN S X, WANG Y, LEI Y J, et al. Asymmetric cascade fusion network for building extraction[J]. IEEE transactions on geoscience and remote sensing, 2023, 61: 1-18.

[14] [14] SAHARIA C, HO J, CHAN W, et al. Image super-resolution via iterative refinement[J]. IEEE transactions on pattern analysis and machine intelligence, 2022, 45(4): 4713-4726.

[15] [15] WHANG J, DELBRACIO M, TALEBI H, et al. Deblurring via stochastic refinement[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 21-24, 2022, New Orleans, Louisiana, USA. New York: IEEE, 2022: 16293-16303.

[16] [16] GUO X T, YANG Y W, YE C F, et al. Accelerating diffusion models via pre-segmentation diffusion sampling for medical image segmentation[C]//2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), April 18-21, 2023, Cartagena de Indias, Colombia. New York: IEEE, 2023: 1-5.

[17] [17] WU J, JI W, FU H Z, et al. Medsegdiff-v2: diffusion-based medical image segmentation with transformer[C]//Proceedings of the AAAI Conference on Artificial Intelligence, February 20-27, 2024, Vancouver, Canada. Washington: AAAI, 2024, 38(6): 6030-6038.

[18] [18] XIAO T, LIU Y C, ZHOU B L, et al. Unified perceptual parsing for scene understanding[C]//Proceedings of the European Conference on Computer Vision (ECCV), September 8-14, 2018, Munich, Germany. Berlin, Heidelberg: Springer, 2018: 418-434.

[19] [19] XIA L G, ZHANG X B, ZHANG J X, et al. Building extraction from very-high-resolution remote sensing images using semi-supervised semantic edge detection[J]. Remote sensing, 2021, 13(11): 2187.

[20] [20] CHEN L C, ZHU Y K, PAPANDREOU G, et al. Encoderdecoder with atrous separable convolution for semantic image segmentation[C]//Proceedings of the European Conference on Computer Vision (ECCV), September 8-14, 2018, Munich, Germany. Berlin, Heidelberg: Springer, 2018: 801-818.

[21] [21] CHU X, TIAN Z, WANG Y, et al. Twins: revisiting the design of spatial attention in vision transformers[J]. Advances in neural information processing systems, 2021, 34: 9355-9366.

[22] [22] LIU Z, MAO H Z, WU C Y, et al. A convnet for the 2020s[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 21-24, 2022, New Orleans, Louisiana, USA. New York: IEEE, 2022: 11976-11986.

[23] [23] GUO M H, LU C Z, HOU Q B, et al. Segnext: rethinking convolutional attention design for semantic segmentation[J]. Advances in neural information processing systems, 2022, 35: 1140-1156.

Tools

Get Citation

Copy Citation Text

LEI Yanjing, WANG Yuan, CHAN Sixian, HU Jie, ZHOU Xiaolong, ZHANG Hongkai. BEDiff: denoising diffusion probabilistic models for building extraction[J]. Optoelectronics Letters, 2025, 21(5): 298

Download Citation

EndNote(RIS)BibTexPlain Text
Save article for my favorites
Paper Information

Received: Mar. 19, 2024

Accepted: Apr. 11, 2025

Published Online: Apr. 11, 2025

The Author Email:

DOI:10.1007/s11801-025-4072-2

Topics