Blue calico pattern generation based on improved stable diffusion model

WANG Zixiang; JIA Xiaojun; RAN Erfei; XU Congyuan

doi:10.16136/j.joel.2025.07.0145

Journal of Optoelectronics · Laser, Volume. 36, Issue 7, 712(2025)

Blue calico pattern generation based on improved stable diffusion model

WANG Zixiang^1,2, JIA Xiaojun^1,2、*, RAN Erfei³, and XU Congyuan²

Author Affiliations

¹School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou, Zhejiang 310018, China

²College of Information Science and Engineering, Jiaxing University, Jiaxing, Zhejiang 314001, China

³School of Computer Science and Technology (School of Artificial Intelligence), Zhejiang Sci-Tech University, Hangzhou, Zhejiang 310018, China

show less

Abstract Get PDF(in Chinese)

References(12)

[7] [7] KINGMA D P, WELLING M. Auto-encoding variational bayes ［EB/OL］. (2013-12-20) ［2024-03-27］. https://arxiv.org/abs/1312.6114.

[8] [8] GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks［J］. Communications of the ACM, 2020, 63(11):139-144.

[9] [9] MIRZA M, OSINDERO S. Conditional generative adversarial nets［J］. Computer Science, 2014:2672-268.

[10] [10] ZHU J Y, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks［C］//IEEE International Conference on Computer Vision(ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017:2223-2232.

[12] [12] BERMANO A H, GAL R, ALALUF Y, et al. State-of-the-art in the architecture, methods and applications of StyleGAN ［EB/OL］. (2022-02-28)［2024-03-27］. https://arxiv.org/abs/2202.14020.

[13] [13] HO J, JAIN A, ABBEEL P. Denoising diffusion probabilistic models［C］//Advances in Neural Information Processing Systems, December 6-12, 2020, Vancouver, BC, Canda. Red Hook: Curran Associates Inc. , 2020, 33:6840-6851.

[14] [14] SONG J, MENG C, ERMON S. Denoising diffusion implicit models［EB/OL］. (2020-10-06) ［2024-03-27］. https://arxiv.org/abs/2010.02502.

[15] [15] RADFORD A, KIM J W, HALLACY C, et al. Learning transferable visual models from natural language supervision［C］//International Conference on Machine Learning, July 18-24, 2021, Vienna, Austria. Brookline: JMLR, 2021, 139:8748-8763.

[16] [16] ROMBACH R, BLATTMANN A, LORENZ D, et al. High-resolution image synthesis with latent diffusion models［EB/OL］. (2021-12-20) ［2024-03-27］. https://arxiv.org/abs/2112.10752.

[17] [17] GAL R, ALAlALUF Y, ATZMON Y, et al. An image is worth one word: Personalizing text-to-image generation using textual inversion［EB/OL］. (2022-08-02)［2024-03-27］. https://arxiv.org/abs/2208.01618.

[18] [18] RUIZ N, LI Y, JAMPANI V, et al. Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation［C］ //IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-22, 2023, Vancouver, Canada. New York: IEEE 2023:22500-22510.

[19] [19] HU E J, SHEN Y, WALLIS P, et al. LoRA: Low-rank adaptation of large language models［EB/OL］. (2021-06-17) ［2024-03-27］. https://arxiv.org/abs/2106.09685.

Tools

Get Citation

Copy Citation Text

WANG Zixiang, JIA Xiaojun, RAN Erfei, XU Congyuan. Blue calico pattern generation based on improved stable diffusion model[J]. Journal of Optoelectronics · Laser, 2025, 36(7): 712

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Mar. 27, 2024

Accepted: Jun. 24, 2025

Published Online: Jun. 24, 2025

The Author Email: JIA Xiaojun (xjjiad@sina.com)

DOI:10.16136/j.joel.2025.07.0145

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology