Journal of Optoelectronics · Laser, Volume. 36, Issue 7, 712(2025)
Blue calico pattern generation based on improved stable diffusion model
[7] [7] KINGMA D P, WELLING M. Auto-encoding variational bayes [EB/OL]. (2013-12-20) [2024-03-27]. https://arxiv.org/abs/1312.6114.
[8] [8] GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11):139-144.
[9] [9] MIRZA M, OSINDERO S. Conditional generative adversarial nets[J]. Computer Science, 2014:2672-268.
[10] [10] ZHU J Y, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]//IEEE International Conference on Computer Vision(ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017:2223-2232.
[12] [12] BERMANO A H, GAL R, ALALUF Y, et al. State-of-the-art in the architecture, methods and applications of StyleGAN [EB/OL]. (2022-02-28)[2024-03-27]. https://arxiv.org/abs/2202.14020.
[13] [13] HO J, JAIN A, ABBEEL P. Denoising diffusion probabilistic models[C]//Advances in Neural Information Processing Systems, December 6-12, 2020, Vancouver, BC, Canda. Red Hook: Curran Associates Inc. , 2020, 33:6840-6851.
[14] [14] SONG J, MENG C, ERMON S. Denoising diffusion implicit models[EB/OL]. (2020-10-06) [2024-03-27]. https://arxiv.org/abs/2010.02502.
[15] [15] RADFORD A, KIM J W, HALLACY C, et al. Learning transferable visual models from natural language supervision[C]//International Conference on Machine Learning, July 18-24, 2021, Vienna, Austria. Brookline: JMLR, 2021, 139:8748-8763.
[16] [16] ROMBACH R, BLATTMANN A, LORENZ D, et al. High-resolution image synthesis with latent diffusion models[EB/OL]. (2021-12-20) [2024-03-27]. https://arxiv.org/abs/2112.10752.
[17] [17] GAL R, ALAlALUF Y, ATZMON Y, et al. An image is worth one word: Personalizing text-to-image generation using textual inversion[EB/OL]. (2022-08-02)[2024-03-27]. https://arxiv.org/abs/2208.01618.
[18] [18] RUIZ N, LI Y, JAMPANI V, et al. Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation[C] //IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-22, 2023, Vancouver, Canada. New York: IEEE 2023:22500-22510.
[19] [19] HU E J, SHEN Y, WALLIS P, et al. LoRA: Low-rank adaptation of large language models[EB/OL]. (2021-06-17) [2024-03-27]. https://arxiv.org/abs/2106.09685.
Get Citation
Copy Citation Text
WANG Zixiang, JIA Xiaojun, RAN Erfei, XU Congyuan. Blue calico pattern generation based on improved stable diffusion model[J]. Journal of Optoelectronics · Laser, 2025, 36(7): 712
Category:
Received: Mar. 27, 2024
Accepted: Jun. 24, 2025
Published Online: Jun. 24, 2025
The Author Email: JIA Xiaojun (xjjiad@sina.com)