Journal of Optoelectronics · Laser, Vol. 36, Issue 7, 712 (2025)

Blue calico pattern generation based on improved stable diffusion model

WANG Zixiang1,2, JIA Xiaojun1,2,*, RAN Erfei3, and XU Congyuan2
Author Affiliations
  • 1School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou, Zhejiang 310018, China
  • 2College of Information Science and Engineering, Jiaxing University, Jiaxing, Zhejiang 314001, China
  • 3School of Computer Science and Technology (School of Artificial Intelligence), Zhejiang Sci-Tech University, Hangzhou, Zhejiang 310018, China
    WANG Zixiang, JIA Xiaojun, RAN Erfei, XU Congyuan. Blue calico pattern generation based on improved stable diffusion model[J]. Journal of Optoelectronics · Laser, 2025, 36(7): 712

    Paper Information

    Received: Mar. 27, 2024

    Accepted: Jun. 24, 2025

    Published Online: Jun. 24, 2025

    The Author Email: JIA Xiaojun (xjjiad@sina.com)

    DOI:10.16136/j.joel.2025.07.0145
