Acta Optica Sinica, Volume. 43, Issue 15, 1510002(2023)
From Perception to Creation: Exploring Frontier of Image and Video Generation Methods
Fig. 17. VIDM framework using two diffusion model to generate video content and action information respectively[111]
Fig. 18. PVDM framework that represents the video as three two-dimensional hidden variables, and thus uses the two-dimensional diffusion model for training[112]
Fig. 20. VideoFusion framework that uses pre-trained text-to-image diffusion model to generate base frame and uses video data to train a residual noise generator[114]
|
|
Get Citation
Copy Citation Text
Liang Lin, Binbin Yang. From Perception to Creation: Exploring Frontier of Image and Video Generation Methods[J]. Acta Optica Sinica, 2023, 43(15): 1510002
Category: Image Processing
Received: Mar. 30, 2023
Accepted: Jul. 22, 2023
Published Online: Aug. 15, 2023
The Author Email: Binbin Yang (yangbb3@mail2.sysu.edu.cn)