Infrared and Laser Engineering, Vol. 54, Issue 7, 20250189 (2025)
Deep learning empowers generation and presentation of virtual-real fusion scenarios in holographic metaverse: development and prospects (invited)
[2] HE Zehao, CAO Liangcai. Display, interactions and applications of immersive metaverse: Progress and outlooks[J]. Science & Technology Review, 41, 6-14(2023).
[3] GOTSCH D, ZHANG X, MERRITT T, et al. TeleHuman2: A cylindrical light field teleconferencing system for life-size 3D human telepresence[C]//Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018, 18: 552.
[4] LAWRENCE J, GOLDMAN D B, ACHAR S, et al. Project Starline: A high-fidelity telepresence system[J]. ACM Transactions on Graphics, 40, 242(2021).
[11] SUN J, XIE Y, CHEN L, et al. NeuralRecon: Real-time coherent 3D reconstruction from monocular video[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 15598-15607.
[12] WENG C Y, CURLESS B, SRINIVASAN P P, et al. HumanNeRF: Free-viewpoint rendering of moving people from monocular video[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 16210-16220.
[14] WANG Y, LIANG Y, XU H, et al. SQLdepth: Generalizable self-supervised fine-structured monocular depth estimation[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2024: 5713-5721.
[29] BLINDER D, BIRNBAUM T, ITO T, et al. The state-of-the-art in computer generated holography for 3D display[J]. Light: Advanced Manufacturing, 3, 572-600(2022).
[36] ZHANG H, SHEN C, LI Y, et al. Exploiting temporal consistency for real-time video depth estimation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 1725-1734.
[41] SCHÖNBERGER J L, FRAHM J. Structure-from-motion revisited[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2016: 4104-4113.
[44] ZHANG N, NEX F, VOSSELMAN G, et al. Lite-Mono: A lightweight CNN and transformer architecture for self-supervised monocular depth estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023: 18537-18546.
[47] XIAN Ke. Monocular depth prediction: algorithms and applications[D]. Wuhan: Huazhong University of Science and Technology, 2021. (in Chinese)
[48] LYU X, LIU L, WANG M, et al. HR-Depth: High resolution self-supervised monocular depth estimation[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2021, 35(3): 2294-2301.
[49] EIGEN D, PUHRSCH C, FERGUS R, et al. Depth map prediction from a single image using a multi-scale deep network[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems, 2014, 2: 2366-2374.
[50] EIGEN D, FERGUS R. Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2015: 2650-2658.
[51] LIU F Y, SHEN C H, LIN G S. Deep convolutional neural fields for depth estimation from a single image[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2015: 5162-5170.
[52] LAINA I, RUPPRECHT C, BELAGIANNIS V, et al. Deeper depth prediction with fully convolutional residual networks[C]//Proceedings of the International Conference on 3D Vision, 2016: 239-248.
[53] PATIL V, SAKARIDIS C, LINIGER A, et al. P3Depth: Monocular depth estimation with a piecewise planarity prior[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 1610-1621.
[55] GARG R, KUMAR B G V, CARNEIRO G, et al. Unsupervised CNN for single view depth estimation: Geometry to the rescue[C]//Proceedings of the European Conference on Computer Vision, 2016: 740-756.
[56] ZHOU T, BROWN M, SNAVELY N, et al. Unsupervised learning of depth and ego-motion from video[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2017: 6612-6619.
[57] GODARD C, AODHA O M, FIRMAN M, et al. Digging into self-supervised monocular depth estimation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 3827-3837.
[58] BIAN J W, LI Z, WANG N, et al. Unsupervised scale-consistent depth and ego-motion learning from monocular video[C]//Proceedings of the International Conference on Neural Information Processing Systems, 2019, 4: 35-45.
[61] AI H, CAO Z, CAO Y P, et al. HRDFuse: Monocular 360° depth estimation by collaboratively learning holistic-with-regional depth distributions[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023: 13273-13282.
[63] MING Y, HONG K, WEI Q, et al. Two-stage enhancement network for monocular depth estimation of indoor scenes[C]//Proceedings of the IEEE International Conference on Signal Processing, 2024: 495-498.
[64] CHEN R, LUO H, ZHAO F, et al. Structure-centric robust monocular depth estimation via knowledge distillation[C]//Proceedings of the ACM Asian Conference on Computer Vision, 2024: 123-140.
[65] WANG Y, LI X, SHI M, et al. Knowledge distillation for fast and accurate monocular depth estimation on mobile devices[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 2457-2465.
[66] ZHOU Z, DONG Q. Two-in-one depth: Bridging the gap between monocular and binocular self-supervised depth estimation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023: 9377-9387.
[67] XU Y, YANG X, YU Y, et al. Depth estimation by combining binocular stereo and monocular structured-light[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 1746-1755.
[78] CHAKRAVARTHULA P, PENG Y, KOLLIN J, et al. Wirtinger holography for near-eye displays[J]. ACM Transactions on Graphics, 38, 213(2019).
[88] PENG Y, CHOI S, PADMANABAN N, et al. Neural holography with camera-in-the-loop training[J]. ACM Transactions on Graphics, 39, 185(2020).
[92] YAN X, LIU X, LI J, et al. Generating multi-depth 3D holograms using a fully convolutional neural network[J]. Advanced Science, 11, 2308886(2024).
[95] HE Z, SUI X, JIN G, et al. Optimal quantization for amplitude and phase in computer-generated holography[J]. Optics Express, 29, 119-133(2020).
[101] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2016: 770-778.
[102] HE K, ZHANG X, REN S, et al. Identity mappings in deep residual networks[C]//Proceedings of the European Conference on Computer Vision, 2016: 630-645.
Zehao HE, Yunhui GAO, Liangcai CAO, Yan ZHANG. Deep learning empowers generation and presentation of virtual-real fusion scenarios in holographic metaverse: development and prospects (invited)[J]. Infrared and Laser Engineering, 2025, 54(7): 20250189
Category: Special issue—Advanced display technology and applications
Received: Mar. 25, 2025
Accepted: --
Published Online: Aug. 29, 2025
The Author Email: Yan ZHANG (yzhang@cnu.edu.cn)