Advanced Photonics, Volume. 7, Issue 5, 056005(2025)

Real-time all-directional 3D recognition and multidistortion correction via prior diffraction neural networks

Min Huang1,2, Bin Zheng1,3,4, Ruichen Li1, Yijun Zou1, Xiaofeng Li5, Chao Qian6, Huan Lu1,4, Rongrong Zhu1,7、*, and Hongsheng Chen3,4,6、*
Author Affiliations
  • 1Zhejiang University, Interdisciplinary Center for Quantum Information, State Key Laboratory of Modern Optical Instrumentation, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
  • 2National University of Defense Technology, College of Electronic Science and Technology, Changsha, China
  • 3Zhejiang University, International Joint Innovation Center, Key Laboratory of Advanced Micro/Nano Electronic Devices & Smart Systems of Zhejiang, The Electromagnetics Academy at Zhejiang University, Haining, China
  • 4Zhejiang University, Jinhua Institute of Zhejiang University, Jinhua, China
  • 5Air Force Engineering University, Air and Missile Defense College, Xi’an, China
  • 6Zhejiang University, ZJU-UIUC Institute, Interdisciplinary Center for Quantum Information, State Key Laboratory of Extreme Photonics and Instrumentation, Hangzhou, China
  • 7Hangzhou City University, School of Information and Electrical Engineering, Zhejiang, China
  • show less
    Figures & Tables(5)
    Application of a real-time all-directional 3D recognition multiple distortion correction system in automatic driving. The metasurface array supported by the deep knowledge prior diffraction neural network can recognize obstacles or people with different poses in a large field of view.
    Design of prior diffraction neural network. (a) Working process of the correction–recognition diffraction neural network based on deep knowledge prior. (b) Detailed network model of prior diffraction neural network.
    Dynamic and static three-dimensional recognition in the actual scene. (a) The camera lens is not parallel to the picture, resulting in perspective distortion. Moreover, the object has a different degree of plane distortion due to the inappropriate shooting angle. (b) Specific plane distortions caused by subjective and objective causes. Under nonideal shooting conditions, the picture has single or double plane random variation. (c) Different attitudes of the aircraft from the same perspective. The superimposed random distortion of the plane and three-dimensional space caused by the shooting angle. (d) Aircraft at different attitudes observed from different viewing angles. (e) To simulate the identification of aircraft by observers in the actual scene, the aircraft with different attitudes are recognized by the diffraction system in the experiment. (f) The training results of the deep phase generation network show that the phase loss of each metasurface is close to 0. (g) The loss of the training and test sets versus epoch. After thousands of epochs, the correction–recognition integrated system achieves an accuracy rate of 97.7% and 94.6% for the training and test sets, respectively. (h) The training results for the conventional diffraction neural network over the epoch. It is in an underfitting state, and the accuracy is only 55.2% for the test set.
    Experimental results of multiple distortion correction and static aircraft real-time recognition. (a) Experimental setup for planar and three-dimensional distortion correction and three-dimensional recognition. The aircraft is suspended on a graduated rack to simulate different flight attitudes. (b), (c) Experimental results of image restoration with double random distortion, including stereoscopic distortion. (d) The experimental recognition results are identified according to the diffraction distribution under different attitudes. The orientation of the aircraft is demonstrated and labeled with (θ,φ) in the left coordinate system.
    Experimental demonstration of dynamic three-dimensional recognition. (a) Measured electric field distribution with dynamic left-right steering (φ change) (Video 1, MP4, 1.92 MB [URL: https://doi.org/10.1117/1.AP.7.5.056005.s1]). (b) Measured electric field distribution with dynamic forward and backward tilt (θ change) (Video 2, MP4, 1.70 MB [URL: https://doi.org/10.1117/1.AP.7.5.056005.s2]). (c) Measured electric field distribution with dynamic movement in all directions (Video 3, MP4, 1.84 MB [URL: https://doi.org/10.1117/1.AP.7.5.056005.s3]).
    Tools

    Get Citation

    Copy Citation Text

    Min Huang, Bin Zheng, Ruichen Li, Yijun Zou, Xiaofeng Li, Chao Qian, Huan Lu, Rongrong Zhu, Hongsheng Chen, "Real-time all-directional 3D recognition and multidistortion correction via prior diffraction neural networks," Adv. Photon. 7, 056005 (2025)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Research Articles

    Received: Jan. 22, 2025

    Accepted: Jul. 15, 2025

    Published Online: Aug. 21, 2025

    The Author Email: Rongrong Zhu (rorozhu@hzcu.edu.cn), Hongsheng Chen (hansomchen@zju.edu.cn)

    DOI:10.1117/1.AP.7.5.056005

    CSTR:32187.14.1.AP.7.5.056005

    Topics