Photonics Research, Volume. 12, Issue 12, 2747(2024)

Physics-aware cross-domain fusion aids learning-driven computer-generated holography

Ganzhangqin Yuan1、†, Mi Zhou1、†, Fei Liu2, Mu Ku Chen3, Kui Jiang4, Yifan Peng5,6, and Zihan Geng1、*
Author Affiliations
  • 1Institute of Data and Information, Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518071, China
  • 2School of Optoelectronic Engineering, Xidian University, Xi’an 710071, China
  • 3Department of Electrical Engineering, City University of Hong Kong, Hong Kong SAR, China
  • 4School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
  • 5Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong SAR, China
  • 6Department of Computer Science, The University of Hong Kong, Hong Kong SAR, China
  • show less
    Figures & Tables(14)
    (a) Workflow of conventional methods based on the vanilla UNet architecture. (b) Our proposed cross-domain fusion network highlights the role of our multi-stage conversion architecture and the infinity phase mapper (IPM) in accomplishing the cross-domain transformation task. The multi-stage conversion architecture employs multiple feature maps to facilitate a gradual transformation between two domains. Concurrently, the infinity phase mapper is designed to accommodate the periodic nature of phase values, ensuring the preservation of the physical consistency.
    Hologram generation workflow of our proposed method and the corresponding network architecture. The feature maps representing the various conversion stages are color-coded diversely. The first sub-network CDFN1 is used to predict the initial phase and the second sub-network CDFN2 is used to predict POHs.
    Schematic of infinity phase mapper (IPM) M, which maps infinite phase values ϕi to their corresponding points ϕo within the [−π,π] interval.
    Comparison of numerically reconstructed color images. From left to right: results of double phase-amplitude coding (DPAC), stochastic gradient descent method (SGD), HoloNet, CCNN, and our proposed CDFN, respectively (PSNR in dB).
    Comparison of simulated reconstruction images in our ablation study (green channel). From left to right: reconstruction results of HoloNet, multi-stage architecture (MS), multi-stage architecture with infinity phase mapper (MS w IPM), multi-stage architecture with deep supervision (MS w DS), multi-stage architecture with infinity phase mapper and deep supervision (MS w IPM&DS) (PSNR in dB).
    Normalized phase value distribution of a randomly picked POH. From left to right: results of HoloNet, CDFN without infinity phase mapper (CDFN w/o IPM), CDFN with infinity phase mapper (CDFN w IPM). Note that we shift the value in [−π,π] to [0,2π] in this figure for the easier observation of values around π. The arrows highlight that the IPM can generate a continuous phase value distribution around the π value, where the conventional active function generates a gap.
    Holographic display setup. (a) Schematic diagram of the optical display system setup. (b) Photograph of the optical display system setup.
    Optical reconstruction images in the green channel. From left to right: results of Gerchberg–Saxton algorithm (GS), double phase-amplitude coding (DPAC), HoloNet, CCNN, and CDFN, respectively.
    Optical reconstruction images from 1920×1072 pixels POHs on a 1920×1080 SLM in red, green, blue, and color channels. The images are directly captured by a camera. The color image is obtained by synchronizing the three-color laser source and sequentially loading different POHs. From left to right: results in red, green, blue, and color channels, respectively.
    Optical reconstruction images of our method in color channels directly captured by a camera and the corresponding phase-only holograms. (a) Cropped-zoomed patch of the first color image in (b) for visualization. (b) Captured color images. (c) The corresponding phase-only holograms of (b).
    Results of our method applied to binary images. (a) Simulated images. (b) Zoom-in patches. (c) Experimental images. Note that target binary images are not in our training set.
    • Table 1. Quantitative Results of Different Methods Tested on the DIV2K Testing Dataset, which Consists of 100 Images (Color Channels)a

      View table
      View in Article

      Table 1. Quantitative Results of Different Methods Tested on the DIV2K Testing Dataset, which Consists of 100 Images (Color Channels)a

      DPACSGDHoloNetCCNNCDFN (Ours)
      PSNR/SSIM19.97/0.68932.69/0.94229.87/0.92630.72/0.92731.68/0.944
    • Table 2. Efficiency Comparison for Three CGH Methods Tested on the DIV2K Testing Dataset, which Consists of 100 Images with a Resolution of 1920×1072 Pixelsa

      View table
      View in Article

      Table 2. Efficiency Comparison for Three CGH Methods Tested on the DIV2K Testing Dataset, which Consists of 100 Images with a Resolution of 1920×1072 Pixelsa

      SGDHoloNetCDFN (Ours)
      Time (s)16.8510.0100.012
      Parameter quantity2.87×1052.10×105
      FLOPs3.29×10113.39×1011
    • Table 3. Ablation Study with Average PSNR (dB) and SSIM Metrics on the DIV2K Testing Dataset, which Consists of 100 Images (Green Channel)

      View table
      View in Article

      Table 3. Ablation Study with Average PSNR (dB) and SSIM Metrics on the DIV2K Testing Dataset, which Consists of 100 Images (Green Channel)

      HoloNetMSMS w IPMMS w DSMS w IPM&DS
      PSNR/SSIM30.15/0.92630.88/0.93232.13/0.94831.65/0.94032.26/0.948
    Tools

    Get Citation

    Copy Citation Text

    Ganzhangqin Yuan, Mi Zhou, Fei Liu, Mu Ku Chen, Kui Jiang, Yifan Peng, Zihan Geng, "Physics-aware cross-domain fusion aids learning-driven computer-generated holography," Photonics Res. 12, 2747 (2024)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Holography, Gratings, and Diffraction

    Received: Apr. 19, 2024

    Accepted: Aug. 17, 2024

    Published Online: Nov. 12, 2024

    The Author Email: Zihan Geng (geng.zihan@sz.tsinghua.edu.cn)

    DOI:10.1364/PRJ.527405

    CSTR:32188.14.PRJ.527405

    Topics