Laser & Optoelectronics Progress, Volume. 62, Issue 17, 1739013(2025)

Technological Transformations in Optical Perception: From Encoding to Computing (Invited)

Caihua Zhang1,2, Zheng Huang1,2, Conghe Wang1,2, Shukai Wu1,2, Tuo Li3, Kejian Zhu3, and Hongwei Chen1,2、*
Author Affiliations
  • 1Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
  • 2Beijing National Research Center for Information Science and Technology, Beijing 100084, China
  • 3Shandong Yunhai Guochuang Cloud Computing Equipment Industry Innovation Co., Ltd., Jinan 250013, Shandong , China
  • show less
    Figures & Tables(19)
    Technological transformations in optical perception
    Basic architecture of optical perception
    Workflow of optical encoding technology
    Coded optical imaging for functional expansion. (a) Rainbow 3D camera[15]; (b) scanning spectral imaging approaches[18]; (c) snapshot colored compressive spectral imager[19]; (d) single-dispersive-element coded aperture snapshot spectral imaging system[20]; (e) programmable pixel compressive camera[21]; (f) snapshot spatial-temporal compressive imaging system incorporating polarization information[22]
    Coded optical imaging for performance enhancement. (a) Single-pixel terahertz imaging system[25]; (b) principle and microscope of structured illumination microscopy[26-27]; (c) Fourier ptychographic imaging system[28]; (d) action recognition pipeline based on optical pixel-wise encoding[29]
    Optical pre-sensing computing creates new architectures for intelligent perception
    Mathematical model of a neuron[34]
    Linear computing in diffractive optical neural networks based on phase masks or SLMs. (a) All-optical diffractive deep neural networks[36]; (b) reconfigurable diffractive processing unit[37]
    Linear computing in diffractive optical neural networks based on metasurfaces. (a) Multifunctional metasurface-based diffractive neural networks[38]; (b) programmable diffractive deep neural network based on a digital-coding metasurface array[39]; (c) metasurface diffractive optical neural network for simulating human-level decision-making and control[40]
    Linear computing realized via 4f optical system. (a) Convolution computing based on phase modulation in 4f system[41];(b) convolution computing based on amplitude modulation in 4f system[42]; (c) optical pooling based on 4f system[43]
    Simplified machine vision based on incoherent light amplitude modulation. (a) Lensless opto-electronic neural network[44]; (b) face recognition system with a mask-encoded microlens array[46]
    Multilayer pre-sensing computing based on nonlinear activation. (a) Multilayer fully connected computational architecture based on nonlinear activation[51]; (b) multilayer convolutional computational architecture based on nonlinear activation[52]
    Training methods for hardware parameters in optical neural networks[54]
    Training architecture for optical neural networks based on forward-forward algorithm[54]
    Realization of optical nonlinearity and optical compression via multiple scattering[59]
    Intelligent recognition systems based on phase modulation in incoherent light scenarios. (a) Privacy-preserving facial depression recognition technology[60]; (b) privacy-preserving scene description system[61]
    Metasurface folded lens system for ultrathin cameras[62]
    • Table 1. Comparison between sensing system and cognitive system

      View table

      Table 1. Comparison between sensing system and cognitive system

      ItemSensingCognition
      PurposeRecord the sceneUnderstand the scene
      MeanOptical systemComputing system
      StandardFidelityAccuracy and intelligence
    • Table 2. Typical artificial neural networks in machine vision field

      View table

      Table 2. Typical artificial neural networks in machine vision field

      Network typeCore structural featureTypical application scenario
      CNNLocal perception and weight sharingImage classification, object detection, semantic segmentation
      RNNTemporal connection and memory unitsVideo action recognition, object tracking, dynamic image flow prediction
      TransformerSelf-attention mechanismImage classification, end-to-end object detection, long-sequence visual understanding
      GANGenerator-discriminator adversarial trainingImage super-resolution, style transfer, defect simulation generation, data augmentation
    Tools

    Get Citation

    Copy Citation Text

    Caihua Zhang, Zheng Huang, Conghe Wang, Shukai Wu, Tuo Li, Kejian Zhu, Hongwei Chen. Technological Transformations in Optical Perception: From Encoding to Computing (Invited)[J]. Laser & Optoelectronics Progress, 2025, 62(17): 1739013

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: AI for Optics

    Received: May. 22, 2025

    Accepted: Jul. 2, 2025

    Published Online: Sep. 16, 2025

    The Author Email: Hongwei Chen (chenhw@tsinghua.edu.cn)

    DOI:10.3788/LOP251303

    CSTR:32186.14.LOP251303

    Topics