Technological Transformations in Optical Perception: From Encoding to Computing (Invited)

Caihua Zhang; Zheng Huang; Conghe Wang; Shukai Wu; Tuo Li; Kejian Zhu; Hongwei Chen

doi:10.3788/LOP251303

Laser & Optoelectronics Progress, Volume. 62, Issue 17, 1739013(2025)

Technological Transformations in Optical Perception: From Encoding to Computing (Invited)

Caihua Zhang^1,2, Zheng Huang^1,2, Conghe Wang^1,2, Shukai Wu^1,2, Tuo Li³, Kejian Zhu³, and Hongwei Chen^1,2、*

Author Affiliations

¹Department of Electronic Engineering, Tsinghua University, Beijing 100084, China

²Beijing National Research Center for Information Science and Technology, Beijing 100084, China

³Shandong Yunhai Guochuang Cloud Computing Equipment Industry Innovation Co., Ltd., Jinan 250013, Shandong , China

show less

Abstract Get PDF(in Chinese)

Figures & Tables(19)

Fig. 1. Technological transformations in optical perception

Download full size

Fig. 2. Basic architecture of optical perception

Download full size

Fig. 3. Workflow of optical encoding technology

Download full size

Fig. 4. Coded optical imaging for functional expansion. (a) Rainbow 3D camera^[15]; (b) scanning spectral imaging approaches^[18]; (c) snapshot colored compressive spectral imager^[19]; (d) single-dispersive-element coded aperture snapshot spectral imaging system^[20]; (e) programmable pixel compressive camera^[21]; (f) snapshot spatial-temporal compressive imaging system incorporating polarization information^[22]

Download full size

Fig. 5. Coded optical imaging for performance enhancement. (a) Single-pixel terahertz imaging system^[25]; (b) principle and microscope of structured illumination microscopy^[26-27]; (c) Fourier ptychographic imaging system^[28]; (d) action recognition pipeline based on optical pixel-wise encoding^[29]

Download full size

Fig. 6. Optical pre-sensing computing creates new architectures for intelligent perception

Download full size

Fig. 7. Mathematical model of a neuron^[34]

Download full size

$Linear computing in diffractive optical neural networks based on phase masks or SLMs. (a) All-optical diffractive deep neural networks[36]; (b) reconfigurable diffractive processing unit[37]$

Fig. 8. Linear computing in diffractive optical neural networks based on phase masks or SLMs. (a) All-optical diffractive deep neural networks^[36]; (b) reconfigurable diffractive processing unit^[37]

Download full size

$Linear computing in diffractive optical neural networks based on metasurfaces. (a) Multifunctional metasurface-based diffractive neural networks[38]; (b) programmable diffractive deep neural network based on a digital-coding metasurface array[39]; (c) metasurface diffractive optical neural network for simulating human-level decision-making and control[40]$

Fig. 9. Linear computing in diffractive optical neural networks based on metasurfaces. (a) Multifunctional metasurface-based diffractive neural networks^[38]; (b) programmable diffractive deep neural network based on a digital-coding metasurface array^[39]; (c) metasurface diffractive optical neural network for simulating human-level decision-making and control^[40]

Download full size

Fig. 10. Linear computing realized via 4f optical system. (a) Convolution computing based on phase modulation in 4f system^[41];(b) convolution computing based on amplitude modulation in 4f system^[42]; (c) optical pooling based on 4f system^[43]

Download full size

Fig. 11. Simplified machine vision based on incoherent light amplitude modulation. (a) Lensless opto-electronic neural network^[44]; (b) face recognition system with a mask-encoded microlens array^[46]

Download full size

Fig. 12. Multilayer pre-sensing computing based on nonlinear activation. (a) Multilayer fully connected computational architecture based on nonlinear activation^[51]; (b) multilayer convolutional computational architecture based on nonlinear activation^[52]

Download full size

Fig. 13. Training methods for hardware parameters in optical neural networks^[54]

Download full size

Fig. 14. Training architecture for optical neural networks based on forward-forward algorithm^[54]

Download full size

Fig. 15. Realization of optical nonlinearity and optical compression via multiple scattering^[59]

Download full size

Fig. 16. Intelligent recognition systems based on phase modulation in incoherent light scenarios. (a) Privacy-preserving facial depression recognition technology^[60]; (b) privacy-preserving scene description system^[61]

Download full size

Fig. 17. Metasurface folded lens system for ultrathin cameras^[62]

Download full size

Table 1. Comparison between sensing system and cognitive system
View table
Table 1. Comparison between sensing system and cognitive system
Item Sensing Cognition
Purpose Record the scene Understand the scene
Mean Optical system Computing system
Standard Fidelity Accuracy and intelligence

Table 2. Typical artificial neural networks in machine vision field

View table

Table 2. Typical artificial neural networks in machine vision field

Network type	Core structural feature	Typical application scenario
CNN	Local perception and weight sharing	Image classification， object detection， semantic segmentation
RNN	Temporal connection and memory units	Video action recognition， object tracking， dynamic image flow prediction
Transformer	Self-attention mechanism	Image classification， end-to-end object detection， long-sequence visual understanding
GAN	Generator-discriminator adversarial training	Image super-resolution， style transfer， defect simulation generation， data augmentation

Tools

Get Citation

Copy Citation Text

Caihua Zhang, Zheng Huang, Conghe Wang, Shukai Wu, Tuo Li, Kejian Zhu, Hongwei Chen. Technological Transformations in Optical Perception: From Encoding to Computing (Invited)[J]. Laser & Optoelectronics Progress, 2025, 62(17): 1739013

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites