Advanced Photonics Nexus, Volume 2, Issue 6, 066006 (2023)

Advanced all-optical classification using orbital-angular-momentum-encoded diffractive networks
Editors' Pick

Kuo Zhang1,†, Kun Liao2, Haohang Cheng1, Shuai Feng1,*, and Xiaoyong Hu2,3,4,*
Author Affiliations
  • 1Minzu University of China, School of Science, Beijing, China
  • 2Peking University, Collaborative Innovation Center of Quantum Matter, Nano-Optoelectronics Frontier Center of Ministry of Education, State Key Laboratory for Mesoscopic Physics, Department of Physics, Beijing, China
  • 3Shanxi University, Collaborative Innovation Center of Extreme Optics, Taiyuan, China
  • 4Peking University Yangtze Delta Institute of Optoelectronics, Nantong, China
    Figures & Tables (9)
    Figure 1. Schematic diagrams of the three types of OAM-encoded D2NN. The OAM beams illuminating the digits are multiplexed from 10 OAM modes ranging from −5 to +5 in equal proportions. The red numbers denote the topological charges of the OAM modes, while the black numbers in brackets give the digits assigned to those modes. The digit inputs are illuminated by the multiplexed OAM beams, and the predicted OAM beams are obtained in the output plane after modulation by the OAM-encoded D2NNs. The OAM spectra of the output beams are shown to the right of the output plane. The three configurations are: (a) single-detector OAM-encoded D2NN for single-task classification, (b) single-detector OAM-encoded D2NN for multitask classification, and (c) multidetector OAM-encoded D2NN for multitask classification.
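As a rough, hedged illustration of the encoding step in this schematic, the Python sketch below builds an equal-weight superposition of vortex phases for 10 topological charges between −5 and +5 (charge 0 excluded here, an assumption consistent with a 10-mode set) and applies a digit image as an amplitude mask. The grid size, beam envelope, and mask-loading helper are illustrative choices, not the paper's parameters.

```python
import numpy as np

# Equal-proportion multiplexing of 10 vortex modes on a Gaussian envelope
# (illustrative parameters only).
N = 200                                    # grid points per side (assumed)
x = np.linspace(-1.0, 1.0, N)              # normalized transverse coordinates
X, Y = np.meshgrid(x, x)
r = np.hypot(X, Y)
phi = np.arctan2(Y, X)

charges = [l for l in range((-5), 6) if l != 0]     # 10 topological charges
envelope = np.exp(-(r / 0.5) ** 2)                  # Gaussian envelope (waist assumed)
beam = envelope * sum(np.exp(1j * l * phi) for l in charges) / np.sqrt(len(charges))

# The digit acts as an amplitude mask on the multiplexed beam:
# digit_mask = load_mnist_digit(...)                # hypothetical helper, N x N array
# input_field = beam * digit_mask
```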
    Figure 2. (a) The amplitude and phase distributions of the OAM beams in the input plane, the diffractive layers, and the output plane. The input image is a handwritten digit “1” encoded with the +2 OAM mode. (b) Schematic of the light-field modulation by the single-detector OAM-encoded D2NN. (c) The OAM spectrum of the output OAM beam. The red bar, corresponding to the OAM mode with the highest normalized intensity, indicates the inferred category of the input digit. (d) The loss and accuracy curves for the training and test sets. Three simulations were run, and their results are shown as dashed lines; the solid lines are the averages of the three curves. (e) The confusion matrix summarizing the numerical classification results on the test set, giving an overview of how well the single-detector OAM-encoded D2NN recognizes handwritten digits from the MNIST data set.
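The single-detector readout described in panel (c) can be mimicked numerically by projecting the output field onto vortex reference modes and picking the charge with the highest normalized power. The sketch below is a simplified proxy for an OAM-spectrum measurement (coherent overlap with uniform-amplitude vortex modes); the paper's exact detection scheme may differ.

```python
import numpy as np

def oam_spectrum(field, x, charges=tuple(range(-5, 6))):
    """Normalized power coupled from a sampled complex field into vortex reference
    modes exp(i*l*phi); a simplified stand-in for a full OAM-spectrum measurement."""
    X, Y = np.meshgrid(x, x)
    phi = np.arctan2(Y, X)
    powers = np.array([np.abs(np.sum(field * np.exp(-1j * l * phi))) ** 2
                       for l in charges])
    return powers / powers.sum()

# Hypothetical usage: the strongest mode gives the predicted class.
# spectrum = oam_spectrum(output_field, x)
# predicted_charge = list(range(-5, 6))[int(np.argmax(spectrum))]
```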
    Figure 3. (a) The amplitude and phase distributions of the OAM beams in the input plane, diffractive layers, and output plane. The input handwritten digits are “7” and “0,” which correspond to the multiplexed OAM beams producing the −3 and +1 OAM modes. (b) Schematic of the light-field modulation by the single-detector OAM-encoded D2NN for multitask classification. The input OAM beam encodes two handwritten digits; after modulation by the OAM-encoded D2NN, it produces a new OAM beam carrying two modes at the same spatial location. (c) The OAM spectrum of the output OAM beams. The two OAM modes with the highest normalized intensities at the detector represent the predicted categories of the input digits and are marked by the red bars. (d) Loss and accuracy during training and testing. Solid lines are the averages of the three curves (dashed lines) from three simulations. (e) The confusion matrix summarizing the numerical classification results on the test set.
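For this multitask readout, the same spectrum can simply be decoded with the two strongest modes instead of one. A minimal sketch, assuming a spectrum helper like the one after Figure 2 and an arbitrary (assumed) mode-to-digit mapping:

```python
import numpy as np

# Two-digit readout from a single spectrum (illustrative): compute the spectrum over
# the 10 encoding modes (charge 0 excluded, matching the schematic) and keep the two
# strongest. The mode-to-digit assignment below is an assumption, not the paper's.
encoding_charges = np.array([l for l in range(-5, 6) if l != 0])
mode_to_digit = {int(l): d for d, l in enumerate(encoding_charges)}

def decode_two_digits(spectrum):
    """spectrum: normalized power per charge in `encoding_charges` (same order)."""
    top2 = encoding_charges[np.argsort(spectrum)[-2:]]
    return sorted(mode_to_digit[int(l)] for l in top2)
```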
    Figure 4. (a) From top to bottom, the multidetector OAM-encoded D2NNs recognizing two, three, and four digits, respectively. The amplitude and phase distributions of the OAM beams are shown in the input plane, diffractive layers, and output plane. (b) Schematic of the light-field modulation by the four-detector OAM-encoded D2NN for multitask classification. Each input OAM beam at a different position encodes a single digit and generates the corresponding OAM mode at the output, which is detected by a detector at a fixed position. (c) The OAM spectra of the output OAM beams. The two blue OAM spectra correspond, from top to bottom, to the OAM beams generated by the two-detector OAM-encoded D2NN. The green OAM spectrum in the first row corresponds to the separate OAM beam in the first row of the three-detector OAM-encoded D2NN, and the green OAM spectra in the second and third rows correspond to the two OAM beams, from left to right, in its second row. The four red OAM spectra are ordered from left to right and from top to bottom.
    Figure 5. (a) The loss and accuracy curves of the two-detector, three-detector, and four-detector OAM-encoded D2NNs during training and testing, arranged from left to right. The solid lines are the averages of the curves from the three simulations (dashed lines). Their average test-set accuracies are 70.94%, 52.41%, and 40.13%, respectively. (b) Confusion matrices of the three multidetector OAM-encoded D2NNs, summarizing the numerical classification results on the test set. Because the confusion matrices of the three-detector and four-detector OAM-encoded D2NNs contain many entries, they are shown at reduced size with zoomed-in insets of local regions.
    Figure 6. The different colored curves represent different diffractive networks, as indicated in the boxed legend in the lower-left corner. (a) Deviation of the pixel size and the layer spacing. The horizontal axis spans errors from 0.8 to 1.2 times the nominal pixel size and the corresponding layer spacing. (b) Analysis of object misalignment in the horizontal and vertical directions. (c) Analysis of layer misalignment. The left panel shows a random misalignment error of 5% for each layer, and the right panel a random misalignment error of 10% for each layer.
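One way the layer-misalignment test in panel (c) could be reproduced numerically is to shift each trained phase layer laterally by a random amount within 5% or 10% of its aperture before re-evaluating the network. The sketch below is a hedged illustration; the caption does not spell out the authors' exact error model.

```python
import numpy as np

def misalign(layers, max_frac, seed=0):
    """Shift each (N, N) phase layer by a random integer offset within +/- max_frac*N
    along both axes (a simple circular shift), to mimic lateral layer misalignment."""
    rng = np.random.default_rng(seed)
    shifted = []
    for layer in layers:
        n = layer.shape[0]
        dy, dx = rng.integers(-int(max_frac * n), int(max_frac * n) + 1, size=2)
        shifted.append(np.roll(layer, shift=(dy, dx), axis=(0, 1)))
    return shifted

# layers_5pct  = misalign(trained_layers, 0.05)   # 5% random misalignment per layer
# layers_10pct = misalign(trained_layers, 0.10)   # 10% random misalignment per layer
```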
    Figure 7. (a) The left panel shows the geometrical model of the five-layer D2NN with 50×50 pixels, and the right panel shows the mask model of the digit “9” illuminated by the OAM beam. (b) Simulation of the incident OAM beam. (c) Simulation of the output plane of a one-layer D2NN with 30×30 pixels. (b), (c) From left to right: the amplitude distribution simulated with Python, the amplitude distribution simulated with COMSOL Multiphysics, the phase distribution simulated with Python, and the phase distribution simulated with COMSOL Multiphysics.
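The Python results referenced in panels (b) and (c) presumably come from a scalar-diffraction forward pass through the phase layers; below is a minimal angular-spectrum sketch of such a pass. The wavelength, pixel pitch, and layer spacing are placeholders, and the authors may use a different propagation kernel, so treat this as an illustration rather than their implementation.

```python
import numpy as np

def angular_spectrum_propagate(field, wavelength, pitch, distance):
    """Free-space propagation of a sampled complex field by the angular-spectrum method."""
    n = field.shape[0]
    fx = np.fft.fftfreq(n, d=pitch)
    FX, FY = np.meshgrid(fx, fx)
    kz_sq = (1.0 / wavelength) ** 2 - FX ** 2 - FY ** 2
    kz = 2 * np.pi * np.sqrt(np.maximum(kz_sq, 0.0))
    H = np.exp(1j * kz * distance) * (kz_sq > 0)      # evanescent components dropped
    return np.fft.ifft2(np.fft.fft2(field) * H)

def d2nn_forward(field, phase_layers, wavelength, pitch, spacing):
    """Alternate free-space propagation and phase-only modulation, layer by layer,
    then propagate once more to the output plane."""
    for phase in phase_layers:
        field = angular_spectrum_propagate(field, wavelength, pitch, spacing)
        field = field * np.exp(1j * phase)
    return angular_spectrum_propagate(field, wavelength, pitch, spacing)
```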
    • Table 1. Comparison with other D2NNs using more than three degrees of freedom.


      Reference | Degree of freedom | Footprint | Function | Performance | Parallel classification | Single detector
      This work | OAM | 164.3 μm × 164.3 μm | Image recognition | Accuracy: 85.49% | Yes | Yes
      11 | — | 8 cm × 8 cm | Image recognition | Accuracy: 93.39% | No | No
      15 | Wavelength | 8 cm × 8 cm | Image recognition | Accuracy: 91.29% (84.02%)ᵃ | No | Yes
      38 | Wavelength | 6 cm × 6 cm | Image recognition | Accuracy: 87.74% | No | Yes
      39 | Wavelength | 0.8 mm × 0.8 mm | Image recognition | Accuracies of the four tasks are 92.8%, 83.0%, 81.0%, and 90.4%, respectively | Yes | No
      48 | Wavelength | 88.2 μm × 88.2 μm | Multispectral imaging | Filter transmission efficiency: >79% | — | —
      49 | Wavelength | 5 cm × 5 cm | Spectral filters | Processes optical waves over a continuous, wide range of frequencies | — | —
      16 | Polarization | 11.2 μm × 11.2 μm | Image recognition | Accuracy: 93.75% | Yes | No
      50 | Polarization | 24λ × 24λ | Linear transformations | Performs multiple complex-valued, arbitrary linear transformations using polarization multiplexing | — | —
      42 | OAM | 3 cm × 3 cm | Logic operation | Proposed an OAM logical operation | — | —
      61 | OAM | 3 cm × 3 cm | Optical communication | Diffraction efficiency and mode conversion purity: >96%; bit error rates: <10⁻⁴ | — | —
      64 | OAM | 2.5 μm × 2.5 μm | Holography | 10 multiplexed OAM modes among five spatial depths in deep multiplexing holography | — | —
      66 | OAM | 100λ × 100λ | Spectral detection | Optical operations/electronic operations: 10³ | — | —
    • Table 2. Various indices for the single-detector OAM-encoded D2NN for single-task classification (S-OAM-encoded D2NN-S), the single-detector OAM-encoded D2NN for multitask classification (S-OAM-encoded D2NN-M), and the multidetector OAM-encoded D2NN for repeatable multitask classification (M-OAM-encoded D2NN-M).


      Network | Training time (h) | Training loss | Training accuracy (%) | Test loss | Test accuracy (%)
      S-OAM-encoded D2NN-S | 12.74 | 0.402 | 84.30 | 0.343 | 85.43
      S-OAM-encoded D2NN-M | 5.69 | 0.708 | 57.42 | 0.667 | 64.13
      M-OAM-encoded D2NN-M(2) | 6.04 | 0.820 | 67.69 | 0.772 | 70.94
      M-OAM-encoded D2NN-M(3) | 4.09 | 1.345 | 48.94 | 1.238 | 52.41
      M-OAM-encoded D2NN-M(4) | 3.19 | 1.970 | 36.25 | 1.932 | 40.13
    Citation

    Kuo Zhang, Kun Liao, Haohang Cheng, Shuai Feng, Xiaoyong Hu, "Advanced all-optical classification using orbital-angular-momentum-encoded diffractive networks," Adv. Photon. Nexus 2, 066006 (2023)

    Paper Information

    Category: Research Articles

    Received: Jun. 18, 2023

    Accepted: Nov. 6, 2023

    Published Online: Nov. 27, 2023

    Corresponding authors: Shuai Feng (fengshuai75@163.com), Xiaoyong Hu (xiaoyonghu@pku.edu.cn)

    DOI: 10.1117/1.APN.2.6.066006
