Acta Optica Sinica, Volume. 45, Issue 3, 0320001(2025)
Design and Analysis of Optical Extreme Learning Machine Based on Free Space Propagation
In the past few decades, artificial intelligence (AI) algorithms have been applied in various fields. Among them, neural network algorithms have become the common paradigm of modern AI and have achieved remarkable achievements in image recognition, natural language processing, speech recognition, and recommendation systems. However, the training process of these digital neural networks demands a large amount of time and energy. Therefore, optical computing solutions, with their multidimensional, high-speed, and low-energy advantages, have become a popular research area in AI applications. Extreme learning machine (ELM) is a machine learning paradigm where most connections in the model are established through randomly initialized nonlinear hidden nodes, and only a small part of the weights are adjusted during training after down-sampling. The advantage of this model is that it significantly reduces the training time as it replaces the time-consuming backpropagation with simple ridge regression. Nevertheless, in digital ELMs, the model performance heavily depends on the number of nodes in the hidden layer, which may lead to large memory consumption. To address this problem, we propose an optical extreme learning machine (optical ELM) by implementing random projections in the free-space propagation and investigating the effect of defocus on optical ELM. The optical aberrations, errors, and defects existing in the experimental process act as random components in the optical ELM, corresponding to the random transmission matrix in the digital ELM. This approach enables the passive realization of a large-scale hidden layer. We aim to design an easily deployable optical ELM that does not require complex processing of input and output data but achieves random projection through passive propagation in the optical domain. This method intends to simplify the system architecture while taking advantage of optical technology to achieve efficient parallel computing.
Fig. 1a shows the architecture of the digital ELM, defining the random projection process in the digital ELM as a randomly generated transmission matrix
Figs. 3 and 4 show the simulation results for the digital ELM. First, under random projection, increasing both the input size and the number of hidden nodes remarkably improves ELM performance. Second, for high-resolution images, down-sampling in the optical domain is a more effective way to reduce computational burden. Figs. 5 and 6 illustrate the experimental results of the optical ELM. It shows that the utilization of the inherent random aberrations and errors of optical experiments and the nonlinear response of the camera in the framework is effective and can provide the necessary random mapping for optical ELM. Increasing the number of hidden nodes is related to the improvement of model performance. However, the propagation distance (PD) has a minimal impact on the model’s performance.
We present a framework for optical ELM and provide a detailed analysis and experimentation. The experimental results show that the optical ELM can achieve a certain classification accuracy under specific parameter settings. By using the inherent random aberrations and defects during free-space propagation and the nonlinear response of the camera, this approach replaces the time-consuming and energy-intensive random mapping process used in digital ELM, thus enhancing hardware efficiency. This study validates the effectiveness of optical transmission in passively processing large-scale image data. Compared with other complex systems, this design only requires a simple deployment of optical pathways while achieving the initial research goal: to design an easily deployable optical ELM that does not require complex processing of input data. However, there is still room for improvement in this experiment. Thus, Fig. 7 shows a potential improvement scheme by introducing a nonlinear scattering medium during free-space propagation, which can provide the model with more randomness and nonlinear effects, thereby enhancing model performance.
Get Citation
Copy Citation Text
Zhihong Xu, Schoenhardt Steffen, Xi Chen, Min Gu, Goi Elena. Design and Analysis of Optical Extreme Learning Machine Based on Free Space Propagation[J]. Acta Optica Sinica, 2025, 45(3): 0320001
Category: Optics in Computing
Received: Oct. 21, 2024
Accepted: Nov. 14, 2024
Published Online: Feb. 20, 2025
The Author Email: Goi Elena (elenagoi@usst.edu.cn)
CSTR:32393.14.AOS241671