Emerging innovations like tactile Internet [
Journal of the European Optical Society-Rapid Publications, Volume. 19, Issue 1, 2023020(2023)
Intelligent self calibration tool for adaptive few-mode fiber multiplexers using multiplane light conversion
Space division multiplexing (SDM) is promising to enhance capacity limits of optical networks. Among implementation options, few-mode fibres (FMFs) offer high efficiency gains in terms of integratability and throughput per volume. However, to achieve low insertion loss and low crosstalk, the beam launching should match the fiber modes precisely. We propose an all-optical data-driven technique based on multiplane light conversion (MPLC) and neural networks (NNs). By using a phase-only spatial light modulator (SLM), spatially separated input beams are transformed independently to coaxial output modes. Compared to conventional offline calculation of SLM phase masks, we employ an intelligent two-stage approach that considers knowledge of the experimental environment significantly reducing misalignment. First, a single-layer NN called Model-NN learns the beam propagation through the setup and provides a digital twin of the apparatus. Second, another single-layer NN called Actor-NN controls the model. As a result, SLM phase masks are predicted and employed in the experiment to shape an input beam to a target output. We show results on a single-passage configuration with intensity-only shaping. We achieve a correlation between experiment and network prediction of 0.65. Using programmable optical elements, our method allows the implementation of aberration correction and distortion compensation techniques, which enables secure high-capacity long-reach FMF-based communication systems by adaptive mode multiplexing devices.
1 Introduction
Emerging innovations like tactile Internet [
When launching data to FMF, those spatial modes that overlap with the incident input beam are excited [
In contrast, mode multiplexers based on MPLC [
In recent papers on MPLC [
In this work, we present a method to advance the mode shaping procedure by a smart data-driven online-calibration, where the entire MPLC apparatus is considered as black box. By training an NN, we create a digital twin of the setup called Model-NN. With the Model-NN, we gain knowledge about the experimental environment. Afterwards, we train another NN called Actor-NN that controls the model. This idea is inspired by the work published in Ref. [
2 Smart calibration for multiplane light conversion
A general scheme of an MPLC device is shown in
Figure 1.Optical MPLC setup considered in our work. Two spatially seperated Gaussian input spots are incident to the SLM. After two reflection passages between SLM and mirror (M), the output beams are imaged by a 4f-telescope onto a camera where intensity images are recorded. Both the FMFs facet and the camera are at a distance d ≈ 1.5 cm from the SLM. A smart calibration based on artificial intelligence (AI) is implemented to generate proper SLM phase masks in order to shape desired output modes.
Although the WMA provides a numerically correct solution to calculate the SLM phase masks, it is prone to errors that can barely be considered during offline calculation [
Figure 2.Structure of the envisioned NN architecture. (a) Training of a single-layer Model-NN. Training data consists of 2k uniformly distributed random SLM phase masks and the corresponding intensity images measured at the MPLC output. This generates a digital twin of the MPLC setup. (b) Another single-layer called Actor-NN is trained on 2k intensity images according to the EMNIST data set. The Actor-NN is used to predict phase masks, which are the input for the Model-NN. When training the Actor-NN, the Model-NN is fixed. (c) The predicted phase mask of the trained Actor-NN is applied to the SLM and the intensity is measured. We achieve a correlation of Γ = 0.65 compared to the Model-NNs predicted intensity.
In a first step, the MPLC is set up with two passages transfering two input beams. To gain knowledge about the black box, a single-layer NN is trained. SLM phase masks ϕSLM are used as NN input and the output is the corresponding intensity image I with
We use a single-layer perceptron as NN architecture for both, Actor- and Model-NN. The input pixels representing SLM phase masks are thereby connected straightforwardly to the pixels of the output intensity image and vice-versa, resulting in 2.304.000 trainable parameters. The NN and its internal structure is shown in
Figure 3.Structure of the NN, images are consisting of 8bit images with values from 0 to 1. The resolution is 32 × 32 and 150 × 150 for phase masks and intensity images, respectively.
With the single-layer perceptron layout, training converges within several minutes on consumer hardware for both NNs.
After training the NN, a digital twin of the MPLC setup is created called Model-NN. Consequently, the Model-NN includes all misalignments and tolerances of the setup. In a second step, the Model-NN is kept static and another single-layer NN called Actor-NN is trained controlling the model. The ultimate predictions provide phase masks that are required for the desired light shaping task. This intelligent approach allows us to consider experimental tolerances already in the calibration procedure, which advances the MPLC setup.
2.1 Model-NN mimicking the MPLC setup
The first step of the Actor-Model approach comprises training the Model-NN to create the digital twin. For this task, representative training data is necessary. For our experiments, we consider the setup shown in
Figure 4.Training progress of both Model-NN and Actor-NN. Experimental data (i.e. SLM phase masks and intensity camera images) is used for the Model-NN, while EMNIST data base is used for the Actor-NN. We used 2k data each, where 1600 samples are used for training and 400 for test, respectively. Both NNs comprise of a single-layer structure with sigmoid activation function. MSE is used as loss function with adam optimizer, wheres fidelity is used as performance indicator. Convergence is observed in all scenarios that are training (solid) and test data (dashed) for Model-NN (red) and Actor-NN (blue), respectively.
When considering a second beam at the input, another independent network part has to be created for instance by training another Model-NN. For multi-beam configurations, it is crucial that all output beams have a common overlap region which defines the area where output modes are generated. This constraint can introduce special requirements to the SLM phase masks, especially for many input beams, which will be discussed in Section 3. In
Figure 5.Example of dual-input beam configuration. Simulation data is shown. (a) and (b) show heatmaps of 1k images using 1k different random SLM phase masks. (c) overlap between (a) and (b).
2.2 Actor-NN controlling the model
In the previous step, the Model-NN is trained mimicking the experimental setup. Now we will use another NN, i.e. Actor-NN to control the model. In this step, the trained Model-NN is frozen, as shown in
Figure 6.Results from the Actor-Model approach. All samples shown are images from the EMNIST data set. Left column: Ground truth images used as Actor-NN input. Middle left: phase mask predictions resulting from the Actor-NN. Middle right: predictions from the Model-NN after the Actor-NN was trained. The correlation between Model-NN prediction and ground truth is Γ ≈ 0.7. Right: experimental results. Using the trained Actor-NN, SLM phase masks are generated that are employed for driving the MPLC. The images shown are camera recordings capturing the MPLC output. Note, that a single-beam and single-passage configuration is considered. The correlation between experiment and Model-NN prediction is Γ ≈ 0.65.
After training of the entire Actor-Model structure (see
3 Discussion
Our results show that the smart Actor-Model approach enables targeted shaping of light beams that are input to an MPLC device. With this calibration method, we treat the optical setup as a black box. Thus, we do not necessitate pixel-precise alignment in the experiment to match offline calculation, such as the WMA. In turn, we train a digital twin of the MPLC setup called Model-NN that considers knowledge of the experimental environment reducing misalignment. Therefore, proper training data of the setup needs to be created. Training data of the Model-NN comprises SLM phase masks at the input and corresponding intensity camera images at the output. For the actor, training is done by using EMNIST data set as the ground truth. The prediction of the Actor-NN during training is directly connected to the input phase mask of the fixed Model-NN, predicting the systems output intensity. The Actor-NN is trained by comparing the predicted intensity with the ground truth images. The Actor-NNs performance is limited by the performance of the Model-NN. Additionally the Actor can only provide phase masks, to shape a certain area on the camera, which is mainly the center. By reducing the ground truth intensity to this area, the performance is increased significantly to Γ ≈ 0.7, where Γ ≈ 0.82 is the maximal achievable correlation from the Model-NNs performance for unknown data.
In our investigations, we use simple single-layer perceptrons for a dual-passage configuration. Due to the exponential parameter increase in fully-connected layers when increasing the number of input / output pixels, more sophisticated NN architectures are required, e.g. DenseNet or MTNet [
For performing mode generation with the system shown, the mode profiles must be used as ground truth. This requires complex training data. So far, however, intensity-only images have been used for both model and actor. For mode generation, the system must therefore be extended by an interferometer. The NN architecture should be complemented by complex-valued NNs [
The results shown in this work are produced in single-beam and single-passage configuration. However, to shape coaxial output modes in a multi-beam configuration, all output beams must overlap in a certain region, as shown in
In multi-beam configurations, multiple independent network structures, such as multi-Model-NNs have to be trained that mimick the propagation properties of independent optical tributaries. In such arrangements, the Model-NN approach shown in this work can straightforwardly be adopted. The NN architecture in a multi-beam arrangement should comprise independent sub-NNs for each beam in both Actor-NN and Model-NN. Since in each passage all beams share the same phase mask, the Actor-NN outputs must be combined, training on a common phase mask. In the solution shown here, we predict a phase mask for one passage that is used as Actor-NN output, i.e. Model-NN input. However, for multi-passage configurations, all SLM pixels referring to different passages can be treated as one accumulated phase mask.
Adding tributaries and thus increasing the number of input beams requires increasing the number of reflection passages. According to a fair estimate with WMA, N passages are required for N input spots [
4 Conclusions
We have demonstrated an intelligent approach to calibrate an MPLC device using experimental data. Although we treat the entire light shaping system as black box, delicate knowledge about the experimental behavior is gained by using machine learning algorithms with Γ ≈ 0.82. Here, we have shown that the Actor-Model approach is feasible for online calibration of an all-optical mode multiplexer based on MPLC. In contrast to an offline calculation of SLM phase masks, our approach does not suffer from mismatches between algorithm and experiment reducing the alignment effort dramatically. This is particularly beneficial for the employment of low-complex SDM networks or the transmission of fragile quantum states.
[1] M.M. Baig, H. GholamHosseini, M.J. Connolly. Mobile healthcare applications: system design review, critical issues and challenges.
[2] P. Saranya, P. Asha. Survey on big data analytics in health care.
[3] M.A. Uusitalo, P. Rugeland, M. Boldi, E. Calvanese Strinati, G. Carrozzo, P. Demestichas, M. Ericson, G. Fettweis, M.-H. Hamon, M. Latva-aho, J. Martrat, A. Pärssinen, B. Richerzhagen, D. Sabella, H.D. Schotten, P. Serrano, G. Stea, T. Svensson, E. Ustundag Soykan, G. Wikström, V. Ziegler, Y. Zou. Hexa-X, The European 6G flagship project”, EuCNC 2021.
[4] A.D. Ellis, J. Zhao, D. Cotter. Approaching the non-linear Shan non limit.
[5] R. Ryf, S. Randel, A.H. Gnauck, C. Bolle, A. Sierra, S. Mumtaz, M. Esmaeelpour, E.C. Burrows, R.-J. Essiambre, P.J. Winzer, D.W. Peckham, A.H. McCurdy, R. Lingle. Mode-division multiplexing over 96 km of few-mode fiber using coherent 6 × 6 MIMO processing.
[6] P.J. Winzer, D.T. Neilson, A.R. Chraplyvy. Fiber-optic transmission and networking: the previous 20 and the next 20 years.
[7] D. Richardson, J. Fini, L Nelson. Space Division Multiplexing in optical Fibres.
[8] F.M. Ferreira, F.A. Barbosa, F.A. Ruocco, M.C. Lo. Scaling up SDM transmission capacity.
[9] F.M. Ferreira, D. Fonseca, H.J.A. da Silva. Design of few-mode fibers with M-modes and low differential mode delay.
[10] G. Rademacher, B.J. Puttnam, R.S. Luís. Peta-bit-per-second optical communications system using a standard cladding diameter 15-mode fiber.
[11] G. Rademacher, B.J. Puttnam, R.S. Luís, T.A. Eriksson, N.K. Fontaine, M. Mazur, H. Chen, R. Ryf, D.T. Neilson, P. Sillard, F. Achten, Y. Awaji, H. Furukawa. Degenerate mode-group division multiplexing.
[12] S.G. Leon-Saval, N.K. Fontaine, J.R. Salazar-Gil, B. Ercan, R. Ryf, J. Bland-Hawthorn. Mode-selective photonic lanterns for space-division multiplexing.
[13] G. Labroille, B. Denolle, P. Jian, P. Genevaux, N. Treps, J.-F. Morizur. Efficient and mode selective spatial mode multiplexer based on multi-plane light conversion.
[14] M. Mounaix, N.K. Fontaine, D.T. Neilson, R. Ryf, H. Chen, J.C. Alvarado-Zacarias, J. Carpenter. Time reversed optical waves by arbitrary vector spatiotemporal field generation.
[15] J.W. Czarske, D. Haufe, N. Koukourakis, L. Büttner. Transmission of independent signals through a multimode fiber using digital optical phase conjugation.
[16] S. Rothe, N. Koukourakis, H. Radner, A. Lonnstrom, E. Jorswieck, J.W. Czarske. Physical layer security in multimode fiber optical networks.
[17] S. Rothe, K.L. Besser, D. Krause, R. Kuschmierz, N. Koukourakis, E. Jorswieck, J.W. Czarske. Securing data in multimode fibers by exploiting mode-dependent light propagation effects.
[18] N. Fontaine, R. Ryf, H. Chen, D. Neilson, J. Carpenter. Design of high order mode-multiplexers using multiplane light conversion.
[19] N.K. Fontaine, R. Ryf, H. Chen, D.T. Neilson, K. Kim, J. Carpenter. Laguerre-Gaussian mode sorter.
[20] T. Hashimoto, T. Saida, I. Ogawa, M. Kohtoku, T. Shibata, H. Takahashi. Optical circuit design based on a wavefront-matching method.
[21] J. Fang, J. Bu, J. Li, C. Lin, A. Kong, X. Yin, H. Luo, X. Song, Z. Xie, T. Lei, X. Yuan. Performance optimization of multiplane light conversion (MPLC) mode multiplexer by error tolerance analysis.
[22] N. Borhani, E. Kakkava, C. Moser, D. Psaltis. Learning to see through multimode fibers.
[23] C. Zhu, E.A. Chan, Y. Wang, W. Peng, R. Guo, B. Zhang, C. Soci, Y. Chong. Image reconstruction through a multimode fiber with a simple neural network architecture.
[24] D. Pohle, S. Rothe, N. Koukourakis, J. Czarske. Surveillance of few-mode fiber-communication channels with a single hidden layer neural network.
[25] J. Sun, J. Wu, N. Koukourakis, L. Cao, R. Kuschmierz, J. Czarske. Real-time complex light field generation through a multi-core fiber with deep learning.
[26] U. Teğin, B. Rahmani, E. Kakkava, N. Borhani, C. Moser, D. Psaltis. Controlling spatiotemporal nonlinearities in multimode fibers with deep neural networks.
[27] X. Lin, Y. Rivenson, N.T. Yardimci, M. Veli, Y. Luo, M. Jarrahi, A. Ozcan. All-optical machine learning using diffractive deep neural networks.
[28] B. Rahmani, D. Loterie, E. Kakkava, N. Borhani, U. Teğin, D. Psaltis, C. Moser. Actor neural networks for the robust control of partially measured nonlinear systems showcased for image propagation through diffuse media.
[29] G. Cohen, S. Afshar, J. Tapson, A. van Schaik. EMNIST: Extending MNIST to handwritten letters.
[30] K. Matsushima, T. Shimobaba. Band-limited angular spectrum method for numerical simulation of free-space propagation in far and near fields.
[31] S. Rothe, Q. Zhang, N. Koukourakis, J. Czarske. Intensity-only mode decomposition on multimode fibers using a densely connected convolutional network.
[32] Q. Zhang, S. Rothe, N. Koukourakis, J. Czarske. Learning the matrix of few-mode fibers for high-fidelity spatial mode transmission.
[33] Y. Luo, S. Yan, H.A. Li, P. Lai, Y. Zheng. Focusing light through scattering media by reinforced hybrid algorithms.
[34] K. Schmidt, N. Koukourakis, J.W. Czarske. Assignment of focus position with convolutional neural networks in adaptive lens based axial scanning for confocal microscopy.
[35] E. Cuche, F. Bevilacqua, C. Depeursinge. Digital holography for quantitative phase-contrast imaging.
[36] J.A. Barrachina.
[37] D. Loterie, S. Farahi, I. Papadopoulos, A. Goy, D. Psaltis, C. Moser. Digital confocal microscopy through a multimode fiber.
[38] N. Koukourakis, F. Wagner, S. Rothe, M.O. Karl, J.W. Czarske. Investigation of human organoid retina with digital holographic transmission matrix measurements.
[39] D. Pohle, F.A. Barbosa, F.M. Ferreira, J. Czarske, S. Rothe. Intelligent self calibration tool for adaptive mode multiplexers using multiplane light conversion.
Get Citation
Copy Citation Text
Dennis Pohle, Fabio A. Barbosa, Filipe M. Ferreira, Jürgen Czarske, Stefan Rothe. Intelligent self calibration tool for adaptive few-mode fiber multiplexers using multiplane light conversion[J]. Journal of the European Optical Society-Rapid Publications, 2023, 19(1): 2023020
Category: Research Articles
Received: Jan. 30, 2023
Accepted: Apr. 18, 2023
Published Online: Aug. 31, 2023
The Author Email: Pohle Dennis (dennis.pohle@tu-dresden.de), Czarske Jürgen (juergen.czarske@tu-dresden.de)