Key Laboratory of Specialty Fiber Optics and Optical Access Networks, Joint International Research Laboratory of Specialty Fiber Optics and Advanced Communication, Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai 200444, China
Vector vortex beams (VVBs) have attracted considerable attention due to their unique polarization distribution and helical phase wavefront. We first attempt to retrieve the modal coefficients of hybrid VVBs measured by their multiplex polarized intensities using the deep learning (DL)-based stochastic parallel gradient descent (SPGD) algorithm. The Xception-based DL model with multi-view images can make an accurate prediction of modal coefficients that are validated by the theoretical calculations of the waveplate angles, demonstrating a high correlation of 99.65%. The universality of the algorithm to high-order vector-eigenmodes (VMs) decomposition is proved to enable the precise reconstruction of modal patterns generated by mode-selective couplers, which promotes the accurate characteristics of VVBs in laser beam characterization and fiber mode-division multiplexing.
【AIGC One Sentence Reading】:Summary: We validate high-order VM decomposition using DL-SPGD, achieving 99.65% accuracy in predicting modal coefficients of VVBs, enhancing laser beam and fiber optic applications.
【AIGC Short Abstract】:This study validates high-order vector-eigenmode decomposition using a deep learning-based SPGD algorithm, focusing on VVBs' unique polarization and phase. The Xception-based model accurately predicts modal coefficients, achieving a 99.65% correlation with theoretical calculations. This approach enables precise VM reconstruction, enhancing VVB characterization in laser beams and fiber multiplexing.
Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.
Due to the capability of carrying orbital angular momentum (OAM) and possessing distinctive polarization characteristics, vector vortex beams (VVBs) are widely applied in the fields of super-resolution microscopy[1,2], molecule orientation imaging[3], and mode-division multiplexing (MDM)[4–6]. In particular, focusing radially polarized light can create a strong longitudinally electrical component at the focus, while tightly focused azimuthally polarization leads to a strong axial magnetic field and a vanishing electric field[2,3]. Among them, cylindrical vector beams, as orthogonal eigenmodes, offer more available high-order modes, which can further expand the communication capacity in multi-mode fibers. Through vector-eigenmodes (VMs) modulation, linearly polarized (LP) or circularly polarized (CP) OAM can be exploited as information channels[7]. However, due to mode coupling and interference in fibers, hybrid VMs are inevitably involved in the transmission. Therefore, there is a challenge in identifying modal coefficients in the fields of optical communication and fiber MDM.
In recent years, modal decomposition (MD) techniques based on numerical methods, such as inverse matrix decomposition[8,9] and the stochastic parallel gradient descent (SPGD) algorithm[10,11], have rapidly become essential tools for revealing the characteristics of optical field modes. However, they may suffer from the drawback of falling into local optimal solutions. Meanwhile, deep learning (DL) algorithms, especially convolutional neural networks (CNN), have shown promise in the field of optics due to their powerful global search capabilities[12–14], but most of them are retrieved based on only-intensity beam distributions. More polarization-projected intensity distributions azimuthally across the transversal beam are necessary to differentiate high-order degenerate VMs, which will lead to training computational costs for the DL process increasing greatly. The common concern is that DL-based networks are typically trained on simulated data and then applied to experimental data. Due to the lack of an equivalent relationship between simulated and experimental data, the reliability of experimentally reconstructed modal results cannot be guaranteed.
Here, to address these issues, we first attempt to validate the modal coefficients of the measured hybrid VVBs using the DL-based SPGD algorithm by retrieving multiplex polarized intensities, instead of only considering the beam intensity. Leveraging the correspondence between the waveplate angles and the modal coefficients, as well as polarization characteristics of the hybrid VMs, Xception models with different channel numbers are trained based on the simulation data and selected to decompose the intensity distribution of four polarization directions. In combination with the SPDG algorithm, Xception-based predictions of modal coefficients and modal intensity reconstruction in VM and OAM bases are obtained. This method successfully makes an accurate prediction of modal coefficients, which is validated through theoretical calculations based on the waveplate angles, demonstrating a high correlation of up to 99.65%. Second, modal coefficients of typically mixed VVBs are qualitatively analyzed in the form of VM and OAM mode bases. Finally, modal coefficients of first-order and second-order degenerate eigenmodes in few-mode fibers (FMF) by all-fiber mode-selective couplers (MSCs) are reconstructed. The universality of the DL-SPGD algorithm is demonstrated through the precise reconstruction of high-order VMs, which greatly promotes the accurate characteristics of VVBs in laser beam characterization, MDM, and other optical communication fields.
Sign up for Chinese Optics Letters TOC Get the latest issue of Advanced Photonics delivered right to you!Sign up now
2. Verification Experiment and Results
The first-order VM division multiplexing setup based on a vortex waveplate (VWP) is constructed based on the configuration of a Mach–Zehnder interferometer, as shown in Fig. 1(a). The fundamental mode Gaussian beam () emitted by a laser source () is collimated by a lens and converted into a uniformly LP beam by a polarizer (Pol1). , , or their admixture can be generated with appropriate polarized light passing through the VWP. Then, through a non-polarization beam splitter (BS), one way transmits and modes, and those of another way are transformed into or modes after passing a half-wave plate (HWP)[15,16]. A charge-coupled device (CCD) is used to record modal polarized field distribution of the VMs after passing another polarizer (Pol2). The multiplexing VM field distributions can be expressed as where , , and correspond to the field amplitude, phase, and electric field of the VMs, respectively.
Figure 1.(a) Diagram of the experimental device. (b) The relationship between the VMs and the angle of Pol1. (θ1) and the angle of HWP (θ2). TEM00, fundamental mode; Pol, polarizer; VWP, vortex waveplate; BS, non-polarizing beam splitter; HWP, half-wave plate; CCD, charge-coupled device.
The transformation relationship between the VMs by changing the waveplate angles is shown in Fig. 1(b). Relative to the fixed VWP, and are defined as the angles of Pol1 and the HWP, which can be precisely rotated to convert the VMs through phase geometry transformation of the VWP. By adjusting different angles of and , the mixed VMs with different weights among them can be obtained. It is noteworthy that the waveplates introduce an inherent phase difference of between TM01/TE01 and . Additionally, a 50:50 BS divides the input light into two parts, which means where , , , and correspond to the weight coefficients of VMs.
In this Letter, Xception is chosen as the CNN model because of its unique network structure. It has depthwise separable convolutions, which can significantly reduce the number of parameters and improve the training efficiency[17]. Because the VMs have non-uniform spatial polarization characteristics, the training images with the resolution of used in this model are taken from their polarization projected intensity patterns. For the first- and second-order VMs, four intensity patterns of polarization direction at 0°, 45°, 90°, and 135° are sufficient to reveal the polarization properties of the VMs. As shown in Fig. 2, two schemes of pattern-structured data are compared during the Xception-based DL process. One is the -channel (here, is 4) image data, and the other is the matrix image data formatted with the resolution of as a multi-view image. Furthermore, more polarized intensities at a smaller angle separation are desirable for higher-order VMs, but it will result in more channel numbers for the training scheme or increase the resolution of multi-view images for the training process.
The entire flow of the DL-SPGD algorithm is shown in Fig. 2. The dataset of 40,000 random patterns is first trained in the Xception model for testing prediction. Then, the rough modal coefficients and are used as the initial values of the SPGD algorithm for accurate modal reconstruction. After certain iterations, predicted modal coefficients in the VM bases and reconstructed modal patterns using the DL-SPGD are obtained, which can be further converted to LP-OAM or CP-OAM bases using the matrix transformation of degenerating mode bases[7].
First, the performance comparison of different channel numbers is presented in Table 1. Validation loss is a crucial metric for evaluating the performance of network models, which are calculated here using the mean squared error between the output and label vectors. where and represent the output and label vectors, respectively. The vector length () of both first- and second-order VMs is seven, including four weight coefficients and three relative phase differences. represents the training batch size to be 10. To evaluate the predictive performance, the average error of modal coefficients is calculated, which comprises the modal weight errors and relative phase error . Additionally, model correlation is another metric used to demonstrate the accurate precision of predicted modal coefficients. Here, the correlation is denoted as where represents the reconstructed or predefined mean intensity. Compared to the 4-channel model, the 1-channel model can reduce the number of convolution operations, resulting in a shorter training duration. From Table 1, it also can be seen that the performance of the 1-channel data can excel over that of the 4-channel data because less information is lost in the training process and polarization-dependent intensity information is embedded in the matrix patterns.
Based on the performance evaluation and polarization characteristics of the VMs, the validation loss curves are seen in Fig. 3(a), which show a fast convergence after about 90 epochs and a similar loss trend for the first- and second-order VMs. The results indicate that the feature of the matrix pattern can be learned well with a low loss. Another dataset with 1000 polarized intensity pattern is tested, and the average modal error of weights and phases is lower than 0.89% and 0.76%, as shown in Fig. 3(b). It is worth noting that the modal coefficient errors of low-order VMs are smaller than those of high-order ones. This might be understood because high-order VMs possess complicated intensity and polarization distribution.
Figure 3.(a) Validation loss for first- and second-order VMs. (b) Their average errors of modal weights and relative phases. (c) The averaged correlations of the SPGD and the DL- SPGD algorithms for first-order VMs over 200 testing samples. (d) Given and predicted weights and relative phases for first-order VMs.
Second, compared to that from the SPGD algorithm with inputting random coefficients, the predicted modal ones from the Xception-based DL process can not only efficiently acquire convergence with fewer iterations but also improve the accuracy of correction, as shown in Fig. 3(c). The comparisons between given and predicted modal coefficients using DL-SPGD algorithms for first-order VMs are shown in Fig. 3(d). It can be seen that the modal coefficients reconstructed by the DL-SPGD algorithm are closer to the given ones, and the correlation reaches 99.997%. This is due to the fact that the DL-SPGD algorithm perfectly combines the global optimization of DL and the accurate research capability of the SPGD.
Third, we attempt to experimentally validate the modal coefficients of measured VMs through DL-based SPGD algorithm. The modal weights can be calculated based on the Jones Matrix transformation of waveplates at different angles ( and ). The modal pattern generated as and is reconstructed in Fig. 4(a), and the reconstructed polarization beam profiles have great consistency with the measured ones. Accurate predictions of modal coefficients of the experimental pattern in the VMs base are achieved, as shown in Fig. 4(b), with a correlation of 99.57%. Moreover, according to the transformation between VM and LP-OAM and CP-OAM bases, their measured and predicted modal weights and relative phases are compared in Figs. 4(c) and 4(d). It should be noted that there are opposite modal phases with the same modal weights because of the phase ambiguity. Modal coefficients cannot be determined solely based on the four polarization projected intensity patterns. By utilizing pattern interference, the phase distribution can be examined, thereby obtaining unique predicted mode coefficients that correspond to the theoretical values.
Figure 4.(a) Measured (Mea.) and reconstructed (Rec.) intensity patterns of a VM by the DL-SPGD algorithm. The measured and predicted modal coefficients in (b) the VM, (c) the CP-OAM, and (d) the LP-OAM mode bases.
Fourth, the predicted and measured modal coefficients are consistent at any admixture of VMs, which further validates the efficient decomposition using the DL-SPGD algorithm. As shown in Fig. 5, the mixed and or the mixed with variable modal weights in two paths of the Mach–Zehnder interferometer are combined to constitute the hybrid VMs, which are determined by the orientation of the VWP and HWP. The modal weights of and are determined by the input polarization direction relative to the VWP . When is fixed, the mode coefficients of TE01 and TM01 remain constant, while, by rotating , the modal weights of and modes show periodic changes. The dependences of their modal weights on the angles are shown in Figs. 5(a)–5(d), which demonstrate excellent agreement between measured and predicted modal weights, and an average correlation reaches up to 99.65%. This proves that the DL-SPGD algorithm can effectively provide accurate predictions for experimental data.
Figure 5.Predicted (red dots and blue triangles) and measured (blue and red lines) modal weights. (a) Mixed TM01 and TE01 modes, (b) mixed HE21even and HE21odd modes. Hybrid four VMs when (c) θ1 = 0 and (d) θ1 = π/4. (e1), (f1) The output and polarized patterns of the VMs. Interference patterns when the polarized angles are set at (e2), (f2) 0° and (e3), (f3) 90°. (Mes., measured modal coefficients; Pre., predicted modal coefficients.
Two typical mixed VMs by mode superposition are marked as points A and B in Figs. 5(c) and 5(d), which are considered as degenerate odd and even modes of the same order VMs with a phase difference of . Through matrix transformation of degenerating mode bases, the hybrid VM (point A) is described as , and another hybrid VM (point B) is seen as . These mixed OAM modes can be separated using a polarizer, and their interference patterns are observed, as shown in Figs. 5(e2), 5(f2) and 5(e3), 5(f3), respectively. It is seen that there is only at 0° polarization and only at 90° polarization. Therefore, rotating the direction of the polarizer, the output modes can be switched between positive and negative topological charges of OAM.
3. Experiment of Recognition in All-Fiber MSCs
Finally, through the precise reconstruction of high-order VM generation using all-fiber MSCs, we demonstrate high-order VM decomposition based on the DL-SPGD algorithm. The measured and predicted patterns of the first- and second-order hybrid VMs are shown in Figs. 6(a1) and 6(b1), respectively. It is found that their measured beam profiles have great consistency with the reconstructed ones. Among them, their intensity distributions with spatially LP-mode separations are non-uniform and polarization-dependent, which can be reconstructed more accurately using multi-view images. The predicted modal coefficients of the first-order VMs are shown in Fig. 6(a2), in which is dominated for a larger proportion of over 77% and has a phase difference of 1.726 with . As shown in Fig. 6(b2), the second-order VMs with the prediction coefficients of and are more than 99%, and there is a phase difference close to between and . Their predicted modal coefficients in the OAM base are shown in Figs. 6(a3), 6(a4), 6(b3), and 6(b4). The predicted patterns expressed in the VM and OAM bases are self-consistent with the measured patterns. In a word, our proposed algorithm for high-order VM decomposition maintains high accuracy in the practical experiment. Of course, it is desirable to use more polarization intensities as multi-view images to reveal the polarization characteristics, but this would increase the calculation amount and reduce the calculation efficiency for multi-channel data.
Figure 6.Experimental (Exp.) and reconstructed (Rec.) intensity patterns of (a1) first-order and (b1) second-order VMs converted from fiber MSCs. Their predicted modal weights and phases of (a2), (b2) VM modes, (a3), (b3) CP-OAM base, and (a4), (b4) LP-OAM base.
In summary, the applicability of the DL-SPGD algorithm to high-order vector-eigenmode decomposition is verified based on multiple polarized intensity distributions. Due to the polarization-dependent VMs, the Xception model using multi-view data can provide faster training efficiency and excellent prediction accuracy. The correlation based on the DL-SPGD algorithm can be significantly increased to 99.997%, and the average correlation between the predicted and the measured VMs is 99.65%. The modal coefficients of the typical hybrid VVBs are quantitatively analyzed in the form of VM and OAM modal bases. Finally, the modal coefficients of high-order VMs using fiber MSCs demonstrate the experimental universality of the proposed MD method. This advancement not only bridges the gap between DL techniques and fiber optics but also promotes the precise characterization of VVBs in laser characterization, fiber MDM, and optical communications.
[17] F. Chollet. Xception: deep learning with depthwise separable convolutions. Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition, 195(2017).
Tools
Get Citation
Copy Citation Text
Huihui Zhao, Siyan Wang, Yancheng Huang, Wei Chen, Fufei Pang, Xianglong Zeng, "Experimental validation for high-order vector-eigenmode decomposition with polarization characteristics," Chin. Opt. Lett. 22, 110602 (2024)