Chinese Optics Letters, Volume. 23, Issue 11, 113601(2025)

Metasurface-enabled intelligent recognition and visualization for fiber modes

Yu Zhao1,2, Huijiao Wang1, Tian Huang1, Zile Li1,2、*, Lei Yu1、**, and Guoxing Zheng1,2,3
Author Affiliations
  • 1Electronic Information School, Wuhan University, Wuhan 430072, China
  • 2Peng Cheng Laboratory, Shenzhen 518055, China
  • 3Wuhan Institute of Quantum Technology, Wuhan 430206, China
  • show less

    Mode-division multiplexing technology leveraging diverse spatial modes has advanced to sustain capacity expansion in fiber-optic communications. The intelligent recognition of spatial modes using ultra-compact devices and low-complexity designs is crucial for mode visualization and system miniaturization. In this work, we theoretically design and experimentally demonstrate a neural network-optimized metasurface capable of dual-mode pattern recognition through dual-channel image display. Our framework offers three key advantages: device compatibility, design flexibility, and function scalability by integrating neural networks and metasurfaces into mode-division multiplexing platforms. Our framework enhances research and applications of intelligent metasurface-driven pattern recognition and object classification, as well as information encoding and decoding.

    Keywords

    1. Introduction

    To meet the demands of our information-driven society, the rapid growth of data traffic has necessitated significant capacity expansion in fiber-optic communications[1]. Space-division multiplexing (SDM), which explores the degree of freedom in the transverse spatial domain, has emerged as a promising solution for high-capacity communications[2]. Mode-division multiplexing (MDM), a subset of SDM, exploits diverse spatial modes to establish multiple information channels that carry independent data streams synchronously, thereby significantly enhancing data capacity and spectral efficiency in a single fiber[3]. In MDM fiber-optic communication systems, several mode basis sets are applicable for constructing parallel information channels, including linearly polarized (LP) modes[4,5], vector modes[6,7], and orbital angular momentum (OAM) modes[8,9]. Among these, the LP mode basis is widely adopted in practical applications due to the relative ease with which these modes can be excited and detected[10]. It is crucial to extract specific information about the spatial characteristics of LP modes transmitted through a single fiber, which can be extended to mode visualization for managing multiple mode channels. Furthermore, based on the scalable mode channels in the transverse spatial domain, spatial characteristics of different modes can be utilized to achieve higher-capacity and higher-density information encoding and decoding, which can be further extended to applications in high-complexity and high-security encryption and decryption. To achieve mode visualization and system miniaturization, it is essential to employ ultra-compact devices with low-complexity designs that facilitate integrated manipulation and intelligent recognition of modes transmitted in a fiber.

    Metasurfaces[11], planar optics patterned with subwavelength-scale nanoscatterers, have demonstrated the ability to manipulate various light properties[12], including amplitude[13], phase[14], polarization[15], and frequency[16]. Metasurfaces have achieved significant achievements in multifunctional optical components across multiple dimensions, such as wavelength[17], polarization[18], and spatial mode[19]. With their ultra-compact construction, wide-range integration, and multi-dimension function, metasurfaces provide a novel platform for MDM, offering both theoretical exploration and practical application. For mode emission, metasurfaces have been used to generate multiple spatial modes, either in free space or integrated on fiber ports, including OAM modes with varying topological charges[20], higher-order LP modes[21], and arbitrarily structured light[22]. In the realm of mode division, current metasurface-based devices have successfully realized the multiplexing and demultiplexing of LP modes in two orthogonal polarizations[23] and OAM modes with different topological charges[24]. Furthermore, practical applications have seen the emergence of OAM holography, allowing the multiplexing of a broad range of OAM modes, leading to OAM-encoded holograms for high-security optical encryption[25]. Beam steering systems that combine control of polarization and wavelength have facilitated the scaling of information channels for high-capacity optical communication[2628]. However, the recognition and visualization of independent spatial modes using a platform that integrates metasurface devices with optical fibers remains a significant challenge.

    In this work, we propose an intelligent mode-visualizing metasurface for recognizing LP modes transmitted in a single fiber by integrating neural network techniques and metasurfaces into an MDM platform. Specifically, the metasurface functions as a Fourier grating, enabling the switching of two holographic images in their respective target spatial regions by altering the incident mode, as shown in Fig. 1. This capability allows for mode recognition and visualization by displaying distinct characters in the captured images on the far-field observation plane. Our metasurface employs nanostructures with identical geometric size and different orientation angles to achieve phase modulation based on the geometric phase. To enhance computation speed and ensure operation accuracy, we incorporate neural network algorithms into the optimization process for the phase profile of the metasurface. A series of experiments was conducted to validate our framework, where the holographic images for the two modes were accurately displayed in their designated target regions. The ultra-compact construction applicability and multi-dimension manipulation capability of our metasurface, combined with the high-speed computation and large-volume operation of the neural network architecture, offer significant advantages, including device compatibility, design flexibility, and function scalability for intelligent mode management. Looking ahead, our framework paves a practical pathway across a wide range of applications, including intelligent metasurface-driven pattern recognition and object classification, information encoding and decoding, as well as encryption and decryption.

    Schematic diagram of the mode-visualizing metasurface.

    Figure 1.Schematic diagram of the mode-visualizing metasurface.

    2. Materials and Methods

    We use a neural network architecture to optimize the phase profile of the metasurface. Figure 2 provides an illustration of the training architecture. Initially, we collect input datasets containing amplitudes A01 and A11, phases P01 and P11, of standard and transformed mode optical fields, which are derived from planar transformations in four dimensions, containing translations along x and y axes, rotations along the central axis of mode fields, and magnifications around the central point of mode fields (See Appendix A for details about data collection).

    Training architecture of the mode-visualizing metasurface.

    Figure 2.Training architecture of the mode-visualizing metasurface.

    To visualize the two modes, we create two target images respectively with characters “LP01” and “LP11” positioned at different spatial regions, the intensities of which represent the designed output targets, denoted as T01 and T11. These target images undergo preprocessing, including distortion correction, energy compensation, and value normalization. By the phase profile of the metasurface φ and fast Fourier transform (FFT), the actual outputs O01 and O11 can be computed as O01=|FFT[A01exp(iP01+iφ)]|2,O11=|FFT[A11exp(iP11+iφ)]|2.

    We customize the loss function Loss, which is computed as Loss=αSUM[(O01T01)2×WMask01]+βSUM[(O11T11)2×WMask11].

    The terms WMask01 and WMask11 refer to the “weight masks” applied to T01 and T11, which are binary masks to emphasize the target output areas and reduce the influence of background regions, thereby enhancing the convergence speed of the training network. The squared difference between the designed and actual outputs is weighted by the masks and then aggregated using the SUM operator. The coefficients α and β are, respectively, set as 1.15 and 1 for energy compensation between the two-mode optical fields. Subsequently, the Loss is delivered in the backpropagation to the neural network, and the gradient is updated by the Adam optimizer to generate the new phase profile (See Appendix B for details about the training configuration).

    To achieve high transmittance and precise manipulation at the operation wavelength of 1550 nm, we carefully design the metasurface unit cells, focusing on the geometry dimensions and orientation angles.

    The unit-cell nanostructure, made from monocrystalline silicon, consists of a nanobrick sitting on a substrate, as shown in Fig. 3(a). The structural parameters include cell size C, length L, width W, height H, and orientation angle α of the nanobrick. We used COMSOL Multiphysics software to scan cell size C, length L, and width W of the nanobrick. The final nanostructure design, with dimensions of C=860nm, L=680nm, W=260nm, and H=1000nm, achieves a high transmittance over 70% and complete phase modulation from 0 to 2π for the cross-polarized component, as shown in Fig. 3(b).

    Structural design of the mode-visualizing metasurface. (a) Schematic of the nanostructure unit-cell. (b) Simulated optical responses of the nanostructure unit-cell. (c) Optimized phase profile. (d) Enlarged phase profile.

    Figure 3.Structural design of the mode-visualizing metasurface. (a) Schematic of the nanostructure unit-cell. (b) Simulated optical responses of the nanostructure unit-cell. (c) Optimized phase profile. (d) Enlarged phase profile.

    The optimized phase profile of the metasurface, consisting of 1000 pixel × 1000 pixel, is illustrated in Fig. 3(c), and an enlarged part (20 pixel × 20 pixel) at the upper-left corner of the phase profile is shown in Fig. 3(d). Subsequently, the orientation angles α of the nanobricks are derived from the optimized phase profile φ of the metasurface, following the principle of geometric phase, which is expressed as φ=2α.

    As a result, single-sized nanostructures with identical geometric dimensions but varying orientation angles are arranged in a single-layer configuration, accomplishing the structural construction of the metasurface.

    3. Results

    To evaluate the theoretical feasibility of our framework, we compute the Kirchhoff diffractions using the constructed metasurface. Input optical fields of the LP01 and LP11 modes, which are normally projected onto the metasurface, are shown in Figs. 4(a) and 4(e). After modulations by the metasurface, the simulated output holographic images, observed at 10 cm from the metasurface, are, respectively, shown in Figs. 4(b) and 4(f). The simulated target images of both modes as depicted by the white boxes are accurately positioned in their respective target regions, which are extracted and enlarged displaying the characters “LP01” and “LP11,” as shown in Figs. 4(c) and 4(g), respectively. The designed target images used in the training architecture are depicted in Figs. 4(d) and 4(h). These evidences indicate a high degree of agreement between simulated and designed target images. The simulation results indicate that our constructed metasurface enables the switching of the holographic images under the conversion of the incident spatial modes.

    Numerical simulation results of the mode-visualizing metasurface under x-polarized modes. The first and second rows show the input modes, simulated outputs, simulated targets, and designed targets for the LP01 and LP11 modes, respectively. The white boxes highlight the target images.

    Figure 4.Numerical simulation results of the mode-visualizing metasurface under x-polarized modes. The first and second rows show the input modes, simulated outputs, simulated targets, and designed targets for the LP01 and LP11 modes, respectively. The white boxes highlight the target images.

    To demonstrate the functional applicability of our framework, we fabricate a metasurface sample and set up an optical system to characterize mode recognition and visualization (Appendix C for details about sample fabrication). Figure 5 presents scanning electron microscope (SEM) images of the fabricated metasurface at different magnifications, in which the nanobrick adhesion appears, caused by fabrication imperfections. This adhesion reduces the holographic efficiency and negatively impacts the experimental performance. This issue can be mitigated through improvements in both the design and fabrication processes. From the design perspective, increasing the dimensional contrast between the nanobrick cell sizes and lengths can reduce the probability of adhesion. In terms of fabrication, decreasing the length and width of the nanobricks can help correct size deviations.

    Partial SEM images of the fabricated metasurface.

    Figure 5.Partial SEM images of the fabricated metasurface.

    The optical system used to capture the holographic images is illustrated in Fig. 6(a). A tunable laser (THORLABS TLX1) served as the light source, and a few-mode fiber supported the LP01 and LP11 modes, operating at a wavelength of 1550 nm. The mode fields were collimated by a lens (THORLABS PAF2-2C) and then passed through a circular polarizer (THORLABS CP1L1550) to adjust their polarization states. The mode fields projected onto the metasurface were captured by an infrared camera (HAMAMATSU C12741-03), as shown in Figs. 6(b) and 6(e). The holographic images were focused by a lens (LATEK MBCX10606-C) and then received by an infrared detector board (LATEK IRDC1-200S-M-SP230614) with a black cardstock to block zero-order light. The holographic images were captured using a commercial camera (Nikon D5100). As shown in Figs. 6(c) and 6(f), the patterns of the holographic images transformed when the incident modes switched between LP01 and LP11 modes. The target regions in these holographic images are extracted and enlarged to display the details, revealing the characters “LP01” and “LP11,” which showed the spatial characteristics of the incident modes, as shown in Figs. 6(d) and 6(g).

    Experimental results of the mode-visualizing metasurface. (a) Experimental setup to receive the holographic images. (b)–(g) Captured mode field, captured holographic image, and enlarged target image under LP01 and LP11 mode incidences.

    Figure 6.Experimental results of the mode-visualizing metasurface. (a) Experimental setup to receive the holographic images. (b)–(g) Captured mode field, captured holographic image, and enlarged target image under LP01 and LP11 mode incidences.

    Distortions observed in the holographic images primarily arise from aberrations induced by the high numerical aperture focusing lens. Additionally, structural imperfections in the fabricated metasurface and spatial misalignments between the incident mode field and the metasurface further degrade image quality. Integrating the metasurface directly onto the fiber end face can reduce these spatial misalignments, thereby enhancing the accuracy and fidelity of the experimental results.

    We employed the Pearson correlation coefficient to evaluate the mode fidelity, which is calculated as ρ(A,B)=cov(A,B)σAσB.Here, cov(A,B) represents the covariance between A and B, and σA and σB denote the standard deviations of A and B. We calculated the Pearson correlation coefficients between the experimental target images without being focused and the corresponding designed target images, obtaining values of 0.80 and 0.78, respectively, for the LP01 and LP11 modes, indicating the high fidelity of the experimental images. In comparison, the Pearson correlation coefficients between the simulated target images and the designed target images were calculated, as shown in Figs. 7(a) and 7(b). Moreover, the energy ratio between the target image of the incident mode and that of the other mode in the simulated and experimental holographic image was calculated to evaluate the mode crosstalk, as shown in Figs. 7(c) and 7(d). Although the crosstalk increased and the fidelity declined, target characters for each mode in the holographic images were identified in experiments. Overall, the experimental results demonstrate that our fabricated metasurface successfully achieved the recognition and visualization of two spatial modes through the patterns of the holographic images.

    Quality evaluation of the target images. (a) Pearson correlation coefficients between the simulated and desired target images. (b) Pearson correlation coefficients between the experimental and desired target images. (c) Energy ratio between the target image of the incident mode and that of the other mode in the simulated holographic image. (d) Energy ratio between the target image of the incident mode and that of the other mode in the experimental holographic image.

    Figure 7.Quality evaluation of the target images. (a) Pearson correlation coefficients between the simulated and desired target images. (b) Pearson correlation coefficients between the experimental and desired target images. (c) Energy ratio between the target image of the incident mode and that of the other mode in the simulated holographic image. (d) Energy ratio between the target image of the incident mode and that of the other mode in the experimental holographic image.

    4. Conclusion

    In summary, we theoretically designed and experimentally demonstrated an intelligent mode-visualizing metasurface that recognizes fiber modes via holographic imaging. This single-layer, single-cell approach integrates metasurface-based phase modulation with neural network optimization, ensuring image projection across multiple information channels. By combining the ultra-compact structure and concise design of metasurfaces with the high-speed computation and scalability of neural networks, our framework enhances compatibility, flexibility, and expandability for mode management. These attributes make it highly promising for object classification, information encoding and decoding, and encryption and decryption.

    Appendix A: Data Collection

    Relying on the theoretical dimension parameters of the fiber, we generate reference field distributions for the LP01 and LP11 modes. The mode fields from the fiber are captured by an infrared camera positioned 2 cm from the collimating lens, and the actual field distributions are extracted from the images with an area of 860μm×860μm. Subsequently, the values of fiber parameters are scanned to fit the reference field distributions to actual field distributions. We determine the standard LP01 and LP11 mode field distributions, respectively, exhibiting correlation coefficients of 0.9851 and 0.9737 with the actual mode field distributions.

    We employ position parameters to generate transformed mode fields, adjustable within experimentally controllable ranges. For position translations along the x and y axes, considering the different distributions of the two-mode standard field distributions, we, respectively, set the translation scale of the LP01 and LP11 modes in the positive and negative directions to be 8.6 and 10.75 μm, with a resolution of 1.72 and 2.15 μm. Addressing the angle rotation along the central axis of the mode fields, we establish a rotation scale of 2 deg with both clockwise and counterclockwise directions, with a resolution of 1 deg. For area magnifications around the central point of the mode fields, we set the reduction scale as 0.9 with a resolution of 0.01 and the enlargement scale as 1.1 with a resolution of 0.01 for the two modes.

    Appendix B: Training Configuration

    Our neural network architecture is implemented using Python (v3.8.13) and PyTorch (v1.12.0) on an NVIDIA GeForce RTX 3090 GPU and an Intel (R) Core (TM) i9-10900X CPU at 3.70 GHz with 128 GB of RAM, running on the Ubuntu 20.04 operating system. The optimization employs the following parameters: decay rates β1=0.9 and β2=0.999, learning rate η=0.001. The entire training process took approximately 2 h.

    Appendix C: Sample Fabrication

    Our designed metasurface featured a single-layer construction, containing all-dielectric nanostructures made from monocrystalline silicon. First, a PMMA photoresist with a concentration of 1:1 was spin-coated onto the sample, which underwent an exposure process through an electron beam exposure machine (eLINE Plus) based on the designed construction of the nanobricks. Following this, a 25-nm-thick film of Cr was thermally evaporated after the development and fixation process. The Cr in the unexposed area was removed through a lift-off process, forming the Cr film consistent with the designed construction of nanobricks on the surface. Finally, etching was conducted using an inductively coupled plasma etcher (PlasmaPro 100Cobra 300) with an etching depth of 1000 nm. A corrosive was employed to eliminate the remaining Cr, obtaining the final metasurface.

    Appendix D: Scalability Demonstration

    To explore the scalability of metasurface-enabled mode recognition, we have presented the design of a dual-dimensional multiplexed metasurface across both polarization and mode domains. Based on the principle of the transmission phase, the metasurface composed of nanostructures with varying dimensions enables independent phase modulation for x- and y-polarized mode fields, allowing for the recognition of modes under both polarization states. The optimized phase of the dual-mode multiplexed metasurface is assigned as the phase profile in the x-polarization direction of the dual-dimensional metasurface. Using the same neural network architecture and mode datasets, the y-polarization phase profile is obtained by defining target holographic images with the characters “LP01” and “LP11” positioned in the lower-left and lower-right regions, respectively, for the LP01 and LP11 modes.

    The simulation results under x-polarized modes are shown in Fig. 4, while those for y-polarized modes are provided in Fig. 8. The figures illustrate the input mode fields, simulated output holographic images, simulated target images (highlighted with white boxes), and designed target images. The target characters “LP01” and “LP11” are clearly and accurately reconstructed in their designated regions with high fidelity. These results demonstrate that the designed dual-dimensional multiplexed metasurface achieves mode recognition across both polarization and mode domains, thereby enhancing information capacity and transmission density.

    Numerical simulation results of mode-visualizing metasurface under y-polarized modes. The first and second rows show the input modes, simulated outputs, simulated targets, and designed targets for the LP01 and LP11 modes, respectively. The white boxes highlight the target images.

    Figure 8.Numerical simulation results of mode-visualizing metasurface under y-polarized modes. The first and second rows show the input modes, simulated outputs, simulated targets, and designed targets for the LP01 and LP11 modes, respectively. The white boxes highlight the target images.

    [4] G. Rademacher, B. J. Puttnam, R. S. Luís et al. 10.66 peta-bit/s transmission over a 38-core-three-mode fiber. Optical Fiber Communication Conference (OFC) 2020, Th3H.1(2020).

    Tools

    Get Citation

    Copy Citation Text

    Yu Zhao, Huijiao Wang, Tian Huang, Zile Li, Lei Yu, Guoxing Zheng, "Metasurface-enabled intelligent recognition and visualization for fiber modes," Chin. Opt. Lett. 23, 113601 (2025)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Nanophotonics, Metamaterials, and Plasmonics

    Received: Apr. 22, 2025

    Accepted: Jun. 19, 2025

    Posted: Jun. 19, 2025

    Published Online: Sep. 23, 2025

    The Author Email: Zile Li (lizile@whu.edu.cn), Lei Yu (ly.wd@whu.edu.cn)

    DOI:10.3788/COL202523.113601

    CSTR:32184.14.COL202523.113601

    Topics