The fast algorithms in Fourier optics have invigorated multifunctional device design and advanced imaging technologies. However, the necessity for fast computations limits the widely used conventional Fourier methods, where the image plane has a fixed size at certain diffraction distances. These limitations pose challenges in intricate scaling transformations, 3D reconstructions, and full-color displays. Currently, the lack of effective solutions makes people often resort to pre-processing that compromises fidelity. In this paper, leveraging a higher-dimensional phase space method, a universal framework is proposed for customized diffraction calculation methods. Within this framework, a variable-scale diffraction computation model is established for adjusting the size of the image plane and can be operated by fast algorithms. The model’s robust variable-scale capabilities and its aberration automatic correction capability are validated for full-color holography, and high fidelity is achieved. The tomography experiments demonstrate that this model provides a superior solution for holographic 3D reconstruction. In addition, this model is applied to achieve full-color metasurface holography with near-zero crosstalk, showcasing its versatile applicability at nanoscale. Our model presents significant prospects for applications in the optics community, such as beam shaping, computer-generated holograms (CGHs), augmented reality (AR), metasurface optical elements (MOEs), and advanced holographic head-up display (HUD) systems.
【AIGC One Sentence Reading】:The phase space framework introduces a flexible diffraction model for imaging and display, enabling variable-scale computations, aberration correction, and high-fidelity holography, with broad applications in optics.
【AIGC Short Abstract】:The proposed phase space framework introduces a variable-scale diffraction model, enabling flexible imaging plane adjustments. Validated for full-color holography, it achieves high fidelity and automatic aberration correction, demonstrating superior performance in holographic 3D reconstruction and nanoscale metasurface holography. This model offers significant potential for optics applications, including beam shaping and advanced holographic displays.
Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.
1. INTRODUCTION
Traditional projection and displays are no longer sufficient to meet the increasingly diverse demands, such as holographic head-up display (HUD) [1] and augmented reality (AR) [2]. Advanced imaging and display technologies, represented by Fourier optics with diffractive optics [3] as a prime example, provide an alternative approach. For instance, the advent of the computer-generated hologram (CGH) has provided various manifestations for holography, including three-dimensional (3D) reconstruction [4–8], high-resolution capabilities [5,7,9,10], and implementations in augmented reality (AR) and virtual reality (VR) [7,11,12]. In recent years, driven by increasingly mature design and process capabilities, higher precision and flexibility in optimizing and fabricating devices have been gained. This has unlocked boundless possibilities in manufacturing, imaging, and display, such as the utilization of freeform surfaces [13–15], diffractive optical elements [13,16,17], optical waveguides [2,11], and liquid crystal [18,19]. In particular, the advent of metasurfaces has opened versatile modalities for coherent imaging [20–23]. Beyond the ultra-thin merits inherent to traditional refractive devices, metasurfaces offer a substantial advantage in polarization control [24]. Metasurface holography, with its high information density in a planar format, holds tremendous potential for cutting-edge displays [23–28], paving the way for the realization of high-fidelity holography.
Despite continuous advancements in the design and fabrication methods of optical elements, without exception, these designs involving scalar diffractive optics invariably employ conventional Fourier methods. The angular spectrum method (ASM), single Fresnel transform (SFT), and their variants represent the prevailing ones. Despite their widespread utilization and computational efficiency with fast Fourier transform (FFT), they suffer inherent limitations. Fixed sampling intervals, dictated by the conservation of the space-bandwidth product (SBP), restrict their universality. ASM requires equal-sized object and image planes, while SFT correlates image plane size linearly with propagation distance under the operation of optical Fourier transformation [3,29,30]. In a considerable proportion of cases, ASM based or SFT based diffraction iteration processes may result in insufficient casting or resource waste. Challenges arise when intricate scale transformation operations are required such as AR and holographic HUD, or when different depths of 3D projection demand varying levels of detail. Moreover, in color imaging and display, conventional Fourier methods such as the Fraunhofer method (FM) may exhibit chromatic aberrations, necessitating complex correction and demanding additional computational resources. So, the pixel pre-scaling for different colors during the computational process will compromise fidelity. In addition, the widely acknowledged accurate Rayleigh–Sommerfeld integral (RSI) is computationally slow and thus less suitable for inverse design applications. Conventional Fourier methods offer a rapid and efficient tool for diffraction calculation, and associated sampling and calculation strategies have been continually proposed and refined [4,31–37]. But in general they did not break inherent scaling limitations and merely represented a slight extension of conventional ASM and SFT. At times, new strategies were accompanied by decreased image quality and narrowed application scope. For example, diffraction calculation algorithms can be based on non-uniform FFT [38]. However, they may not be well-suited for inverse design applications, so they are rarely used. Overall, the modulation of the light field still has a relatively low degree-of-freedom, even in relatively simple cases of coherent imaging and display, such as CGH imaging and automatic HUD. The fundamental reason for this lies in the fact that these methods are entrapped by the two local solutions of the Helmholtz equation in wave optics. New solutions are difficult to achieve through fast Fourier algorithms.
Finding concise solutions to address these longstanding issues directly from the Helmholtz equation may be a futile endeavor at present. Therefore, we seek breakthroughs from a higher-dimensional perspective to derive new diffraction computation methods suitable for fast Fourier algorithms. Phase space analysis, initially introduced in quantum mechanics through the behavior of the Wigner distribution function (WDF) [39], has found extensive applications in describing optical systems [40,41]. It has been instrumental in understanding various phenomena, including the relationship between coherence and radiation measurement [42], the connection between ambiguity function and holography [43], and the measurement of partial coherence [44]. The completeness of the WDF for signal description fully reveals the spatial distribution of light in phase space. In higher dimensions, the WDF based phase space analysis itself constitutes a form of Fourier analysis. Phase space analysis directly analyzes the modulation of the light field from the perspective of distribution and transmission. Conventional Fourier optics can be viewed as a degenerate form of phase space analysis, and they are not mutually exclusive. In recent research [45–47], the relationship between conventional Fourier methods and WDF propagation was discussed. However, there are few reports on FFT based Fourier algorithms designed for coherent light field diffraction calculations using phase space analysis.
Sign up for Photonics Research TOC Get the latest issue of Advanced Photonics delivered right to you!Sign up now
Within the higher-dimensional phase space perspective, this paper explores a new course for coherent imaging computations. First, it explores a universal framework based on matrix decomposition that empowers the researchers to design diffraction calculation methods according to different application scenarios. Subsequently, a variable-scale model is delivered that allows the maximum spatial frequency of the image plane to be freely selected within certain constraints. This model still leverages Fourier analysis for fast computations and effectively addresses the size limitation issue inherent in conventional Fourier methods. The comparison and advantages of our model against other canon methods are illustrated in Table 1. Experimental validations, including advanced holographic and tomographic displays, demonstrate the model’s robustness in variable-scale capability and chromatic aberration correction. Additionally, our model is applied to the design of full-color, near-zero crosstalk holographic metasurfaces, showcasing their applicability at nanoscale. Therefore, the longstanding algorithmic chromatic aberration issue in full-color holography would be effectively addressed. Rooted in phase space analysis, our approach offers a new tool for coherent imaging and display, providing an effective diffraction computation scheme for the diffractive optics community.
Comparison of Our Model against the Canon Methods
Model
Origin
Fast Inverse Algorithm
Working Distance
Pre-processing
Scaling Factor
RSI
Wave optics
No
Arbitrary
Not needed
1
ASM
Wave optics
Yes
Near field
Not needed
1
SFT
Wave optics
Yes
Far field
Required
Proportion of
FM
Wave optics
Yes
Ultra-far field
Required
Proportion of
Our model
Phase space optics
Yes
Arbitrary
Not needed
Variable
2. UNIVERSAL FRAMEWORK AND THE PROPOSED MODEL
Under coherent illumination, a two-dimensional (2D) complex field distribution possesses a WDF characterized in phase space. The WDF of on the object or input plane is a four-dimensional (4D) function, encompassing information from both spatial and Fourier domains simultaneously, providing a comprehensive description of the light field signal: where * denotes the conjugation, is the spatial offset relative to in spatial domain, and is the -vector in the Fourier domain with indicating the transpose operation. The point spread function (PSF) in free space characterizes the system’s response to a point source, while the transfer function analyzed in the Fourier domain represents the response to plane waves. In phase space, since the WDF is the complete signal representation, its propagation can respond to both the point source and a single frequency, simultaneously and mathematically. It not only seamlessly combines the spatial domain and Fourier domain but also establishes a strong connection with the concept of “light rays” in geometrical optics. The corresponding ray spread function, which manifests as the response of the product of two Dirac delta functions , forms a double Wigner distribution.
In scenarios where the Hamiltonian is limited to quadratic terms at most, the analysis of wave propagation can be elegantly achieved through straightforward matrix transformations. Linear canonical transform (LCT) emerges as a robust instrument for characterizing the behavior of light during its propagation within first-order optical systems [48]. By employing the paraxial approximation, the trajectories of light rays within such optical systems can be efficiently described using a fundamental ABCD matrix, which is a specific form of the LCTs. The ray transformation matrix in phase space is applied as , and the propagation relationship between on the image plane and on the object plane is expressed as [40,49]
In addition, the WDF transport equation of coherent light in free space is where is the propagation distance, is the wavelength, and denotes the wave number. Equation (3) has the following solution:
Considering the low-band-limited characteristics of general objects and the applicability of sampling theorem, adapting and the transformation of , has the following form: where is the identity matrix. contains the propagation characteristics of to , having the ability to analyze the complete propagation pattern. For convenience, we adopt its 2D form, which can be effortlessly extended to four dimensions: is a concise Fresnel transform matrix, which is a two-order LCT. This straightforward matrix encapsulates extensive Fourier analysis features [50]. In Ref. [51], it has been proven that certain optical transformations, such as the Fourier transform, Fresnel transform, quadratic phase modulation represented by single-lens modulation, and fractional Fourier transform, all have their corresponding matrix forms in phase space. Moreover, these matrices can be decomposed into LCT matrix cascades based on the conservation of SBP for numerical tracking. The LCTs corresponding to various transformations or modulations applied to in phase space are detailed in Appendix A. In phase space analysis, to evaluate the accuracy of a diffraction calculation method in a single numerical diffraction calculation or tasks such as phase retrieval in inverse design, it is essential to provide an appropriate matrix cascade while ensuring the conservation of the SBP to correspond to the respective computational operations. Meanwhile, the LCT matrix cascades, which are employed for free-space diffraction, can be designed, and they presumably have the capability to degenerate into under specific conditions. In essence, this leads to a universal framework where we can extend an infinite number of methods to calculate light propagation, catering to diverse needs, especially in free space diffraction within first-order optical systems. That is to say, in the phase space context, researchers or engineers can design diffraction computation methods independently and provide corresponding matrix cascades which must closely align with or . Alternatively, or can be directly decomposed into a few operations like Fourier transforms.
To validate our perspective and break the size limitations introduced by ASM and SFT, a decomposition approach with variable-scale characteristics would be proposed for the calculation of coherent imaging and display. To facilitate the operations of FFT in either spatial domain or Fourier domain, can be decomposed as follows: where and is a non-zero constant. is the scaling matrix, with being the scaling factor, i.e., a magnifier. and are the Fourier transform matrix and the inverse Fourier transform matrix, respectively. The phase space diagram (PSD) is defined as the region within phase space where WDF is non-negative. In Fig. 1(a), a representative PSD is depicted, characterized by a spatial extent and a bandwidth . Although it can be approximated through methodologies such as signal superposition [46], finding its physical entity is challenging. In this context, our focus is solely on illustrating the transformation trajectory of it. Figure 1(a) illustrates the transformation process of the PSD. It clearly shows the process of clockwise (CW) rotation, counterclockwise (CCW) rotation, and magnifying and shearing of the PSD under the applications of each LCT according to Eq. (7). The same processes of ASM and SFT are easy to be delivered (see Appendix A). The decomposition in Eq. (7) transforms a simple single phase space transformation into a series of chirp modulation and Fourier transformation operations, which is efficient and computationally tractable for implementation on a computer. Therefore, in spatial domain, the diffraction process in free space can be described as where and correspond to the coordinate systems on the object plane and the image plane, respectively. is a complex constant, and denotes linear convolution. The chirp modulator is equivalent to in phase space, and similarly, is equivalent to . In addition, is the expression of in the spatial domain. The transformation operator corresponds to . Although does not result in any computational operation during numerical computation, in physical terms it directly maps the field distribution of system to system. This results in the calculated field distribution possessing variable-scale characteristics. In the case of , Eq. (8) collapses into a typical Fresnel form. Moreover, it shares similarities with Schmidt’s description [32]. In the context of a fixed object plane with fixed sampling procedures, we retain the freedom to control the spatial extent of the image plane, i.e., the tunability of spatial frequency, while conserving the SBP. We refer to this method, allowing precise governance over the spatial frequency, with the spatial frequency tunable method (SFTM). It is a tool with potential capabilities rooted in phase space analysis, applicable for solving problems in specific realms such as beam shaping, holography, metasurface design, and pixel mismatch in end-to-end optimization [52]. It provides a powerful and easily implementable computational tool for the diffractive optics community, enabling them to break free from the confines of traditional methods. Figure 1(b) demonstrates the variable-scale capability and the automatic aberration correction capability of the SFTM.
Figure 1.Modulation process of the SFTM and demonstration of variable-scale holography. (a) Schematic diagram of the transformation process of the SFTM in phase space for . A typical PSD with a spatial extent and a bandwidth is sheared in the -direction through chirp modulation and then undergoes coordinate transposition through simple Fourier transform. The PSD performs an inverse Fourier transform after shearing again and then magnifies it with a magnifier. After the last chirp modulation, the PSD becomes the Fresnel form of its original state. (b) Demonstration of full-color holography without pre-processing using SFTM based CGHs.
The relationships and constraints (CSTs) of , , and several other quantities are illustrated in Table 2 (the sampling criteria analysis for complex amplitude and intensity is detailed in Appendix B). CST 4 offers two distinct sampling approaches, sampling in the Fourier domain and sampling in the spatial domain, to accommodate various scale transformation requirements. The partitioning of the sampling region, introduced by CST 4, is referred to as the spatial frequency sampling region (SFSR) and spatial sampling region (SSR), respectively. Figure 2(a) illustrates the permissible region of allowed and values for , , and scenarios. Meanwhile, the restrictions of both ASM and SFT have been delineated, where they represent only a segment of a straight line in the space. The threshold is the boundary between ASM and SFT. It is evident that, with proper design, values for can be quite flexible, and it goes far beyond the conventional Fourier analysis represented by ASM, SFT, and their derivatives.
Sampling CSTs for the SFTM
CST
Expression
CST 1
CST 2
CST 3
CST 4
CST 5
Figure 2.(a) Allowed - space of the SFTM for , , and . The solid dots represent the data set used for the monochromatic CGH experiment. Plots (b) and (c) show the comparisons of slices of the analytic and SFTM based numerical results at and , respectively.
Figures 2(b) and 2(c) show the comparison of amplitude between the SFTM based numerical results and the analytical solutions for a square aperture illuminated by a converging spherical wave when . The focus of the spherical wave is at the same distance from the square aperture as the diffraction distance . When is 200 mm and the scaling factor is 0.8, the peak signal-to-noise ratio (PSNR) of the numerical result is 57.23 dB. When is 800 mm and is 5.0, the PSNR is 63.67 dB. Additionally, when and the aperture width is 2 mm, the PSNR of the numerical result is 42.34 dB at 100 mm () and is 47.11 dB at 800 mm (). The PSNR values are all greater than 40 dB, proving that the SFTM is quite robust.
3. DEMONSTRATION BY CGHS
We employ a reflective 2D phase-only spatial light modulator (SLM) with a pixel size of 8 μm to verify the accuracy and practicality of the SFTM. The inverse diffraction method has been employed to design -pixel CGHs. A slight perturbation from the correct calculation may result in the “butterfly effect” in inverse diffraction after interactions [53]. Although the singularity of inverse diffraction kernels can be disregarded in the design of CGHs for display under homogeneous light illumination, it is nearly impossible to accurately simulate and experiment with the correct image if the diffraction calculation model deviates from . Traditional terminology refers to this situation as non-convergence. We utilize a three-stage iterative Fourier transform algorithm (IFTA) based on adaptive constraints in the Fourier domain (the algorithm flowchart is shown in Appendix C) instead of the traditional monotonous Gerchberg–Saxton (GS) algorithm to design CGHs with a signal window (SW) of pixels. This choice is motivated partly by the high sensitivity of the GS algorithm to local optima. On the other hand, practically, the phase modulated by the SLM is quantized into 256 steps, and achieving continuous control of phase is often impractical in the manufacturing process of diffractive optical elements. Importantly, the three-stage IFTA demonstrates superior reconstruction accuracy compared to the traditional GS algorithm, exhibiting higher values of the structural similarity index measure (SSIM) and lower values of root-mean-square error (RMSE) after hundreds of iterations [54].
Figure 12 in Appendix F illustrates the experimental setup for the monochromatic display of the CGHs. Figure 3(a) shows a comparison of the images projected by CGHs designed with ASM and SFTM within a propagation distance shorter than the threshold . The actual size of the image is directly proportional to the scaling factor . When , the size of the image within the SW is 6 mm. Since was set to 1 and can be seen within the oversampled region of transfer function for ASM at , 50 mm, 60 mm, 80 mm, and 100 mm [specific parameter selections have been marked with solid points in Fig. 2(a)], the rabbits designed with ASM and SFTM exhibit almost identical effects, without apparent aliasing or twin images. Similarly, as indicated by the solid points in the SFT regime in Fig. 2(a), we compared SFT and SFTM at the propagation distances of ; see Fig. 3(b). Letting SFTM have the same linear scaling characteristics as SFT, we captured the enlarged images at , , 240 mm, 280 mm, and 320 mm, respectively. In Fig. 3(b), the remarkable congruence among the cat images, acquired through three-stage IFTA using SFT and SFTM, is evident. This substantiates that the inverse algorithm, governed by the SFTM, not only upholds for , where SFTM serves as a high-order alternative to ASM, but also showcases identical linear scaling characteristics to SFT for . Consequently, the SFTM can seamlessly substitute SFT in this regime. Additionally, holographic image projection experiments were conducted across different magnification ranges within the permissible regions of the space. As depicted in Fig. 3(c), a 0.8× chick and a 1.5× chick were captured at and , respectively. Likewise, , , and peacocks were projected at , 210 mm, and 260 mm.
Figure 3.Experimental results of SFTM algorithms under various circumstances. Comparison of the SFTM to (a) ASM and (b) SFT illustrates that within a considerable diffraction distance, the SFTM has almost the same effect within the applicable range of ASM and SFT. (c) The scaling ability of the SFTM is presented under different and . All values have been marked with orange solid dots in Fig. 2(a). (d) Implementation of the SFTM’s long-distance and extreme magnification capability by projecting a 20× shark towards a distance of 1800 mm.
In reality, most constraints of the sampling criteria originate from the Nyquist sampling theorem [3], which can be slightly relaxed in practical operations. On the other hand, allows the size of the support on the image plane extending to . By employing suitable filtering techniques, we can try to slightly break through the limitations of the allowed space for the SFTM, leading to further breakaway from the limitations of conventional Fourier methods. Similarly, in Fig. 3(c), a chick out of the allowed space is projected at , and at another location of , a peacock, just beyond the boundary of the constraints, is projected. They both yield excellent imaging results. Furthermore, the SFTM algorithm’s capability for long-distance projection is assessed by enlarging an image of a shark to 20 times its original size at , as depicted in Fig. 3(d). In conventional Fresnel or Fraunhofer models, the achievable magnification factor is typically limited to around 15.
Conventionally, the far-field projection of holographic images relies on SFT based IFTA or FM based IFTA. However, chromatic aberration exists, wherein the size of the image plane is directly proportional to the wavelength as depicted in Fig. 4(a). Holographic image distortion within a large field of view (FOV) was usually addressed through two methodologies: image pre-processing and hologram correction. Nevertheless, both are computationally intensive. They also do not fundamentally alter the intrinsic linear-scale characteristics. Importantly, fidelity will be compromised. As previously highlighted, our model is applicable for long-distance holographic image projection. By employing the SFTM based three-stage IFTA for holographic design, not only can the crosstalk be further minimized, but also the linear-scale mismatch introduced by Fresnel-FFT algorithms can be rectified. This approach eliminates the need for intricate and constrained pre-processing procedures. Moreover, the scale of the image plane is adjustable, offering substantial flexibility for the realization of full-color holographic displays. We decoupled a color windmill image based on the three primary colors and let them iterate within their respective color channels. In Fig. 4(d), at a propagation distance of , a magnification of a full-color holographic image projection is obtained with a PSNR of 21.71 dB. Its optical reconstruction result is shown in Fig. 4(e). Notably, this process was conducted without pre-processing or chromatic aberration correction. The SFT based numerical and optical reconstruction results are presented in Figs. 4(b) and 4(c), respectively. The numerical reconstruction exhibits a lower PSNR, and the quality of the optical reconstruction is inferior to the results obtained with the SFTM.
Figure 4.Comparison of SFT (Fraunhofer) and the SFTM in full-color holography. (a) Chromatic aberration comparison using the SFT (Fraunhofer) based algorithm and SFTM based algorithm under vertical illumination in diffractive optical elements. (b) and (c) present the numerical reconstruction results for the two methods, while (c) and (e) are the respective optical reconstruction results.
During the general 3D reconstruction process, different depths of a 3D object may require varying detail levels. Typically, SFT is used for reconstructing large 3D objects, obtaining 3D Fresnel holograms, as the image plane size of SFT linearly increases with the propagation distance. However, the size of an image plane at a particular depth remains fixed, while the 2D image at that depth may vary depending on the object and its position. Pre-scaling of the 2D target images at each depth is generally required for 3D reconstruction. This inevitably leads to fidelity reduction in the entire 3D scene. Recently, holograms of large objects with up to hundreds of planes were reported [4], offering an effective solution for future dynamic large-field 3D holography. However, this remarkable work is still based on the Fresnel method, requiring pre-processing of the 2D target image at each depth, which not only compromises fidelity but also consumes excessive computational resources. If our model can be utilized in tomography involving two or more image planes, significant improvements on addressing these issues can be expected.
In the previous section, the scaling capabilities and chromatic aberration correction abilities of the SFTM were validated. Therefore, we conducted experiments to evaluate the multi-plane imaging capabilities of the SFTM algorithm. Figure 5(a) depicts a schematic illustration of the SFTM based tomographic reconstruction. In this representation, a fixed-size pixel on the hologram plane can be mapped to different image planes, each with pixels of varying sizes. This emphasizes the variable-scale capability of the SFTM in tomographic reconstruction. Utilizing the tomography algorithm (see Appendix C for details), we imaged three pentagrams of different internal structures with magnification of , , and at depths , 400 mm, and 700 mm, respectively. As shown in Fig. 5(b), a comparison of three identical duck objects reveals significant variations in magnification among the distinct pentagrams. For 3D Fresnel holograms, the depth of focus (DOF) of each image plane is a crucial factor influencing 3D reconstruction [4]. It is directly affected by the relation , where is the serial number of the image plane. The requirement for low crosstalk on image planes imposes a significant constraint on the 3D reconstruction of 3D Fresnel holograms, where is an empirical parameter. To mitigate this limitation, that is, to decrease DOFs, one relatively straightforward approach, without considering additional optimization or rearrangement of Fresnel zone plate phases, is to increase the resolution of the hologram. However, it is often impractical and can lead to resource consumption and waste. We note that, under the same projection depth, the increase in magnification leads to an elongation of the DOF, as is commonly known in the case of Fresnel holograms. Figures 5(c) and 5(d) show the results of two-layer full-color tomography. The rainbow flower is positioned at with a magnification of 1.5, and the cube is at , enlarged by . It can be seen that when is appropriate and the magnification is not excessive, image crosstalk from the other plane is not significant. But in Fig. 5(b), we can see that the pentagram on the middle image plane is impacted by crosstalk from the 10× pentagram on the farthest image plane, whereas the pentagram on the nearest image plane experiences minimal interference. The settings for these magnifications of the pentagrams serve two purposes: first, to verify whether the SFTM possesses sufficient diffraction modulation capability, and second, for the convenience of capture. In numerous scenarios, achieving such extreme magnification may not be necessary. Therefore, it becomes viable to project images with varying spatial frequencies onto distant and multiple image planes. This approach can effectively overcome the constraints of 3D Fresnel holograms, paving ways for the design of high-quality, super-multi-plane 3D variable-scale holograms.
Figure 5.Implementation of variable-scale tomography. (a) Schematic of the SFTM based tomographic. The pixel sizes on three different image planes at different depths can be manipulated by SFTM algorithms as the SBP is conserved. (b) Three tomography images of , , and pentagrams at depths of , 400 mm, and 700 mm were projected, respectively. (c), (d) The results of the two-layer full-color tomography experiment. The rainbow flower locates at , and the 2.0× cube locates at .
The superiority of metasurfaces over traditional refractive and diffractive devices is evident not only in their ultra-thin features but also in their powerful capability for controlling multiple wavelengths and facilitating efficient polarization reuse. In our previous work [24], we implemented a metasurface design characterized by low crosstalk and featuring a tri-polarization-channel configuration for holographic display. Like most designs of full-color holographic metasurfaces, the previous design relied on the conventional Fraunhofer model, necessitating intricate pixel correction operations for different color channels. It will inevitably result in reduced resolution and fidelity. If the variable-scale model can be employed in the context of metasurface holography to validate its applicability at nanoscale, this problem will be effectively solved. Furthermore, researchers would have access to a more powerful tool for diffraction calculations in metasurface design, freeing themselves from the constraints of ASM and SFT.
Figure 6(a) illustrates the schematic of the near-zero crosstalk metasurface holography using the SFTM based algorithm. The letters “H,” “N,” and “U” corresponding to different colors should all have the same size. Figure 6(b) illustrates the meta-atom design of the metasurface. The nanopillars, which can produce unique phases on mutually orthogonal linear polarizations, are fabricated on a square-shaped substrate with a period of . They possess three tunable freedoms, i.e., parameters , and rotation angle , with a fixed height of . The degree-of-freedom of the Jones matrix can be dynamically controlled by the three parameters of the meta-atom [24], achieving excellent broadband transmission and conversion efficiency characteristics of the metasurface. We decouple the light of the three colors, red (), green (), and blue (), into different polarization channels, minimizing crosstalk between various polarization states. Then, on the image plane, they will be superimposed based on the principles of the three primary colors. Leveraging the broadband transmission characteristics of the meta-atom, the polarization states of incident green and red lights are set to -polarized and blue light is set to -polarized. The output red light is set to -polarized, while green and blue lights are set to -polarized. We decouple the target color image into red, green, and blue channels. Each channel component is allowed to undergo the three-stage IFTA process. The SFTM model is used for the reconstruction of the phase-only holograms (see Appendix D). The size of the metasurface is , encompassing units of meta-atoms. For the incident light of red, green, and blue, the allowed spaces are identified in Fig. 8. Significantly, when designing the metasurface, special consideration must be directed towards the allowed space for blue light incidence. This is particularly crucial since blue light exhibits the weakest diffraction capability, as constrained by the size of . Leveraging the lookup algorithm, we systematically identify and intricately arrange the appropriate meta-atoms with fixed parameters , , and . This meticulous arrangement allows precise control over the Jones matrix, ensuring compliance with the specifications of three channels, diverse polarization states, and distinct phase distributions. As a result, this systematic procedure leads to the assembly of the metasurface.
Figure 6.Full-color SFTM based metasurface holography design. (a) Schematic of full-color metasurface holography. (b) A meta-atom with three independent tunable structure parameters , . Scanning electron microscopy (SEM) images of the holographic metasurface in (c) oblique-view and (d) top-view are presented. (e) Simulation and (f) experiment results for letters “H,” “N,” and “U.”
The finite-difference time-domain (FDTD) method is employed to calculate the electromagnetic field for meta-atoms. Subsequently, the metasurface design is translated into sample through the electron beam lithography (EBL) followed by the reactive ion etching (RIE) during the sample fabrication procedure (see Appendix E for details). We designed a holographic metasurface with and . On the image plane, the same-size letters “H,” “N,” and “U” were projected, corresponding to the red, green, and blue colors, respectively. Figures 6(c) and 6(d) show the SEM images of the designed metasurface in oblique-view and top-view. As shown in Figs. 6(e) and 6(f), the simulation and experimental results closely match, with negligible crosstalk between the color channels (the characterization setup for the metasurface sample can be found in Appendix F). The fabricated metasurface samples may exhibit minor deviation from the design. These subtle differences lead to an actual wavefront passing through a metasurface not entirely the same as the designed one. Therefore, the experimental results shown in Fig. 6(f) exhibit a slight blurring phenomenon.
The observable low crosstalk superposition capability among the three primary colors highlights the effectiveness of the meta-holography system. Our demonstration underscores the applicability of the SFTM in nanoscale holography, effectively resolving the chromatic aberration issue caused by diffraction algorithms, thus paving the way for diverse applications in advanced coherent imaging and display.
6. DISCUSSION
The matrix cascade designed in our work provides powerful computational applications and extremely high-degree-of-freedom for diffraction calculations. It is evident that both the liquid crystal array in the SLM and the meta-atom in the holography metasurface provide phase modulation capabilities spanning nearly 0 to . Consequently, the theoretical bandwidth of the WDF in phase space can be broadened. Conventional IFTA is band-limited; that is why there is not a “sufficiently large” region in the allowed space. Breaking this limitation is not infeasible but may result in a reduction in image quality. If we need to consider ultra-wideband scenarios, the WDFs of certain transfer functions or PSFs may not be a simple Dirac delta function but could exhibit distorted tails on the space or frequency axis. In such cases, the highest order of the Hamiltonian might break the quadratic limitation, making the decomposition by LCTs exceedingly challenging. However, this is a separate topic and holds significant importance in the shaping and optimization of the PSF (or transfer function).
The design of the variable-scale model offers a novel paradigm for coherent imaging and display. Compared to ASM and SFT, it is not overly complex and can be implemented by FFT based algorithms, providing a powerful and efficient method for various diffraction calculations. However, SFTM is not always the optimal choice in every situation. An SFTM based single calculation requires two or three FFTs, whereas SFT only needs one. In far-field holography, if the image quality and magnification are not important but the generation time consumption is critical, the SFTM might not be the best option. In near-field holography, if the magnification is close to 1 and precise reconstruction of the diffraction field is required, ASM might be a better choice. This is because, in ultra-wideband scenarios, the form of the propagation matrix might not be .
In the design of full-color holography, each CGH corresponds to a specific color. If we want to design holographic videos, a control module for the synchronous switching of the three-color CGHs and the lasers is required. This reduces the holographic frame rate to one-third of the SLM refresh frame rate. Our future focus will be laid on the design of a single full-color hologram by SFTM. By simultaneously illuminating a single hologram with lasers of three colors, a full-color holographic image can be directly displayed, thereby maintaining the display frame rate of holographic videos. Complemented by GPU acceleration, the SFTM exhibits the potential to realize dynamic, high degree-of-freedom, and high frame-rate full-color displays in future applications.
7. CONCLUSION
In summary, from the perspective of WDF transportation characteristics, a framework was proposed under the principle of matrix decomposition. Using this framework, a Fourier analysis method named the SFTM with variable-scale diffraction calculation characteristics is derived, which offers much higher degrees of modulation freedom than traditional methods. The effectiveness of SFTM based algorithms have been demonstrated, highlighting their powerful scaling control and automatic chromatic aberration correction capability. Additionally, the SFTM is a superior method for 3D Fresnel holography compared to SFT or FM. A meta-atom design strategy is implemented that maps polarizations respectively to color channels. Near-zero crosstalk, variable image scaling, and full-color metasurface holography are achieved. This effectively resolves the longstanding issues of resolution and fidelity reduction in the metasurface community.
Finally, our variable-scale model is robust enough to provide a significant degree-of-freedom for most coherent imaging and displays applications. This makes it widely applicable in beam shaping, diffractive optical element design, CGH, 3D displays, HUDs, and meta-holography.
APPENDIX A: PHASE SPACE ANALYSIS
The WDF of is real-valued but does not possess complete non-negativity, which prohibits it from being directly interpreted as an energy density. However, it can still be integrated along the spatial frequency vector, resulting in the intensity distribution for a two-dimensional optical field:
Alternatively, integrating along the spatial vector provides spectral information:
The symbol * denotes taking conjugation, and is the Fourier transform of . Equations (A1) and (A2) can serve as a direct representation of the diffraction pattern and be utilized for extracting spectral features.
For a two-component multiplicative signal , its WDF can be expressed as the convolution of the individual WDFs along the spatial frequency vectors for each component: where represents the linear convolution. For spherical wave illumination or chirp modulated light field signals, their WDF in phase space can be simply regarded as the convolution of the original signal and Dirac delta function, which is convenient for characterizing the transformation process of the PSD.
For a two-component convolved signal , its WDF can be written as
The PSF of free space diffraction takes on a quadratic phase form. It can be concluded that the light field signal on the object plane convolved with that PSF is equivalent to the convolution of WDF of with the Dirac delta function along the -axis.
For the light field , the linear canonical transform has the following relationships with the transmission matrix in phase space. Without considering the singularity of , in case that det , or else in the limiting case of ,
Since the multiplicative pure phase factors have no effect on the WDF of , if , , it can be seen that Eq. (A5) is the Fourier transform of signal . Therefore, the “clockwise rotation” of matrix by corresponds to the Fourier transform of , serving as a specific case of fractional Fourier transform. Similarly, “counterclockwise rotation” of matrix by corresponds to the inverse Fourier transform operator. On the other hand, when considering the case of and setting , according to Eq. (A6), chirp modulation will be obtained. The corresponding chirp modulation matrix is denoted as , where is a non-zero constant. If we intend to magnify the WDF, or perform a scaling transformation, it is necessary for to be zero according to Eq. (A6). As the conservation of SBP is guaranteed, represents a simple mapping with a scaling factor , i.e., mapping the coordinate to . The mentioned LCT operators correspond to rotation, shearing, and stretch deformations for the PSD of in phase space, respectively. In the one-dimensional case, the situation becomes simpler and more intuitive.
ASM and SFT are the most widely used approaches for diffraction calculations, especially in cases involving inverse diffraction calculations. Considering the one-dimensional scenario, in phase space, ASM decomposes the transform matrix into and can also be decomposed by SFT as follows:
Figure 7 shows the transformation process of the PSD with a spatial extent and bandwidth for ASM and SFT. In addition, the PSF convolution method used for far-field calculations also has a similar process, and the difference is that the PSD needs to be convolved with a “straight line” (at least under paraxial approximation) in phase space. These different methods ultimately lead to the same result, where the PSD is horizontally sheared by .
Figure 7.Schematic diagram of the PSD transformation process for the (a) ASM, (b) SFT, and (c) PSF convolution method in phase space.
For the sake of convenience, the one-dimensional case will only be considered in the following discussion, and it can be readily extended to two dimensions. When the object support on the object plane is illuminated vertically by monochromatic plane waves, the maximum image support on the image plane is given by where is the length of the object support, is the length of the maximum image support, is the wavelength of the plane wave, and represents the propagation distance. Since the object support is band-limited, the maximum diffraction angle is determined by , where is the bandwidth of the object.
Therefore, if the output field is calculated by the SFTM, the support of the image plane should not exceed the maximum image support: that is,
Equation (B3) is referred to as Constraint 1. Furthermore, it is worth mentioning that should be satisfied in inverse design; otherwise, information will be lost in the form of evanescent waves.
When the image plane needs to be reduced in size, it cannot capture information from the entire extent of the object support if the object bandwidth is too narrow, that yields to
Equation (B4) is called Constraint 2.
In the calculation of the SFTM, conducting sampling in the spatial domain or frequency domain primarily serves to mitigate the issue of inadequate sampling of the quadratic phase factors. For the quadratic phase factor , which is directly multiplied by , the Nyquist sampling criterion requires and it leads to where is the number of sampling points on the object support. Equation (B6) is set to be Constraint 3.
For the sampling of the quadratic phase factor , the criteria provide two distinct operational approaches to align with varying conditions. Because is described in the Fourier domain, there are two fundamental methods for sampling it: direct sampling in the Fourier domain or sampling in spatial domain and then taking the Fourier transform of it. The Nyquist sampling criterion provides completely opposite (or complementary) sampling requirements, and it can be derived that where is the number of sampling points for fully describing the signal on the object plane. The two complementary inequalities in Eq. (B7) are called Constraint 4.
Up to this point, only the quadratic phase factor remains to be taken into consideration. If a certain work involves multiple diffraction and inverse diffraction calculations, with a focus on distributions of complex amplitude, then the sampling of is critical and requires stringent attention. As per the analysis presented earlier in this text, applying the Nyquist sampling criterion, it leads to
Equation (B8) is referred to as Constraint 5.
Figures 8(a)–8(c) respectively depict the allowed space at cases when , , and , where and is set to 1000 as half of . The cases intuitively demonstrate that the allowed space becomes narrower as the wavelength of the incident light shortens. Simultaneously, it is readily apparent that the scaling range of the SFTM is significantly broader than that of ASM and SFT.
Figure 8.The constraint relationship between and in cases (a) , (b) , and (c) , respectively.
When the LCTs are operated to the WDF of the signal, the SBP in phase space remains constant. However, the support along both the space-axis and frequency-axis directions will expand or contract. Therefore, sampling points of on the object plane may potentially lead to insufficient sampling. In this paper, uniform sampling is the primary method in the SFTM because FFT can meet the vast majority of computational requirements. The number of sampling points for fully describing the signal should cover the SBP as comprehensively as possible. Considering a typical signal , which has a simple rectangle PSD in phase space with bandwidth and spatial extent , its four vertices in phase space are as follows:
Applying the LCTs to in phase space, the WDF of will undergo affine transformation such as rotation and shearing. This directly leads to the number of sampling points , which is under uniform sampling, being larger than . Taking the four vertices of in phase space as an example, during the process of , is an LCT operator with an LCT matrix , and the bandwidth and the spatial extent change into where is the distances matrix, and the operator is used to perform the operation of extracting the absolute maximum element from each row and placing its absolute value in the corresponding row of a new matrix [51]. For , Then, the sampling points on the object plane may be not less than
Since SFTM does not involve fractional Fourier transforms, during the execution of the SFTM, WDF or PSD will not undergo rotations that are non-integer multiples of . In other words, rotations do not cause a change in the sampling points . Additionally, the magnifier will not impact . Judging from this, it can be seen that the quadratic phase factors are the primary determinant affecting , so effectively addressing the shearing introduced by the quadratic phase factors during the SFTM operation is crucial for determining . One of the advantages of analyzing the selection of in phase space is that, under the influence of LCTs, the deformation of WDF is intuitive, thereby avoiding intricate and stringent conditions in the space or Fourier domain.
Currently, is employed once again to analyze the issue of selecting . For , By applying Eqs. (B10)–(B12), the required number of sampling points for the first chirp modulation satisfies
Similarly, for the remaining operations of the LCTs of SFTM, Therefore, can be determined by
In general, once the bandwidth is fixed, can be determined by scaling factor and propagation distance . Alternatively, we can determine first and then assess whether and meet the experimental requirements accordingly.
The intensity distribution is the only observable quantity for the human eye. For a single diffraction case in free space, only the intensity pattern is of concern, and then whether is well-sampled becomes irrelevant. From the perspective of phase space, the WDF of intensity manifests as a correlation operation of that of the complex amplitude along the frequency axis [46], i.e., The local bandwidth of is twice that of , so the global bandwidth of intensity is twice the maximum local bandwidth of : Given the propagation distance , the value of is taken as
In other words, when , the correct sampling of the complex amplitude is sufficient to ensure the correct sampling of the intensity. When , the sampling requirements for intensity are stricter than those for the complex amplitude. Additionally, increasing the sampling frequency, which means reducing , also ensures the correct sampling of the complex amplitude. In our algorithm’s verification of CGHs, if , sampling at an interval half the size of the pixel guarantees correct sampling for both intensity and complex amplitude.
APPENDIX C: ITERATIVE ALGORITHMS
As Fig. 9 shows, the three-stage IFTA consists of three stages [54]. The first stage involves the traditional GS algorithm, obtaining initial phases constrained by local optima. In the second stage, optimization of the SSIM of the diffraction pattern is achieved by minimizing a standard function, i.e., the deviation between the actual diffraction image and the ideal image. This may involve a trade-off with diffraction efficiency, but for holographic image display, diffraction efficiency is not a critical metric as long as the diffraction effect is satisfactory. In the third stage, considering the practical manufacturing process of optical elements and the stepped nature of SLM’s phase modulation, soft quantization is introduced to enhance the practicality of the algorithm. At the beginning, random phase can be attached to the object plane or to the image plane. In order to facilitate the implementation of the tomography, we uniformly attach the random phase to the object plane in this work, and the input for each stage after the first stage is the phase distribution of the object plane after the end of the previous stage.
The first loop converges when the SSIMs of the three holographic images are greater than 0.5, and the second loop converges when the three SSIMs are greater than 0.7. In our experiments, the two loops required no more than 20 iterations. A three-layer monochrome CGH takes approximately 26 s to be generated, while CGHs for a two-layer full-color image required 60 s.
APPENDIX D: DESIGN OF THE FULL-COLOR HOLOGRAPHIC METASURFACE
We keep consistent across the three channels to ensure that the image plane sizes of the three channels remain uniform. After obtaining the phase maps of each channel, we construct the metasurface by adaptively selecting meta-atoms that fulfill all corresponding polarization conditions and phases using a lookup algorithm. As the polarization states of incident and output light are determined, we present in Figs. 11(b) and 11(c) the relationships between the phase-change characteristics and transmission properties of green light with respect to the variations in and . The parameters of the meta-atoms are highly flexible, as achieving phase variations from to while maintaining high transmission properties is evidently achievable. The phase sweep for meta-atoms is performed on a computer with a 13th Gen Intel® Core™ i9-13900K CPU and 128 GB RAM, taking around 16 h in total.
Figure 11.(a) Algorithm flowchart of full-color metasurface holography. The (b) phase shift and (c) transmittance of the green light obtained by sweep operations are shown.
APPENDIX E: FABRICATION OF THE HOLOGRAPHIC METASURFACE
First, an 800-nm-thick layer of polymethyl methacrylate (PMMA) electron beam resist was applied to the quartz substrate and baked on a hot plate at 180°C for 5 min. Subsequently, this resist layer was exposed by EBL (Raith 150two) at a beam current of 200 pA. The sample was then immersed in a mixture of methyl isobutyl ketone (MIBK) and isopropyl alcohol (IPA) () for 1 min and left in IPA for 30 s. The reverse hole shape structure of the nanopillar was determined. Then, we deposited a 300-nm-thick layer using an atomic layer deposition system to fill the holes in the resist. In addition, there was a 300-nm-thick layer retained throughout the top of the sample. Therefore, we proceeded to etch the top layer using inductively coupled plasma reactive ion etching for 90 s and then remove the residual photoresist between the structures. Ultimately the tri-polarization-channel dielectric metasurface was obtained.
APPENDIX F: EXPERIMENTAL SETUP
This study utilized the HOLOEYE Photonics AG company’s SLM with the model designation PLUTO-2.1 for conducting CGH experiments. We loaded phase-only holograms onto the SLM using MATLAB and software provided by HOLOEYE on a computer. Figure 12 illustrates the experimental setup for the display of the CGHs. S-polarized collimated light with , expanded, modulated, and shaped, is incident onto the SLM. The reflected light modulated by SLM then passes through a system and images on a CCD camera or a holographic screen after filtering.