High-resolution dual-polarization single-pixel imaging through dynamic and complex scattering media using random-frequency-encoded time sequences

Zian Wang; Tianshun Zhang; Yin Xiao; Zhigang Liu; Wen Chen

doi:10.1364/PRJ.569507

1. INTRODUCTION

Enhancing the imaging quality in scattering environments remains a challenge in optics [1 –8]. The inherent inhomogeneity of scattering media induces multiple refractions, distorting the wavefront [9]. This phenomenon fundamentally limits optical imaging, as real-world scenes could exhibit complex and dynamic scattering behaviors. To address wavefront restoration, numerous approaches have been reported, e.g., memory effect [10 –12], ballistic light imaging [13 –16], matrix transfer inversion [17 –19], and optical phase conjugation [20 –23]. Among these, single-pixel imaging (SPI) has emerged to be promising for solving scattering problems [24 –27], primarily due to its reliance on light intensity fluctuations for object reconstruction. Previous studies have demonstrated that SPI achieves high-quality reconstruction, when static scattering media are placed in the optical channel. Although SPI performs well in scattering media, it still faces a challenge when applied in complex environments, i.e., simultaneous distortions of illumination and detection paths.

When dynamic scattering media exist between the light source and an object in the illumination path, the scattering degrades illumination patterns, causing them to deviate from the pre-designed ones [28]. Furthermore, dynamic scattering media would induce a series of dynamic scaling factors that disrupt beam correlation [29] and lead to a failure of SPI. Therefore, SPI in dynamic scattering media remains a challenge, requiring robust reconstructions while preserving computational efficiency. While deep learning [30 –32] and iterative algorithms [33,34] have yielded promising results, they could depend on prior knowledge about scattering media and suffer from high computational cost. Correlation imaging [28,35] can enable object reconstruction in static scattering environments. However, its feasibility in dynamic scattering environments remains unexplored, and a further improvement of imaging quality is required. The strategies via a design of illumination patterns (e.g., Hadamard [36], Fourier coefficients [37]) have also been developed and applied. However, in dynamic scattering environments where illumination and detection paths are distorted, previous single-pixel detection techniques relying on the light intensity collection would fail. Other approaches, such as rotating ground glass with CCD-based pattern recording [38], can address pattern degradation problems but still face the limitations in real-world scenarios. It is well recognized that when the illumination and detection paths are severely distorted, high-resolution SPI remains not to be explored.

In this paper, we report high-resolution common-path SPI with dual polarizations that integrates an optical design with random-frequency-encoded time sequences to overcome the challenge in dynamic and complex scattering environments. The method employs a common-path optical configuration with dual polarizations (s- and p-polarized light beams), leading to the path consistency for correcting a series of dynamic scaling factors induced by complex media existing in the illumination and detection paths. To solve the pattern degradation problem caused by dynamic scattering media in the illumination path, a series of random-frequency-encoded time sequences are designed via assigning each pixel along the time axis a random frequency. Via the design of a common-path dual-polarization setup, the effect of the scattering media can be suppressed. During object reconstruction, fast Fourier transform (FFT) can be performed on the corrected single-pixel light intensities. Each pixel value in a reconstructed object image is obtained with a generated amplitude spectrum identified via the pixel position pre-defined with a specific encoding frequency. Scattering-induced noise can be dispersed into artifacts that are removed simply using a mean filter. The proposed method does not require prior knowledge about scattering media, and complex optical components are also not needed. Experimental results demonstrate its superiority over existing methods in achieving high-resolution optical imaging under complex conditions where illumination and detection paths are severely distorted at the same time.

2. PRINCIPLE

Figure 1 shows the principle of the developed SPI system based on a random-frequency encoding scheme with dual polarizations and a common path. A series of time sequences with values in a range of 0–1 is generated, and each time sequence is assigned a random frequency as shown in Fig. 1(a). A generated time sequence is used for a specific pixel along the time axis, and at an instant $t$ we have $β_{x y} = 0.5 \sin (2 π f_{x y} \frac{t}{f_{s}}) + 0.5,$ (1)where $x y$ denotes a 2D coordinate of the pixels, $f_{x y}$ denotes a frequency randomly assigned for a time sequence (i.e., a pixel along the time axis), and $f_{s}$ denotes a sampling ratio. To avoid the aliasing, a relationship of $f_{s} ⩾ 2 f_{\max}$ is adopted to satisfy the Nyquist sampling theorem, where $f_{\max}$ denotes the maximum value of $f_{x y}$ . Here, parameter $f_{x y}$ is randomly set in a range of 1.0–4096.0 Hz with an interval of 1.0 Hz. The frequency interval could be enlarged for easily decoding the frequencies during the reconstruction. As shown in Fig. 1(b), after all time sequences are generated, a series of 2D random patterns can be correspondingly obtained and each pixel along the time axis has a unique frequency identity. The light beam is split by PBS1 into s- and p-polarized light beams with mutually perpendicular polarization states. The s-polarized light beam is sequentially modulated by a series of 2D random patterns embedded into a spatial light modulator (SLM), and the p-polarized light beam is reflected by a mirror. Then, the light beams pass through complex scattering media in a common path where the illumination and detection are simultaneously distorted.

Figure 1.A flow chart of the developed dual-polarization common-path SPI with random-frequency-encoded time sequences in dynamic and complex scattering media where illumination and detection paths are severely distorted: (a) typical time sequences generated for each pixel along the time axis having random frequencies, (b) a designed common-path SPI scheme with dual polarizations (PBS, polarization beam splitter; DSM, dynamic scattering media), and (c) the object reconstruction process.

Download full size

View all figures

After wave propagation through dynamic scattering media existing in the illumination and detection paths, PBS2 is used to separate the s- and p-polarized light beams and the intensity of s-polarized light recorded by a single-pixel bucket detector can be described by $I_{s} = k_{t} \sum_{x} \sum_{y} [(α I_{1} β_{x y} + I_{c - x y}) O_{x y}],$ (2)where $α$ denotes the light transmittance in the illumination path through scattering media, $k_{t}$ denotes dynamic scaling factors induced by complex media at an instant $t$ , $O$ denotes the transmission matrix of an object, and $I_{1}$ and $I_{c - x y}$ denote the s-polarized light intensity incident to SLM and scattered light intensity onto an object, respectively. The complex media in the illumination path cause the degradation of illumination patterns. Then, distorted optical waves propagate through the object and another complex scattering medium. The illumination and detection paths are severely distorted at the same time, leading to a failure of conventional SPI methods. The p-polarized light beam is reflected by a mirror, and is collected by another single-pixel bucket detector described by $I_{p} = k_{t} \sum_{x} \sum_{y} [(α I_{2} + I_{c - x y}) O_{x y}],$ (3)where $I_{2}$ denotes intensity of the p-polarized light beam reflected by the mirror. At an instant $t$ , the collected single-pixel light intensities $I_{s}$ and $I_{p}$ are obtained after the same degradations with the same scaling factors, since a common path is applied. Therefore, parameters $k_{t}$ , $α$ , and $I_{c - x y}$ can be assumed as the same for light intensities $I_{s}$ and $I_{p}$ at an instant $t$ . The series of dynamic scaling factors $k_{t}$ induced by scattering media can be removed by $I_{r} = \frac{I_{s}}{I_{p}} = \frac{\sum_{x} \sum_{y} [(α I_{1} β_{x y} + I_{c - x y}) O_{x y}]}{\sum_{x} \sum_{y} [(α I_{2} + I_{c - x y}) O_{x y}]},$ (4)where $I_{r}$ denotes a corrected light intensity.

After an FFT operation is performed on Eq. (4), we can have (see details in Appendix A) $FFT (I_{r}) \propto \sum \sum δ (ξ - f_{x y} / f_{s}) O + \sum \sum WO,$ (5)where $W$ denotes noise with a Gaussian distribution. It is indicated in Eq. (5) that $I_{c - x y}$ serves as Gaussian-distributed noise. Then, each amplitude spectrum $| FFT (I_{r}) |$ is directly assigned to a corresponding pixel to obtain a reconstructed 2D object image according to the pixel position pre-defined with the specific encoding frequency. Finally, noise in the reconstructed 2D object image can be simply removed via a mean filter. The object reconstruction process is shown in Fig. 1(c). In practice, a spectrum optimization algorithm may be further used to enhance frequency identification accuracy. Here, the FFT operation is directly employed to illustrate the effectiveness of the proposed method.

3. EXPERIMENTAL RESULTS AND DISCUSSION

A. Proof-of-Principle Experiment

To validate the proposed method, a complex scene is designed and employed in optical experiments. As shown in Fig. 2, an amplitude-only SLM with pixel size of 4.5 μm is sequentially loaded with 16,384 2D random patterns, i.e., $f_{s} = 4 f_{\max}$ as 16,384 in Eq. (1). A diode-pumped green laser (CrystaLaser, CL532-025-S) is expanded by an objective lens (40×) to illuminate SLM, and then the loaded patterns can be projected onto an object placed between two established dynamic scattering environments. Here, original patterns with $64 \times 64$ pixels are linearly interpolated to be $512 \times 512$ pixels to satisfy experimental requirements. The dynamic scattering medium placed in the illumination path is established by using a ground glass diffuser (Thorlabs, DG10-1500) being kept rotating. The dynamic scattering medium placed in the detection path is established by using 3.0-mL skimmed milk (diluted with 200.0-ml clean water) to be continuously dripped into a water tank [dimensions of $10.0 cm (length) \times 15.0 cm (width) \times 30.0 cm$ (height)] initially filled with 3000-mL clean water in each experiment, and a stirrer with a speed of 400 r/min is used to generate a dynamic environment inside the water tank. Here, the Beer-Lambert law [39] is used to quantitatively analyze the effect of the water tank placed in the detection path, and the Beer’s coefficient gradually increases to $8.9 \times 10^{- 3} {mm}^{- 1}$ in experiments. At the detection plane, light intensities in the s- and p-polarized light beam paths are simultaneously recorded by using two single-pixel detectors (Thorlabs, PDA100A2).

Figure 2.Schematic of an experimental setup to validate the proposed method. Two lenses with the same focal length of 150.0 mm are used as a 4f system between the SLM and object, and are omitted for sake of brevity. QWP: quarter wave plate; BD: single-pixel bucket detector.

Download full size

View all figures

An object, i.e., Group 2 Elements 2 and 3 of USAF 1951 resolution target, is first tested. Figure 3(a) shows a reconstructed object image using the proposed method for a comparison, when no scattering media are used in Fig. 2. Figure 3(b) shows a reconstructed object image obtained by using the proposed method in dynamic and complex scattering media as shown in Fig. 2, and the reconstructed object image after the further use of a mean filter is shown in Fig. 3(c). Contrast-to-noise ratio (CNR) [40] is calculated to quantify the imaging quality with a signal area (red box) and a background area (green box) indicated in Fig. 3(a). To ensure the consistency and rationality, the same signal and background areas are applied to evaluate experimental results. The CNR value in Fig. 3(a) is 21.37, and CNR values in Figs. 3(b) and 3(c) are 10.34 and 17.28, respectively. It is demonstrated that the proposed method can be used to recover high-quality object images, and is robust against dynamic complex scattering media where the illumination and detection paths are severely distorted at the same time. It is also verified in experiments that noise appears in the form of pixel-level artifacts as shown in Fig. 3(b), and the reconstruction quality can be enhanced via a simple use of the mean filter.

Figure 3.Experimental results: (a) a reconstructed object image obtained by using the proposed method without any scattering media placed in the optical setup in Fig. 2, (b) a reconstructed object image obtained by using the proposed method through dynamic scattering media composed of the rotating diffuser and the water tank, and (c) a reconstructed object image after the further use of a mean filter to (b).

Download full size

View all figures

B. Different Sampling Ratios

To evaluate performance of the proposed method at different sampling ratios, experimental results are obtained and shown in Figs. 3(c) and 4(a)–4(c). The CNR values in Figs. 3(c) and 4(a)–4(c) are 17.28, 3.05, 9.09, and 11.38, when the sampling ratio $f_{s}$ in Eq. (1) is set as $4 f_{\max}$ , $f_{\max}$ , $2 f_{\max}$ , and $3 f_{\max}$ , respectively. It can be found in experiments that as the sampling rate decreases, quality of the reconstructed object images deteriorates using the proposed method. This is attributed to a reduction in frequency resolution used in the designed random-frequency encoding scheme. As given in Eq. (1), $f_{s}$ is determined by the maximum value of $f_{x y}$ . When $f_{s}$ falls below $2 f_{\max}$ , the Nyquist sampling theorem is violated. This violation leads to frequency aliasing, resulting in a loss of high-frequency information, which ultimately causes a failure of object reconstruction. Therefore, it is essential to consider parameter $f_{s}$ to make sure that the sampling theorem is satisfied.

Figure 4.Experimental results: (a)–(c) the reconstructed object images obtained by using the proposed method respectively at a sampling ratio of $f_{\max}$ , $2 f_{\max}$ , and $3 f_{\max}$ .

Download full size

View all figures

C. Different Encoding Schemes

It is further illustrated why the random-frequency encoding scheme is adopted in the proposed method rather than the usage of a sequential-frequency encoding scheme in the proposed method, as shown in Fig. 5. In the sequential-frequency encoding scheme, time sequences for each pixel along the time axis are assigned the frequencies from 1.0 to 4096.0 Hz with an interval of 1.0 Hz in sequence. In experiments, the parameters are the same as those used in Figs. 2 and 3. The CNR values of the reconstructed object images in Figs. 5(b) and 5(d) are 17.28 and 14.40, when the random-frequency encoding and sequential-frequency encoding schemes are adopted in the proposed method, respectively. It is experimentally demonstrated that when the random-frequency encoding scheme is applied in the proposed SPI system, a higher-quality object image can be recovered. In addition, it can be observed that there is much noise at the bottom in Fig. 5(d). This is attributed to the diffraction effect induced by the usage of a sequential-frequency encoding scheme where the frequencies assigned for each pixel along the time axis are in sequence. As shown in Fig. 5(c), the sequential-frequency encoding scheme generates a series of patterns featuring varying numbers of fringes also with continuously varying directions. This implies that the sequential-frequency-encoded time sequences lead to the generation of diffraction orders with dynamically changing direction and spacing.

Figure 5.Experimental results obtained by using random-frequency encoding and sequential-frequency encoding schemes in the proposed method: (a) a schematic of 2D patterns generated by using the random-frequency encoding scheme in the proposed method and (b) a reconstructed object image, and (c) a schematic of 2D patterns generated by using the sequential-frequency encoding scheme in the proposed method and (d) a reconstructed object image.

Download full size

View all figures

When the 2D patterns generated by using the sequential-frequency encoding scheme are used for the modulation in Fig. 2, the designed common path with dual polarizations could be disrupted. The sizes of illumination patterns in the s- and p-polarized light beam paths would not be the same, and the same dynamic scaling factors cannot be created at each instant. In addition, the devices used cannot cover all diffraction orders, causing a periodic modulation. In complex media where the illumination and detection paths are severely distorted, effective light intensities collected by single-pixel detectors undergo significant attenuations. Therefore, noise generated by the undetected higher-order diffraction components leads to a creation of relatively higher amplitudes in the Fourier domain. The low-frequency noise is induced, leading to those at the bottom of the reconstructed object image in Fig. 5(d). The random-frequency encoding scheme is designed and employed in the proposed method to minimize the diffraction effect due to the absence of fringe patterns, and the illumination patterns in the s- and p-polarized light beam paths can have the same size to enable an implementation of the designed common path with dual polarizations.

D. Performance

The existing SPI methods are further employed for comparisons, as shown in Fig. 6. The reconstructed object images obtained by using differential ghost imaging (DGI, a sampling ratio of 100%) [41,42] and Fourier SPI (FSI, a sampling ratio of 100%) [24] are shown in Figs. 6(a) and 6(b) with CNR values of 0.48 and 2.52, respectively. It is experimentally demonstrated that DGI is totally incapable of recovering any effective object information, and the reconstruction result obtained by using FSI also fails to clearly render object information. This is attributed to the effects from simultaneous distortions of the illumination and detection paths. As shown in Fig. 6(c), when spatial-temporal encoded patterns (STEPs) [37] are used, the method is highly susceptible to the disturbances of dynamic scattering environments with very low contrast in the reconstructed object image. To visually render the details, 128 pixels at the bottom of Fig. 6(c) are further removed, and a reconstructed object image is obtained as shown in Fig. 6(d), which is contaminated by background noise. The CNR value in Fig. 6(d) is 10.23. Here, the proposed method can always be applied to reconstruct high-quality object images, such as with the CNR value of 17.28 in Fig. 3(c). It is experimentally demonstrated that the proposed method is feasible and effective in complex scenarios. Figures 7(a) and 7(b) show the profiles along a line (indicated in insets) of the reconstructed object images. Three peaks in Group 2 Element 2 or 3 can be clearly observed using the proposed method. High resolution, i.e., a line width of 99.21 μm, is achieved by using the proposed method.

Figure 6.The reconstructed object images obtained in experiments using (a) the DGI, (b) the FSI, (c) the STEP, and (d) a further removal of 128 pixels at the bottom of (c).

Download full size

View all figures

Figure 7.The profiles along the line (indicated in insets) of the reconstructed object images with a target of (a) Group 2 Element 2 and (b) Group 2 Element 3.

Download full size

View all figures

Other targets are further tested using the proposed method with the optical setup in Fig. 2, and experimental results are shown in Figs. 8(a)–8(d). It is demonstrated again that the proposed method is feasible and effective in complex environments where illumination and detection paths are severely distorted at the same time.

Figure 8.Experimental results: (a)–(d) the reconstructed object images obtained by using the proposed method.

Download full size

View all figures

4. CONCLUSION

We have reported high-resolution common-path SPI using dual polarizations and the random-frequency encoding scheme in complex scenarios. The proposed method leverages a dual-polarization common-path configuration to enable an accurate correction of dynamic scaling factors, and the random-frequency encoding scheme is designed and applied to encode information and distribute noise into pixel-level artifacts. Experimental validation shows high performance of the proposed method in complex scenes, and high resolution, i.e., 198.42 μm, is achieved with a high CNR of 17.28. The proposed method does not require complex optical components or prior knowledge about complex scattering media, and can pave the way for high-resolution optical imaging in complex scenarios where illumination and detection paths are severely distorted at the same time.

Special Issue:

Received: Jun. 3, 2025

Accepted: Jul. 23, 2025

Published Online: Sep. 22, 2025

The Author Email: Wen Chen (owen.chen@polyu.edu.hk)

DOI:10.1364/PRJ.569507

CSTR:32188.14.PRJ.569507