Advanced Photonics Nexus, Volume. 4, Issue 4, 046006(2025)

Remote picometric acoustic sensing via ultrastable laser homodyne interferometry

Yoon-Soo Jang1,2,3、*, Dong Il Lee1, Jaime Flor Flores1, Wenting Wang1,4, and Chee Wei Wong1、*
Author Affiliations
  • 1University of California, Los Angeles, Fang Lu Mesoscopic Optics and Quantum Electronics Laboratory, Los Angeles, California, United States
  • 2Korea Research Institute of Standards and Science (KRISS), Division of Physical Metrology, Length and Dimensional Metrology Group, Daejeon, Republic of Korea
  • 3University of Science and Technology (UST), Department of Science of Measurement, Daejeon, Republic of Korea
  • 4Chinese Academy of Sciences, Xiongan Institute of Innovation, Communication and Integrated Photonics Laboratory, Xiong’an New Area, China
  • show less

    Acoustic detection has many applications across science and technology from medicine to imaging and communications. However, most acoustic sensors have a common limitation in that the detection must be near the acoustic source. Alternatively, laser interferometry with picometer-scale motional displacement detection can rapidly and precisely measure sound-induced minute vibrations on remote surfaces. Here, we demonstrate the feasibility of sound detection up to 100 kHz at remote sites with ≈60 m optical path length via laser homodyne interferometry. Based on our ultrastable hertz linewidth laser with 10 - 15 fractional stability, our laser interferometer achieves 0.5 pm / Hz1/2 displacement sensitivity near 10 kHz, bounded only by laser frequency noise over 10 kHz. Between 140 Hz and 15 kHz, we achieve a homodyne acoustic sensing sensitivity of subnanometer/Pascal across our conversational frequency overtones. The minimal sound pressure detectable over 60 m optical path length is ≈2 mPa, with dynamic ranges over 100 dB. With the demonstrated standoff picometric distance metrology, we successfully detected and reconstructed musical scores of normal conversational volumes with high fidelity. The acoustic detection via this precision laser interferometer could be applied to selective area sound sensing for remote acoustic metrology, optomechanical vibrational motion sensing, and ultrasensitive optical microphones at the laser frequency noise limits.

    Keywords

    1 Introduction

    With length as one of the fundamental physical quantities, ultra-precise length metrology is pervasive across diverse areas of science and technology.13 The current SI definition of the meter is based on the optical path length traveled during 1/299,792,458 of a second in vacuum, with optical metrology serving a pivotal role in ultra-precise dimensional metrology.4,5 Laser interferometry—in homodyne or heterodyne principles for displacement measurement with sub-wavelength precision—has enabled optical dimensional metrology with the advancement of its precision and measurement range.610 However, the precision of interferometric phase–based displacement or distance measurements is practically bounded by several tens of picometer level, from the precision of phase measurements.1115 Alternatively, Fabry–Perot (FP) interferometry, which tracks the cavity resonance frequency1619 or frequency domain analysis of interferometric signals,2023 enables picometer-level displacement measurement and has been demonstrated in applications involving extremely small displacement measurement. Such platforms are widely used to examine gravitational wave searches,1 dynamics on optomechanics,24 membranes,25 and nanomechanical structures.26,27 In addition to length metrology, various optical and laser sensors have contributed to applications ranging from pressure measurement,28 acoustic sensing,2932 force and acceleration measurements,3335 gyroscopes,36 strain sensing,37 and earthquake detection38 to chedetection.39 In this study, acoustic sensing can aid in applications such as voice recognition, biological-medical imaging, and ultrasonic sensing.

    Through interferometric homodyne stabilization with our hertz linewidth laser, the 60 m optical path length is stabilized within a 1σ standard deviation of 2.29 nm, with a measured displacement noise floor of 0.5  pm/Hz1/2 near 10 kHz. Examining the control and error signals in the homodyne metrology up to 100 kHz, the corresponding distance strain noise floor is observed at 1.7×1014  ε/Hz1/2 with a 1.5 kHz servo bandwidth. Subsequently, we demonstrated the remote motional vibrational sensing of a glass beamsplitter under acoustic drives, from 140 Hz to 15 kHz, and quantified the laser homodyne displacement sensitivity as 782.77  nm/V and acoustic sensing sensitivities as subnanometer/Pascal across our conversational frequency overtones. The minimal sound level and pressure are determined to be 40  dB and 2  mPa, respectively, bounded only by the laser frequency noise. Across the acoustic frequencies, the dynamic range is determined to be between 60 and 100 dB, within the laser λ/4 displacement. With the picometric noise metrology and distance stabilization thus demonstrated, we reconstructed real-time sound waveforms and analyzed their frequency spectrograms at the remote 60 m optical path length, comparing the control signal and error signal mapping, along with different acoustic overtones. Our demonstrated platform not only allows for remote acoustic sensing in targeted areas but also allows for ultrasound sensing and the adaptation of optical frequency standards toward sound metrology.

    2 Results

    2.1 Remote Picometer Displacement Measurement using Laser Homodyne Interferometry

    Figure 1(a) shows the concept of our laser homodyne interferometry–based acoustic detection. Sound propagation, when reflected by an interface structure, including a window, results in minute vibrations on the window surface. The sound amplitude and frequency information are embedded in window vibrational overtones. With precision laser interferometry, the vibration overtones are retrieved, enabling remote and rapid sensing. Figure 1(b) depicts the experimental setup for the picometric displacement measurement realization with our precision laser homodyne interferometry. A 1565.54 nm FP cavity-stabilized ultrastable laser (SLS-INT-1550-100-1, Stable Laser Systems, Munich, Germany) with 1 Hz linewidth and 1015 fractional stability at 1 s is used as the light source in this study. After a single-mode fiber splitter with a 9:1 ratio, the shorter arm (10%) is used as the reference path, whereas the longer arm (90%; signal) is used as the measurement path goes through a 40 m fiber, which corresponds to an optical path length about 60 m (30 m distance for Michelson-type interferometer) between target and sensing position. A piezoelectric (PZT) actuator, with displacement sensitivity of 2.8  μm/V considering the interferometer roundtrip beam path, controls the optical phase delay line. The laser beam is launched into free space by a collimating lens and reflected by a beamsplitter window (BSW30, Thorlabs, Newton, New Jersey, United States) with 2 in. (1 in. = 304.8 mm) diameter and 8 mm thickness. The reflected beam is collimated into a single-mode fiber and combined with a reference arm by a 2×2 optical coupler. The combined reference and signal arm is sent to a balanced photodiode (BPD) to extract the interference signal without DC offset. The reflected beam is attenuated to 25  μW to enable a homodyne signal amplitude of ±500  mV, such that the displacement sensitivity of the homodyne signal is 782.77  nm/V.

    Schematic and setup of picometric homodyne laser interferometry-based remote acoustic detection. (a) Laser interferometer detects acoustic information engraved on the picometric vibration of the window. The interferometric signal is converted into a waveform of sound in the time domain. (b) Measurement interferometer setup with an ultrastable few-hertz linewidth laser. BPD, balanced photodetection; PZT, piezoelectric transducer; TX/RX, transmitter and receiver; PI, proportional and integral servo control.

    Figure 1.Schematic and setup of picometric homodyne laser interferometry-based remote acoustic detection. (a) Laser interferometer detects acoustic information engraved on the picometric vibration of the window. The interferometric signal is converted into a waveform of sound in the time domain. (b) Measurement interferometer setup with an ultrastable few-hertz linewidth laser. BPD, balanced photodetection; PZT, piezoelectric transducer; TX/RX, transmitter and receiver; PI, proportional and integral servo control.

    The BPD homodyne signal is sent to a proportional-integral (PI) servo controller, with a small fraction of the homodyne signal sent to a fast Fourier transform (FFT) analyzer and high-speed oscilloscope for data analysis. The control signal from the PI servo controller stabilizes the homodyne signal to zero point for long-term operation and highly sensitive sensing via the PZT actuator. As the control signal range is fixed at ±10  V, the PZT actuator can compensate for a ±28  μm displacement. A speaker is installed behind the beamsplitter to generate the acoustic input and music. Although recording the acoustic frequencies, a passive DC block electrical filter with >1  Hz passband suppresses the long-term drift of optical phase delay due to refractive index change and thermal expansion on the interferometer.

    The homodyne detection–based optical path stabilization has been widely used in frequency transfer40,41 and unbalanced arm interferometry.42Figure 2(a) shows the measurements of in-loop optical path stabilization by homodyne detection and PZT actuator over the long delay line, with Fig. 2(a) as an example path stabilization over 6.5  μs at 50 kHz update rate. The Gaussian-shaped histogram has a 2.3 nm (1σ) standard deviation. To quantitatively evaluate the optical path length stability, the inset in Fig. 2(a) shows the resulting Allan deviation. For the short-time scale (20  μs to 0.1 s), the optical path length stability is determined to be 2.3 nm at 20  μs and gradually improves to 40 pm at 0.1 s, with the measurement fitted relation of 12  pm×τavg0.5, where τavg is the averaging time. For a longer averaging time of more than 0.1 s, the stability remains near 40 pm, equivalent to a time offset of 1.33×1019  s (133 zs).

    Measured displacement amplitude spectral density through optical path stabilization using the interferometric homodyne signal. (a) In-loop measurement of the optical path after interferometric homodyne stabilization, sampled over 6 μs and plotted in the time domain. Right panel: histogram of the stabilized optical path showing Gaussian distribution with 1σ standard deviation of 2.3 nm. Inset: measurement stability verification through Allan deviation. (b) The left axis is the measured displacement amplitude spectral density from the error signal (red) and control signal (blue) of the interferometric homodyne stabilization. The laser homodyne displacement noise floor is determined to be 0.5 pm/Hz1/2 over 60 m optical path length, near 10 kHz. The right axis indicates the corresponding strain, with a noise floor of 1.7×10−14 ε/Hz1/2. For comparison, free-running homodyne signal without locked optical path stabilization is overlaid in the dark gray plot. Outside the ≈1.5 kHz servo bandwidth, displacement from the error signal is dominant, whereas displacement from the control signal is dominant inside the servo bandwidth.

    Figure 2.Measured displacement amplitude spectral density through optical path stabilization using the interferometric homodyne signal. (a) In-loop measurement of the optical path after interferometric homodyne stabilization, sampled over 6  μs and plotted in the time domain. Right panel: histogram of the stabilized optical path showing Gaussian distribution with 1σ standard deviation of 2.3 nm. Inset: measurement stability verification through Allan deviation. (b) The left axis is the measured displacement amplitude spectral density from the error signal (red) and control signal (blue) of the interferometric homodyne stabilization. The laser homodyne displacement noise floor is determined to be 0.5  pm/Hz1/2 over 60 m optical path length, near 10 kHz. The right axis indicates the corresponding strain, with a noise floor of 1.7×1014  ε/Hz1/2. For comparison, free-running homodyne signal without locked optical path stabilization is overlaid in the dark gray plot. Outside the 1.5  kHz servo bandwidth, displacement from the error signal is dominant, whereas displacement from the control signal is dominant inside the servo bandwidth.

    Figure 2(b) shows the displacement amplitude spectral densities obtained from the error and control signals. After the stabilization of the optical path, the error (red line) and control signals (blue line) are plotted together. For comparison, the error signal without the stabilization is plotted in a black line, representing the background displacement noise of the target window. From Fig. 2(b), we can estimate the PZT servo bandwidth in the optical path stabilization system to be 1.5  kHz. Inside the servo bandwidth (<1.5  kHz), displacement amplitude spectral density from the control signal is dominant and its structure is similar to the amplitude spectral density from the error signal without stabilization. In other words, the displacement amplitude spectral density from the control signal inside the servo bandwidth is equivalent to the background noise of the interferometer. The error signal inside the servo bandwidth is the residual component of the optical path stabilization. Outside the servo bandwidth (>1.5  kHz), the displacement amplitude spectral density from the error signal is dominant and its structure is overlaid directly onto the amplitude spectral density of the error signal without stabilization. Above 10 kHz, the displacement amplitude spectral density is bounded by the ultrastable laser and it fluctuates based on the ultrastable laser frequency noise (further detailed in Sec. S1 and Fig. S1 in the Supplementary Material).19,4351 The lowest background noise attained is at the few pm/Hz1/2 level near the Fourier frequency of 10 kHz, based on the frequency noise of the ultrastable 1 Hz laser. As shown in Fig. 2(b), the lowest background noise is estimated to be 0.5  pm/Hz1/2, corresponding to a strain of 1.7×1014  ε/Hz1/2. By combining the control signal amplitude spectral density inside the servo bandwidth and the error signal amplitude spectral density outside the servo bandwidth, the amplitude spectral density of displacement can be fully reconstructed over a wide range of Fourier frequencies.52

    2.2 Sound Sensing via Measurement of Picometric Displacement on the Reflectance Surface

    Figure 3(a) shows the resulting frequency spectra with input acoustic signals from 140 Hz to 15 kHz by analysis of the error and control signals. The blue and red lines indicate the background noise level from control and error signals, respectively, with sensitivities below 10  pm/Hz1/2 up to 100 kHz. Control signals for sound detection are used from 140 Hz to 1 kHz and error signals are used from 2 to 15 kHz (the periodic 60 to 300 Hz peaks are from the 60 Hz harmonics of the electrical power line noise). At lower frequency ranges up to 5 kHz, the background noise is mostly limited by environmental noise as the whole system is not isolated from its surroundings. Note that diagonal dashed lines (light gray, gray, and black) indicate theoretically predicted fluctuation of optical path length power spectral density for wind speeds of 0.1, 1, and 10  m/s by the theoretical Kolmogorov’s spectrum of weak turbulence.53,54 Based on the previous study on long-distance outdoor measurement,55 the optical path variation over a 60 m distance in an outdoor environment with temperature fluctuation and air turbulence is expected to be on the order of a few micrometers, which can be sufficiently compensated by the PZT actuator used in this study. At higher frequencies over 10 kHz, the background noise is limited by the frequency noise of the ultrastable laser.

    Frequency spectra of acoustic sensing for 140 Hz to 15 kHz input signals and varied sound levels, along with mechanical displacement responses. (a) Spectral densities of laser homodyne control and the error signal in the frequency domain. The blue and red lines indicate background displacement noise measured by the control and error signals, respectively. A minimum background displacement noise is found to be ≈0.5 pm/Hz1/2 near 10 kHz. The dashed lines indicate theoretical predictions of optical path length power spectral density for wind speeds of 0.1, 1, and 10 m/s.53" target="_self" style="display: inline;">53,54" target="_self" style="display: inline;">54 (b) Intensity-dependent displacement amplitude spectral density displacement in linear scale. Sound level is first measured by a commercial sound meter with a unit of dBA. This is then converted into decibel units for direct conversion to sound pressure. For (i) 200 Hz, displacements are measured by the control signal. For (ii) 2 kHz and (iii) 15 kHz, displacements are measured by the error signal. (c) Summary displacement response of the target window. Measurement results show that from 140 Hz to 15 kHz, it is a function of sound level (dB) and sound pressure (Pa). The dashed lines indicate linear fitted lines and open circles are estimated minimum and maximum measurable ranges for each frequency.

    Figure 3.Frequency spectra of acoustic sensing for 140 Hz to 15 kHz input signals and varied sound levels, along with mechanical displacement responses. (a) Spectral densities of laser homodyne control and the error signal in the frequency domain. The blue and red lines indicate background displacement noise measured by the control and error signals, respectively. A minimum background displacement noise is found to be 0.5  pm/Hz1/2 near 10 kHz. The dashed lines indicate theoretical predictions of optical path length power spectral density for wind speeds of 0.1, 1, and 10  m/s.53,54 (b) Intensity-dependent displacement amplitude spectral density displacement in linear scale. Sound level is first measured by a commercial sound meter with a unit of dBA. This is then converted into decibel units for direct conversion to sound pressure. For (i) 200 Hz, displacements are measured by the control signal. For (ii) 2 kHz and (iii) 15 kHz, displacements are measured by the error signal. (c) Summary displacement response of the target window. Measurement results show that from 140 Hz to 15 kHz, it is a function of sound level (dB) and sound pressure (Pa). The dashed lines indicate linear fitted lines and open circles are estimated minimum and maximum measurable ranges for each frequency.

    Figure 3(b) shows the acoustic intensity–dependent displacement amplitude spectral densities for 200 Hz, 500 Hz, 2 kHz, and 15 kHz, with clearly detected acoustic signatures. The vertical y-axis is linear; we use a sound level unit of dB, which does not consider the weighting factor for human ear response to directly convert sound intensity to sound pressure with units of Pascal. We measure the sound level for all frequencies from the speaker using a sound level meter with units of dB. We use control signals for 200 and 500 Hz sound detection and error signals for 2 and 15 kHz sound detection. Within intensities from 60 to 100 dB for all frequencies, we observe linear transduction from the input acoustic intensity to the detected intensity in the optical spectrum.

    Figure 3(c) shows the summary mapping of the sound intensity (sound pressure)–dependent displacement response of the target window from 140 Hz to 15 kHz. The upper horizontal axis is the equivalent sound pressure converted from the sound intensity shown in the lower x-axis. Sound intensity is determined by the peak intensity measured in the frequency domain. Sound signals start to appear at 140 Hz due to the relatively high background noise and low output of the driving speaker below this frequency. Detected sound signals of all frequencies have a linear proportion to the input sound pressure and are proportional to the square of sound level in dB units. Across the eight frequencies shown in Fig. 3(c), the typical displacement-to-acoustic intensity sensitivity is determined to be subnanometer/Pascal. The difference in the sound signal intensity response for each (mechanical and acoustic) frequency arises from the mechanical response transfer function of the target window. In addition, we note that as the PZT used in this study can cover displacement of ±28  μm, our system detects much louder sound within the servo bandwidth frequency range. Considering the background noise level marked with the open circle, the possible sound level measurement range for all frequencies is plotted with the dashed line. For the higher frequency (2 to 15 kHz) where the error signal is used, the minimum detectable sound level is 40  dB, corresponding to a sound pressure of 2  mPa. In this range, we define a maximum measurable range as less than the ±λ/4 vibration level with a marginal safety coefficient of 2, corresponding to an 100  dB dynamic range in our measurements. For the lower frequency (140 Hz to 1 kHz) where the control signal is used, the minimum detectable sound level varies from 40 to 65 dB, equivalent from 2 to 36 mPa. In this range, the maximum measurable range for lower frequencies is estimated to be much higher than the case for higher frequencies as the optical delay line range of the PZT actuator is ±28  μm these lower frequencies, a dynamic range up to 100  dB is estimated. We further note that vibration amplitudes larger than λ/2 can drop the locking state, and hence a tighter level of locking is required to measure a large sound when the control signal is used, with a less definite pinpoint of the maximum measurable range for the lower frequencies. As a sound level of 60 dB is typical for conversations, our scheme is sufficient to detect human voices over the remote window. If our system is operated with a shorter than 60 m optical path, the detectable sound level would be lower than 60 dB as the background noise is decreased.

    With the hertz-level laser metrology in place, subsequently, we recorded and reconstructed several music pieces in the laboratory environment using a high-speed 16-bit oscilloscope to show the feasibility of remote and covert sound detection. Figure 4(a) shows the real-time sound detection of “UCLA fight song” and its spectrogram, with Fig. 4(a) as the original sound waveform and Fig. 4(b) as its spectrogram over 10 s. Figures 4(b) and 4(c) show the recorded and reconstructed waveforms from the control (blue) and error (pink) signals, respectively. Slow-varying drift of the control signal is suppressed by a radio frequency (RF) high pass filter (EF599, Thorlabs, Newton, New Jersey, United States) with a 400 Hz cutoff frequency. Even though the error signal is locked to the zero point, the unsuppressed components have sound signal information. As the control signal has more information below 1.5 kHz where most human voice is distributed, the control signal-based sound information is clearer than the error signal-based information. However, the error signal-based sound signal includes higher frequency overtones than the control signal-based one, from the impulse response measurements noted in Sec. S2 and Figs. S2 and S3 in the Supplementary Material. As shown in Multimedias 1 and 2, respectively, the reconstructed lyrics of “UCLA fight song” are clearly audible for both control and error signal-based music records. In the frequency domain, error signal-based sound signal shows a relatively clearer sound signal over the locking bandwidth, as described in the previous section. A comparison of our millimeter-thickness beamsplitter with a few micrometer-thickness pellicle beam splitter is also noted in Sec. S3 and Figs. S4 and S5 in the Supplementary Material. Sound signal higher than 5 kHz is attenuated as it is above the mechanical transfer function of the target window but it is still sufficient to receive and distinguish male and female voices remotely, as illustrated in Multimedias 3, 4, 5, and 6 (female vocals: Shallow1, control signal; Shallow1, error signal; Shallow2, control signal; Shallow2, error signal), and Multimedias 7 and 8 (male vocals: Hotel California, control signal; Hotel California error signal).

    Real-time sound reconstruction of “UCLA fight song” and corresponding spectrograms. (a)–(c) Time-domain waveforms of the original sound (gray), control signal-based reconstruction measurement (blue) (Multimedia 1, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s1]), and error signal-based reconstruction measurement (pink) (Multimedia 2, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s2]). (d)–(f) Spectrograms corresponding to panels (a)–(c), respectively. The control signal-based reconstruction has a larger signal and higher signal-to-noise ratio than the error signal-based reconstruction. By contrast, the error signal-based reconstruction has higher frequency components.

    Figure 4.Real-time sound reconstruction of “UCLA fight song” and corresponding spectrograms. (a)–(c) Time-domain waveforms of the original sound (gray), control signal-based reconstruction measurement (blue) (Multimedia 1, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s1]), and error signal-based reconstruction measurement (pink) (Multimedia 2, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s2]). (d)–(f) Spectrograms corresponding to panels (a)–(c), respectively. The control signal-based reconstruction has a larger signal and higher signal-to-noise ratio than the error signal-based reconstruction. By contrast, the error signal-based reconstruction has higher frequency components.

    Figure 5 shows further examples of the real-time music recording waveform reconstructions and their corresponding spectrograms. Left panels [(a),(c),(e)] are control signal-based results and right panels [(b),(d),(f)] are error signal-based results. Figures 5(a)5(d) are songs from a female singer, and Figs. 5(e)5(f) are songs from a male singer. All data are converted into “.wav” file format (Multimedias 1, 2, 3, 4, 5, 6, 7, and 8), and these .wav files are converted into spectrograms as shown in Figs. 5(b), 5(d), and 5(e). As described in the main text, the control signal-based waveforms have a stronger signal than error signal-based waveforms, whereas the error signal-based waveforms have higher frequency components. From these results, we confirm that our metrology can record both male and female voice overtones at the remote site. Table S1 in the Supplementary Material presents a performance comparison with state-of-the-art acoustic sensing techniques.30,32,5661

    Comparison of real-time music recording reconstructions across different acoustic overtones. (a) Reconstructed control signal waveform and spectrogram of “Shallow1.wav” (Multimedia 3, WAV, 3.81 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s3]). (b) Reconstructed error signal waveform and spectrogram of “Shallow1.wav” (Multimedia 4, WAV, 3.81 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s4]) with a lower amplitude but with higher frequency components distinguished. (c) Reconstructed control signal waveform and spectrogram of “Shallow2.wav” (Multimedia 5, WAV, 0.39 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s5]). (d) Reconstructed error signal waveform and spectrogram of “Shallow2.wav” (Multimedia 6, WAV, 0.39 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s6]). Likewise, a lower amplitude is observed but higher frequency components are distinguished. (e) Reconstructed control signal waveform and spectrogram of “HotelCalifornia.wav” (Multimedia 7, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s7]). (f) Reconstructed error signal waveform and spectrogram of “HotelCalifornia.wav” (Multimedia 8, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s8]) with a lower amplitude and higher frequency metrology.

    Figure 5.Comparison of real-time music recording reconstructions across different acoustic overtones. (a) Reconstructed control signal waveform and spectrogram of “Shallow1.wav” (Multimedia 3, WAV, 3.81 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s3]). (b) Reconstructed error signal waveform and spectrogram of “Shallow1.wav” (Multimedia 4, WAV, 3.81 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s4]) with a lower amplitude but with higher frequency components distinguished. (c) Reconstructed control signal waveform and spectrogram of “Shallow2.wav” (Multimedia 5, WAV, 0.39 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s5]). (d) Reconstructed error signal waveform and spectrogram of “Shallow2.wav” (Multimedia 6, WAV, 0.39 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s6]). Likewise, a lower amplitude is observed but higher frequency components are distinguished. (e) Reconstructed control signal waveform and spectrogram of “HotelCalifornia.wav” (Multimedia 7, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s7]). (f) Reconstructed error signal waveform and spectrogram of “HotelCalifornia.wav” (Multimedia 8, WAV, 1.91 MB; [URL: https://doi.org/10.1117/1.APN.4.4.046006.s8]) with a lower amplitude and higher frequency metrology.

    3 Discussion and Conclusion

    In summary, we have shown remote and local sound detection via picometric homodyne laser interferometry. An FP cavity stabilized hertz-level linewidth laser with 1015 fractional frequency instability enables picometric displacement measurement over 60  m optical path length. Our precision homodyne laser interferometer achieves displacement noise background of 1.5  pm/Hz1/2 near 10 kHz, limited by laser frequency noise. We show the measurement capability of sound detection up to 100 kHz at remote locations about 60 m away, with a measurement range extended using high-speed electronics. The measurement method demonstrated in this study shows long-term operation via stabilization of the homodyne signal regardless of phase wrapping by long-term drift. We measure sounds from 140 Hz to 15 kHz to verify frequency-dependent displacement intensities, with acoustic sensing sensitivities as subnanometer/Pascal across our conversational frequency overtones. We confirm that our methodology is able to measure sounds ranging from 2 mPa to 2 kPa, with a dynamic range determined between 60 and 100 dB, within the laser λ/4 displacement. With the noise floors and sensitivities determined, we successfully recorded and recreated several music sounds, including female and male voices behind a window at typical conversation volumes. Our proposed system enables long-distance measurements through the use of a laser with a long coherence length and achieves picometer-level displacement sensitivity by stabilizing the homodyne signal at the zero point. In addition, it employs near-infrared lasers, which are invisible to the human eye, thereby offering the potential for undetectable laser eavesdropping. We further believe that our proposed system has the potential for laser-based sound sensing, ultrasound sensing, and the practical realization of optical frequency standards for acoustic measurements.

    4 Appendix: Materials and Methods

    4.1 Displacement Measurement by Control Signal Analysis

    The fiber stretcher (PZ2, Optiphase, Van Nuys, California, USA) based on a piezoelectric actuator has a displacement sensitivity of 5.6  μm/V. In this study, we directly use the output port signal of the servo controller as a control signal, without a voltage amplifier to avoid its voltage noise. As the output voltage range of the control signal is ±10  V and our interferometer has a double path of the optical beam line, the fiber stretcher compensates for the displacement of ±28  μm on the target window with a sensitivity of 2.8  μm/V. The control signal is directly converted into displacement in a high-speed oscilloscope and FFT analyzer. The control signal is used inside the locking bandwidth of 1.5 kHz.

    4.2 Displacement Measurement by Error Signal Analysis

    The amplitude of the homodyne signal is fixed to ±500  mV, equivalent to ±391.39  nm (λ/4). We assume that the error signal is linearly proportional to the displacement near the zero-point, which we use in this study. The displacement sensitivity of the homodyne signal is determined to be 782.77  nm/V. As the voltage information of the error signal is directly converted into displacement in the time domain, the displacement information can be rapidly recorded by a high-speed oscilloscope and FFT analyzer. As the error signal below the locking bandwidth is suppressed by the servo control mechanism, the outside-locking bandwidth of the error signal is valid to detect the acoustic signatures.

    Acknowledgments

    Acknowledgment. The authors appreciate the helpful discussions with Jinkang Lim, Jiagui Wu, and Qingsong Bai. This work was supported by the Office of Naval Research (Grant Nos. N00014-16-1-2094 and N00014-24-1-2547), the Lawrence Livermore National Laboratory (Grant No. B622827), and the National Science Foundation. Y.-S.J. acknowledges support from KRISS (Grant Nos. 25011026 and 25011211).

    Yoon-Soo Jang received his BS degree in mechanical engineering from the Inha University, Incheon, Republic of Korea, in 2007, and his MS degree and PhD from the Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea, in 2013 and 2017, respectively. From 2018 to 2019, he was a postdoctoral fellow at the University of California, Los Angeles, United States. He is a senior research scientist at the Korea Research Institute of Standards and Science (KRISS), Daejeon, Republic of Korea, and an associate professor at the University of Science & Technology (UST), Daejeon, Republic of Korea. His research interests include optical metrology, ultrafast photonics, frequency combs, laser interferometry, and length standards.

    Chee Wei Wong received his BS degree in mechanical engineering from the University of California at Berkeley, United States, in 1999, and his MS degree and PhD from the Massachusetts Institute of Technology (MIT), Cambridge, United States, in 2001 and 2003, respectively. He was a postdoctoral fellow at the MIT in 2003. He is a professor at the Electrical Engineering Department of the University of California, Los Angeles, United States. He is a fellow of the American Physical Society. His current research interests include nonlinear and quantum optics in nanophotonics, silicon electronic–photonic circuits and photonic crystals, quantum dot interactions in nanocavities, nano-electromechanical systems, and nanofabrication.

    Biographies of the other authors are not available.

    Tools

    Get Citation

    Copy Citation Text

    Yoon-Soo Jang, Dong Il Lee, Jaime Flor Flores, Wenting Wang, Chee Wei Wong, "Remote picometric acoustic sensing via ultrastable laser homodyne interferometry," Adv. Photon. Nexus 4, 046006 (2025)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Research Articles

    Received: Mar. 11, 2025

    Accepted: May. 30, 2025

    Published Online: Jun. 30, 2025

    The Author Email: Yoon-Soo Jang (ysj@kirss.ac.kr), Chee Wei Wong (cheewei.wong@ucla.edu)

    DOI:10.1117/1.APN.4.4.046006

    CSTR:32397.14.1.APN.4.4.046006

    Topics