Speech enhancement method of laser microphone based on ResUnet and TFGAN network

Xinxue Dai; Songtao Fan; Yan Zhou

doi:10.3788/IRLA20230051

Infrared and Laser Engineering, Vol. 52, Issue 10, 20230051 (2023)

Xinxue Dai¹,², Songtao Fan¹, and Yan Zhou¹,²

Author Affiliations

¹Optoelectronics System Laboratory, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China

²University of Chinese Academy of Sciences, Beijing 100049, China

Paper Information

Category: Lasers & Laser optics

Received: Feb. 4, 2023

Published Online: Nov. 21, 2023
