Infrared and Laser Engineering, Volume. 52, Issue 10, 20230051(2023)
Speech enhancement method of laser microphone based on ResUnet and TFGAN network
[4] [4] Wang X. Research on several denoising methods f laser sound detection [D]. Hefei: Hefei University of Technology, 2018. (in Chinese)
[6] [6] Li Weihong, Liu Ming, Zhu Zhigang, et al. LDV remote voice acquisition enhancement[C]18th International Conference on Pattern Recognition (ICPR''06), 2006: 262265.
[10] [10] Plapous C, Marro C, Scalart P. Speech enhancement using harmonic regeneration[C]ICASSP''05. IEEE International Conference on Acoustics, Speech, Signal Processing, 2005. IEEE, 2005.
[11] [11] Shoji U, Iwai, K, Fukumi T, et al. Sound quality improvement f speech acquisition based on deep learning harmonic reconstruction with laser microphone[C]Proceedings of the ICA Congress, 2019: 69376944.
[12] [12] Bregman A S. Audity scene analysis: The Perceptual ganization of Sound[M]. Cambridge: MIT Press, 1994.
[14] [14] Dan K H. Neural cognitive mechanisms affecting perceptual adaptation to distted speech[D]. London: University College London, 2019.
[15] [15] Ronneberger O, Fischer P, Brox T. U: Convolutional wks f biomedical image segmentation[C]International Conference on Medical image computing computerassisted intervention. Cham: Springer, 2015: 234241.
[16] [16] Choi H S, Park S, Lee J H, et al, Realtime denoising dereverberation wtih tiny recurrent U[C]ICASSP 20212021 IEEE International Conference on Acoustics, Speech Signal Processing (ICASSP), 2021: 57895793.
[17] [17] Tian Q, Chen Y, Zhang Z, et al. TFGAN: Time frequency domain based generative adversarial wk f highfidelity speech synthesis[EBOL]. (20201124)[20230206]. https:doi.g10.48550arXiv.2011.12206.
[18] [18] Liu H, Liu X, Kong Q, et al. VoiceFixer: A unified framewk f highfidelity speech restation[EBOL]. (20220412)[20230203]. https:arxiv.gabs2204.05841.
[19] [19] Rix A W, Beerends J G, Hollier M P, et al. Perceptual evaluation of speech quality (PESQ)a new method f speech quality assessment of telephone wks codecs[C]2001 IEEE International Conference on Acoustics, Speech, Signal Processing. Proceedings (Cat. No. 01 CH37221), 2001, 2: 749752.
Get Citation
Copy Citation Text
Xinxue Dai, Songtao Fan, Yan Zhou. Speech enhancement method of laser microphone based on ResUnet and TFGAN network[J]. Infrared and Laser Engineering, 2023, 52(10): 20230051
Category: Lasers & Laser optics
Received: Feb. 4, 2023
Accepted: --
Published Online: Nov. 21, 2023
The Author Email: