Infrared and Laser Engineering, Volume. 52, Issue 10, 20230051(2023)

Speech enhancement method of laser microphone based on ResUnet and TFGAN network

Xinxue Dai1,2, Songtao Fan1, and Yan Zhou1,2
Author Affiliations
  • 1Optoelectronics System Laboratory, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
  • 2University of Chinese Academy of Sciences, Beijing 100049, China
  • show less
    References(19)

    [4] [4] Wang X. Research on several denoising methods f laser sound detection [D]. Hefei: Hefei University of Technology, 2018. (in Chinese)

    [6] [6] Li Weihong, Liu Ming, Zhu Zhigang, et al. LDV remote voice acquisition enhancement[C]18th International Conference on Pattern Recognition (ICPR''06), 2006: 262265.

    [10] [10] Plapous C, Marro C, Scalart P. Speech enhancement using harmonic regeneration[C]ICASSP''05. IEEE International Conference on Acoustics, Speech, Signal Processing, 2005. IEEE, 2005.

    [11] [11] Shoji U, Iwai, K, Fukumi T, et al. Sound quality improvement f speech acquisition based on deep learning harmonic reconstruction with laser microphone[C]Proceedings of the ICA Congress, 2019: 69376944.

    [12] [12] Bregman A S. Audity scene analysis: The Perceptual ganization of Sound[M]. Cambridge: MIT Press, 1994.

    [14] [14] Dan K H. Neural cognitive mechanisms affecting perceptual adaptation to distted speech[D]. London: University College London, 2019.

    [15] [15] Ronneberger O, Fischer P, Brox T. U: Convolutional wks f biomedical image segmentation[C]International Conference on Medical image computing computerassisted intervention. Cham: Springer, 2015: 234241.

    [16] [16] Choi H S, Park S, Lee J H, et al, Realtime denoising dereverberation wtih tiny recurrent U[C]ICASSP 20212021 IEEE International Conference on Acoustics, Speech Signal Processing (ICASSP), 2021: 57895793.

    [17] [17] Tian Q, Chen Y, Zhang Z, et al. TFGAN: Time frequency domain based generative adversarial wk f highfidelity speech synthesis[EBOL]. (20201124)[20230206]. https:doi.g10.48550arXiv.2011.12206.

    [18] [18] Liu H, Liu X, Kong Q, et al. VoiceFixer: A unified framewk f highfidelity speech restation[EBOL]. (20220412)[20230203]. https:arxiv.gabs2204.05841.

    [19] [19] Rix A W, Beerends J G, Hollier M P, et al. Perceptual evaluation of speech quality (PESQ)a new method f speech quality assessment of telephone wks codecs[C]2001 IEEE International Conference on Acoustics, Speech, Signal Processing. Proceedings (Cat. No. 01 CH37221), 2001, 2: 749752.

    Tools

    Get Citation

    Copy Citation Text

    Xinxue Dai, Songtao Fan, Yan Zhou. Speech enhancement method of laser microphone based on ResUnet and TFGAN network[J]. Infrared and Laser Engineering, 2023, 52(10): 20230051

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Lasers & Laser optics

    Received: Feb. 4, 2023

    Accepted: --

    Published Online: Nov. 21, 2023

    The Author Email:

    DOI:10.3788/IRLA20230051

    Topics