Laser & Optoelectronics Progress, Volume. 59, Issue 7, 0707001(2022)
Syllable Matching Algorithm with Spectral Peak Point Feature for Chinese Speech
Based on the spectral peak point characteristics of Chinese speech, this study proposes a syllable matching algorithm to improve the matching effect of Chinese speech syllables in noisy environments. First, a discrete cosine transform is used to extract the speech signal envelope spectrogram, and the human ear masking effect is used for spectral energy judgment to obtain the extreme value points of spectral energy in each frame. Then, the syllable signal is corresponded to a binary sequence by performing binary quantization in the logarithmic frequency range. Finally, the syllable matching result is determined based on the template comparison of the binary sequence. The results show that the proposed algorithm outperforms the conventional methods for matching syllables in the noiseless Chinese speech. Additionally, it has a high matching accuracy at low signal-to-noise ratios.
Get Citation
Copy Citation Text
Weikang Tang, Yubin Shao, Hua Long, Qingzhi Du, Yi Peng, Liang Chen. Syllable Matching Algorithm with Spectral Peak Point Feature for Chinese Speech[J]. Laser & Optoelectronics Progress, 2022, 59(7): 0707001
Category: Fourier Optics and Signal Processing
Received: Jun. 7, 2021
Accepted: Jul. 6, 2021
Published Online: Mar. 8, 2022
The Author Email: Shao Yubin (shaoyubin@kust.edu.cn)