Optoelectronics Letters, Volume. 20, Issue 4, 222(2024)
MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology
The distributed acoustic sensing technology was used for real-time speech reproduction and recognition, in which the voiceprint can be extracted by the Mel frequency cepstral coefficient (MFCC) method. A classic ancient Chinese poem “You Zi Yin”, also called “A Traveler’s Song”, was analyzed both in time and frequency domains, where its real-time reproduction was achieved with a 116.91 ms time delay. The smaller scaled MFCC0 at 1/12 of MFCC matrix was taken as a feature vector of each line against the ambient noise, which provides a recognition method via cross-correlation among the 6 original and recovered verse pairs. The averaged cross-correlation coefficient of the matching pairs is calculated to be 0.580 6 higher than 0.188 3 of the nonmatched pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.
Get Citation
Copy Citation Text
ZHOU Ran, ZHAO Shuai, LUO Mingming, MENG Xin, MA Jie, LIU Jianfei. MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology[J]. Optoelectronics Letters, 2024, 20(4): 222
Received: Aug. 18, 2023
Accepted: Oct. 13, 2023
Published Online: Aug. 9, 2024
The Author Email: Mingming LUO (mmluo@hebut.edu.cn)