Optics and Precision Engineering, Vol. 21, Issue 6, 1598 (2013)
Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion
WU Di, CAO Jie, WANG Jin-hua. Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion[J]. Optics and Precision Engineering, 2013, 21(6): 1598
Received: Feb. 7, 2013
Published Online: Jul. 1, 2013
Author Email: WU Di (wudi6152007@163.com)