Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion

[1] [1] KINNUNEN T, LI H Z. An overview of text-independent speaker recognition: from features to supervectors ［J］. Speech Communication, 2010,52:12-40.

[2] [2] HAMID R,SEYYED A ,HOSSEIN B,et al.. A new representation for speech frame recognition based on redundant wavelet filter banks ［J］.Speech Communication, 2012, 54:256-271.

[3] [3] TYLER K P, STEPHANIE N,JOHN D,et al.. Human voice recognition depends on language ability ［J］. Science, 2011,333:595.

[4] [4] PARVIN Z,SEYYED A. Robust speech recognition by extracting invariant features ［J］.Procedia - Social and Behavioral Sciences, 2012,32(3):230-237.

[5] [5] SHAO Y,JIN ZH ZH,WANG D L. An auditory based feature for robust speech recognition ［C］. ICASSP,2009:4625-4628.

[6] [6] MAK B K W, LAI T C, TSANG I W, et al.. Maximum penalized likelihood kernel regression for fast adaptation ［J］. IEEE Transactions on Audio, Speech and Language Processing, 2009, 17(7): 1372-1381.

[7] [7] ZHAI Y,ZENG L, XIONG W.Star matching based on invariant feature descriptor ［J］. Opt. Precision Eng., 2012,20(11):2531-2539. (in Chinese)

[8] [8] DU J,HUO Q.A feature compensation approach using high-order vector taylor series approximation of an explicit distortion model for noisy speech recognition［J］.IEEE Transactions on Adio, Speech, and Language Processing,2011,19(8):2285-2293.

[9] [9] JEONG Y. Speaker adaptation based on the multilinear decomposition of training speaker models ［C］. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Dallas, USA: IEEE, 2010:4870-4873.

[10] [10] HE Y J,HAN J.Gaussian specific compensation for channel distortion in speech recognition ［J］. IEEE SIGNAL PROCESSING LETTERS, 2011, 18(10): 599-602.

[11] [11] OMID D,BIN M,ENG S,et al.. Discriminative feature extraction for speech recognition using continuous output codes ［J］. Pattern Recognition Letters, 2012,33:1703-1709.

[12] [12] SHI S Q,SHI G M,LI F.Partially occluded object matching via multi-level description and evaluation of contour feature ［J］.Opt. Precision Eng., 2012,20(12):2804-2811.(in Chinese)

[13] [13] BALWANT A. SONKAMBLE,DOYE D D. A novel linear-polynomial kernel to construct support vector machines for speech recognition［J］.Journal of Computer Science, 2011,7 (7): 991-996.

[14] [14] TOMAS P,PETER R. Real-time recognition of affective states from nonverbal features of speech and its application for public speaking skill analysis ［J］. IEEE Transactions on Affetive Computing, 2011,2(2):66-78.

[15] [15] SANTHOSH K C, MOHANDAS V P. Robust features for multilingual acoustic modeling［J］. Int J Speech Technol ,2011, 14:147-155.

CLP Journals

[1] LIU Jian-lei, SUI Qing-mei, ZHU Wen-xing. MR image segmentation based on probability density function and active contour model[J]. Optics and Precision Engineering, 2014, 22(12): 3435

Tools

Get Citation

Copy Citation Text

WU Di, CAO Jie, WANG Jin-hua. Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion[J]. Optics and Precision Engineering, 2013, 21(6): 1598

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Feb. 7, 2013

Accepted: --

Published Online: Jul. 1, 2013

The Author Email: WU Di (wudi6152007@163.com)

DOI:10.3788/ope.20132106.1598

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology