Optics and Precision Engineering, Volume. 21, Issue 6, 1598(2013)

Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion

WU Di1、*, CAO Jie1,2, and WANG Jin-hua1
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less
    References(15)

    [1] [1] KINNUNEN T, LI H Z. An overview of text-independent speaker recognition: from features to supervectors [J]. Speech Communication, 2010,52:12-40.

    [2] [2] HAMID R,SEYYED A ,HOSSEIN B,et al.. A new representation for speech frame recognition based on redundant wavelet filter banks [J].Speech Communication, 2012, 54:256-271.

    [3] [3] TYLER K P, STEPHANIE N,JOHN D,et al.. Human voice recognition depends on language ability [J]. Science, 2011,333:595.

    [4] [4] PARVIN Z,SEYYED A. Robust speech recognition by extracting invariant features [J].Procedia - Social and Behavioral Sciences, 2012,32(3):230-237.

    [5] [5] SHAO Y,JIN ZH ZH,WANG D L. An auditory based feature for robust speech recognition [C]. ICASSP,2009:4625-4628.

    [6] [6] MAK B K W, LAI T C, TSANG I W, et al.. Maximum penalized likelihood kernel regression for fast adaptation [J]. IEEE Transactions on Audio, Speech and Language Processing, 2009, 17(7): 1372-1381.

    [8] [8] DU J,HUO Q.A feature compensation approach using high-order vector taylor series approximation of an explicit distortion model for noisy speech recognition[J].IEEE Transactions on Adio, Speech, and Language Processing,2011,19(8):2285-2293.

    [9] [9] JEONG Y. Speaker adaptation based on the multilinear decomposition of training speaker models [C]. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Dallas, USA: IEEE, 2010:4870-4873.

    [10] [10] HE Y J,HAN J.Gaussian specific compensation for channel distortion in speech recognition [J]. IEEE SIGNAL PROCESSING LETTERS, 2011, 18(10): 599-602.

    [11] [11] OMID D,BIN M,ENG S,et al.. Discriminative feature extraction for speech recognition using continuous output codes [J]. Pattern Recognition Letters, 2012,33:1703-1709.

    [13] [13] BALWANT A. SONKAMBLE,DOYE D D. A novel linear-polynomial kernel to construct support vector machines for speech recognition[J].Journal of Computer Science, 2011,7 (7): 991-996.

    [14] [14] TOMAS P,PETER R. Real-time recognition of affective states from nonverbal features of speech and its application for public speaking skill analysis [J]. IEEE Transactions on Affetive Computing, 2011,2(2):66-78.

    [15] [15] SANTHOSH K C, MOHANDAS V P. Robust features for multilingual acoustic modeling[J]. Int J Speech Technol ,2011, 14:147-155.

    CLP Journals

    [1] LIU Jian-lei, SUI Qing-mei, ZHU Wen-xing. MR image segmentation based on probability density function and active contour model[J]. Optics and Precision Engineering, 2014, 22(12): 3435

    Tools

    Get Citation

    Copy Citation Text

    WU Di, CAO Jie, WANG Jin-hua. Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion[J]. Optics and Precision Engineering, 2013, 21(6): 1598

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Feb. 7, 2013

    Accepted: --

    Published Online: Jul. 1, 2013

    The Author Email: WU Di (wudi6152007@163.com)

    DOI:10.3788/ope.20132106.1598

    Topics