Electro-Optic Technology Application, Volume. 34, Issue 5, 37(2019)

Research on Speech Segmentation and Clustering Based on Mixed Features

LIU Jing-tian and JIANG Nan
Author Affiliations
  • [in Chinese]
  • show less

    The problem of extracting the target speaker speech from multiple speaker speech is researched. In order to improve the accuracy of multi-speaker speech segmentation and clustering, a speech segmentation and clustering algorithm based on Mel frequency cepstral coefficient (MFCC) and Gammatone frequency cepstral coefficient (GFCC) hybrid features is proposed, which can effectively avoid problems such as poor robustness of noisy speech segmentation and clustering. For the superimposed pink noise and factory noise speech, a comparative analysis is made based on the conventional algorithm and the improved segmentation clustering algorithm respectively. The results show that the proposed segmentation clustering algorithm based on mixed features is more accurate in extracting target human speech.

    Tools

    Get Citation

    Copy Citation Text

    LIU Jing-tian, JIANG Nan. Research on Speech Segmentation and Clustering Based on Mixed Features[J]. Electro-Optic Technology Application, 2019, 34(5): 37

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Jul. 19, 2019

    Accepted: --

    Published Online: Oct. 23, 2019

    The Author Email:

    DOI:

    CSTR:32186.14.

    Topics