Laser & Optoelectronics Progress, Vol. 57, Issue 18, 181702 (2020)

Speaker-Dependent Speech Recognition Algorithm for Laparoscopic Supporter Control

Kailong Ren, Yi Wang*, Xiaodong Chen, and Huaiyu Cai
Author Affiliations
  • School of Precision Instruments and Optoelectronics Engineering, Tianjin University, Tianjin 300072, China
    Figures & Tables(9)
    Diagram of simple RNN and its expansion
    Diagram of the unit of LSTM recurrent neural network hidden layer
    Diagram of BiLSTM RNN structure
    Diagram of LSTM recurrent neural network model with i-vector feature
    Diagrams of i-vector parameter fusion and adding rejection identification unit. (a) Parameter fusion of i-vector; (b) adding rejection identification unit
    • Table 1. LSTM recurrent neural network model structure with i-vector feature

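      | Layer ID | Name         | Number of units | Activation function |
      |----------|--------------|-----------------|---------------------|
      | 1        | Input layer  | -               | -                   |
      | 2        | FC1          | 64              | ReLU                |
      | 3        | FC2          | 64              | ReLU                |
      | 4        | FC3          | 64              | ReLU                |
      | 5        | BiLSTM       | 64              | -                   |
      | 6        | FC4          | 64              | ReLU                |
      | 7        | FC5          | 64              | ReLU                |
      | 8        | Output layer | -               | Softmax             |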
    • Table 2. Recognition results of surgeon speech by three models

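      Columns are grouped by model: DTW, GMM-HMM, and LSTM RNN with i-vector. FR and FA are the two error columns.

      | Word ID | DTW Total | DTW Correct | DTW FR | DTW FA | GMM-HMM Total | GMM-HMM Correct | GMM-HMM FR | GMM-HMM FA | LSTM Total | LSTM Correct | LSTM FR | LSTM FA |
      |---------|-----------|-------------|--------|--------|---------------|-----------------|------------|------------|------------|--------------|---------|---------|
      | 1       | 60        | 50          | 4      | 6      | 60            | 53              | 1          | 6          | 60         | 59           | 1       | 0       |
      | 2       | 60        | 54          | 2      | 4      | 60            | 54              | 2          | 4          | 60         | 60           | 0       | 0       |
      | 3       | 60        | 54          | 2      | 4      | 60            | 56              | 1          | 3          | 60         | 60           | 0       | 0       |
      | 4       | 60        | 53          | 2      | 5      | 60            | 55              | 1          | 4          | 60         | 59           | 1       | 0       |
      | 5       | 60        | 52          | 3      | 5      | 60            | 54              | 1          | 5          | 60         | 60           | 0       | 0       |
      | 6       | 60        | 54          | 3      | 3      | 60            | 55              | 3          | 2          | 60         | 60           | 0       | 0       |
      | 7       | 60        | 52          | 6      | 2      | 60            | 52              | 1          | 7          | 60         | 60           | 0       | 0       |
      | 8       | 60        | 54          | 2      | 4      | 60            | 54              | 1          | 5          | 60         | 60           | 0       | 0       |
      | Sum     | 480       | 423         | 24     | 33     | 480           | 433             | 11         | 36         | 480        | 478          | 2       | 0       |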
    • Table 3. Recognition results of assistant doctors speech by three models

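      Columns are grouped by model: DTW, GMM-HMM, and LSTM RNN with i-vector.

      | Word ID | DTW Total | DTW Rejection | DTW FA | GMM-HMM Total | GMM-HMM Rejection | GMM-HMM FA | LSTM Total | LSTM Rejection | LSTM FA |
      |---------|-----------|---------------|--------|---------------|-------------------|------------|------------|----------------|---------|
      | 1       | 60        | 53            | 7      | 60            | 54                | 6          | 60         | 60             | 0       |
      | 2       | 60        | 55            | 5      | 60            | 56                | 4          | 60         | 60             | 0       |
      | 3       | 60        | 58            | 2      | 60            | 57                | 3          | 60         | 60             | 0       |
      | 4       | 60        | 54            | 6      | 60            | 55                | 5          | 60         | 60             | 0       |
      | 5       | 60        | 56            | 4      | 60            | 56                | 4          | 60         | 60             | 0       |
      | 6       | 60        | 53            | 7      | 60            | 52                | 8          | 60         | 60             | 0       |
      | 7       | 60        | 55            | 5      | 60            | 56                | 4          | 60         | 60             | 0       |
      | 8       | 60        | 56            | 4      | 60            | 57                | 3          | 60         | 60             | 0       |
      | Sum     | 480       | 440           | 40     | 480           | 443               | 37         | 480        | 480            | 0       |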
    • Table 4. Recognition results of interference speech by three models

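      Columns are grouped by model: DTW, GMM-HMM, and LSTM RNN with i-vector.

      | Word ID | DTW Total | DTW Rejection | DTW FA | GMM-HMM Total | GMM-HMM Rejection | GMM-HMM FA | LSTM Total | LSTM Rejection | LSTM FA |
      |---------|-----------|---------------|--------|---------------|-------------------|------------|------------|----------------|---------|
      | 1       | 80        | 72            | 8      | 80            | 70                | 10         | 80         | 80             | 0       |
      | 2       | 80        | 72            | 8      | 80            | 75                | 5          | 80         | 77             | 3       |
      | 3       | 80        | 75            | 5      | 80            | 76                | 4          | 80         | 78             | 2       |
      | 4       | 80        | 73            | 7      | 80            | 72                | 8          | 80         | 76             | 4       |
      | 5       | 80        | 73            | 7      | 80            | 74                | 6          | 80         | 77             | 3       |
      | 6       | 80        | 74            | 6      | 80            | 75                | 5          | 80         | 78             | 2       |
      | 7       | 80        | 75            | 5      | 80            | 75                | 5          | 80         | 79             | 1       |
      | 8       | 80        | 73            | 7      | 80            | 72                | 8          | 80         | 79             | 1       |
      | Sum     | 640       | 587           | 53     | 640           | 589               | 51         | 640        | 624            | 16      |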
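
The "Sum" rows of Tables 2 to 4 can be turned into percentage rates with simple arithmetic. The sketch below is not part of the paper: it only copies the summed counts from the three tables and divides them by the totals. The function and variable names are illustrative, and reading FR and FA as false rejections and false acceptances is an assumption, since the tables give only the abbreviations.

```python
# Summed counts copied from the "Sum" rows of Tables 2-4.
# Assumption: FR = false rejection, FA = false acceptance.

# Table 2 (surgeon speech): (total, correct, FR, FA)
SURGEON = {
    "DTW": (480, 423, 24, 33),
    "GMM-HMM": (480, 433, 11, 36),
    "LSTM RNN with i-vector": (480, 478, 2, 0),
}

# Table 3 (assistant doctors' speech): (total, rejection, FA)
ASSISTANT = {
    "DTW": (480, 440, 40),
    "GMM-HMM": (480, 443, 37),
    "LSTM RNN with i-vector": (480, 480, 0),
}

# Table 4 (interference speech): (total, rejection, FA)
INTERFERENCE = {
    "DTW": (640, 587, 53),
    "GMM-HMM": (640, 589, 51),
    "LSTM RNN with i-vector": (640, 624, 16),
}


def percent(part, total):
    """Return part / total as a percentage."""
    return 100.0 * part / total


def summarize():
    """Compute per-model rates from the summed counts above."""
    rows = {}
    for model, (total, correct, fr, fa) in SURGEON.items():
        # Sanity check: every utterance is counted exactly once.
        assert correct + fr + fa == total
        a_total, a_rej, a_fa = ASSISTANT[model]
        assert a_rej + a_fa == a_total
        i_total, i_rej, i_fa = INTERFERENCE[model]
        assert i_rej + i_fa == i_total
        rows[model] = {
            "surgeon_accuracy": percent(correct, total),
            "assistant_rejection": percent(a_rej, a_total),
            "interference_rejection": percent(i_rej, i_total),
        }
    return rows


if __name__ == "__main__":
    for model, rates in summarize().items():
        print(model, {k: round(v, 2) for k, v in rates.items()})
```

Under these counts, the LSTM RNN with i-vector model recognizes 478 of 480 surgeon commands (about 99.58%), rejects all 480 assistant-doctor utterances, and rejects 624 of 640 interference utterances (97.5%).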
    Kailong Ren, Yi Wang, Xiaodong Chen, Huaiyu Cai. Speaker-Dependent Speech Recognition Algorithm for Laparoscopic Supporter Control[J]. Laser & Optoelectronics Progress, 2020, 57(18): 181702

    Paper Information

    Category: Medical Optics and Biotechnology

    Received: Feb. 5, 2020

    Accepted: Mar. 19, 2020

    Published Online: Sep. 2, 2020

    Author Email: Wang Yi (koala_wy@tju.edu.cn)

    DOI: 10.3788/LOP57.181702
