Laser & Optoelectronics Progress, Volume 56, Issue 20, 201004 (2019)

Image Annotation Based on Convolutional Neural Network and Topic Model

Lei Zhang* and Ming Cai
Author Affiliations
  • School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China
    Figures & Tables(7)
    Graphical model of LDA
    Structure of CNN based on transfer learning
    Framework of image annotation that combines CNN and topic model
    • Table 1. Symbols and their meaning

      Symbol   Meaning                   Symbol    Meaning
      M        Size of training set      N         Size of vocabulary
      K        Number of topics          w         Vocabulary word
      z        Latent topic              θ         Proportion of topic
      α        Parameter of model        β         Parameter of model
      γ        Variational parameter α   φ         Variational parameter β
      P_dir    Dirichlet distribution    Mult(·)   Multinomial distribution
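
To make the symbols in Table 1 concrete, the following is a minimal sketch of the standard LDA generative process: θ is drawn from a Dirichlet distribution with parameter α, each topic z is drawn from Mult(θ), and each word w is drawn from the row of β selected by z. It uses only the Python standard library. The function names, the toy sizes K = 3 and N = 8, and the choice α = 0.5 are illustrative assumptions, not values taken from the paper.

```python
import random

def sample_dirichlet(alpha, rng):
    """Draw one sample from Dirichlet(alpha) by normalising gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def generate_document(alpha, beta, doc_length, rng):
    """Sample (theta, topics, words) for one document.

    alpha: list of K Dirichlet parameters.
    beta:  K rows of N word probabilities (each row sums to 1).
    """
    theta = sample_dirichlet(alpha, rng)  # topic proportions for this document
    # One latent topic z per word position, drawn from Mult(theta).
    topics = rng.choices(range(len(alpha)), weights=theta, k=doc_length)
    # One word w per position, drawn from the word distribution of its topic.
    words = [rng.choices(range(len(beta[k])), weights=beta[k], k=1)[0]
             for k in topics]
    return theta, topics, words

rng = random.Random(0)           # fixed seed so the sketch is repeatable
K, N = 3, 8                      # toy sizes, chosen only for illustration
alpha = [0.5] * K
beta = [sample_dirichlet([1.0] * N, rng) for _ in range(K)]
theta, z, w = generate_document(alpha, beta, doc_length=10, rng=rng)
```

Here `theta` has K entries that sum to 1, while `z` and `w` hold one topic index and one word index per position. The variational parameters γ and φ in Table 1 belong to inference, which this sketch does not cover.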
    • Table 2. Parameters of different layers of CNN

      Type of network layer   K_f   F       S   P   D_f
      conv1                   96    11×11   4   0   55×55×96
      Max-Pooling1            -     3×3     2   0   27×27×96
      conv2                   256   5×5     1   2   27×27×256
      Max-Pooling2            -     3×3     2   0   13×13×256
      conv3                   384   3×3     1   1   13×13×384
      conv4                   384   3×3     1   1   13×13×384
      conv5                   256   3×3     1   1   13×13×256
      Max-Pooling5            -     3×3     2   0   6×6×256
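
The output sizes in Table 2 can be checked with the usual convolution arithmetic, out = floor((in + 2P - F) / S) + 1. The sketch below assumes that K_f is the number of kernels, F the kernel size, S the stride, P the padding and D_f the output feature size. It also assumes a 227×227 input, which the table does not state but which is the size that yields 55×55 after conv1.

```python
def output_size(in_size, kernel, stride, padding):
    """Spatial output size of a convolution or pooling layer."""
    return (in_size + 2 * padding - kernel) // stride + 1

# (name, number of kernels or None for pooling, F, S, P), copied from Table 2.
LAYERS = [
    ("conv1", 96, 11, 4, 0),
    ("Max-Pooling1", None, 3, 2, 0),
    ("conv2", 256, 5, 1, 2),
    ("Max-Pooling2", None, 3, 2, 0),
    ("conv3", 384, 3, 1, 1),
    ("conv4", 384, 3, 1, 1),
    ("conv5", 256, 3, 1, 1),
    ("Max-Pooling5", None, 3, 2, 0),
]

def trace_shapes(input_size, layers):
    """Return (name, height, width, channels) after each layer."""
    size, channels = input_size, None
    shapes = []
    for name, kernels, f, s, p in layers:
        size = output_size(size, f, s, p)
        if kernels is not None:   # pooling keeps the channel count unchanged
            channels = kernels
        shapes.append((name, size, size, channels))
    return shapes

shapes = trace_shapes(227, LAYERS)  # 227 is an assumed input size
for name, h, w, c in shapes:
    print(f"{name}: {h}x{w}x{c}")
```

Under these assumptions every traced shape agrees with the D_f column, ending at 6×6×256 after Max-Pooling5.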
    • Table 3. Annotation results of different models on Corel5K

      Model            Visual feature   AP      AR      F1
      PLSA-WORDS       TVS              0.121   0.221   0.191
      PLSA-WORDS       fc7              0.217   0.275   0.269
      HGDM             TVS              0.293   0.321   0.263
      HGDM             fc7              0.305   0.364   0.297
      Proposed model   fc7              0.380   0.490   0.420
    • Table 4. Annotation results of all image annotation models on common datasets

      Model            Corel5K                IAPR TC-12
                       AR     AP     F1       AR     AP     F1
      MBRM             0.25   0.24   0.25     0.23   0.24   0.24
      JEC              0.32   0.27   0.29     0.29   0.28   0.29
      TagProp-ML       0.37   0.31   0.34     0.25   0.48   0.33
      2PKNN            0.40   0.39   0.40     0.32   0.49   0.39
      CNN-R            0.41   0.32   0.37     0.31   0.49   0.37
      CNN-MSE          0.35   0.41   0.38     0.35   0.40   0.37
      Proposed model   0.49   0.38   0.43     0.40   0.44   0.42
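
The F1 column combines the AP and AR columns. This page does not say how F1 was computed, so the sketch below assumes the common definition: the harmonic mean of average precision (AP) and average recall (AR). The function name is illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Proposed model on Corel5K in Table 4: AP = 0.38, AR = 0.49.
f1 = f1_score(0.38, 0.49)
print(round(f1, 2))  # prints 0.43
```

With AP = 0.38 and AR = 0.49 this gives about 0.428, which rounds to the 0.43 reported for the proposed model on Corel5K in Table 4.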

    Lei Zhang, Ming Cai. Image Annotation Based on Convolutional Neural Network and Topic Model[J]. Laser & Optoelectronics Progress, 2019, 56(20): 201004

    Paper Information

    Category: Image Processing

    Received: Mar. 20, 2019

    Accepted: Apr. 26, 2019

    Published Online: Oct. 22, 2019

    The Author Email: Lei Zhang (289253808@qq.com)

    DOI:10.3788/LOP56.201004
