Acta Optica Sinica, Volume. 42, Issue 19, 1920001(2022)

VGG16-Based Diffractive Optical Neural Network and Context-Dependent Processing

Xingya Zhao, Zhiwei Yang, Jian Dai, Tian Zhang*, and Kun Xu
Author Affiliations
  • State Key Laboratory of Information Photonics and Optical Communications, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • show less
    Figures & Tables(16)
    Schematic diagram of VGG16-ECNN,and convolution process. (a) Schematic diagram of VGG16-ECNN; (b) convolution process
    Schematic diagrams of VGG16-DONN structure. (a) Simplified diagram of VGG16-ECNN; (b) 4f system [fx,y is input complex amplitude function, Fu,v is Fourier transform of fx,y, and fξ,η is reconstructed signal restored by second convex lens]
    Weights of convolution kernels
    Dataset examples. (a) CelebA dataset; (b) cat and dog dataset
    Training flow chart
    Original image and output images of two network structures through partial convolution kernel in the first convolution layer.(a) Face image; (b) VGG16-ECNN; (c) VGG16-DONN
    Output of face image convolved with all convolution kernels in the first convolution layer
    Output graph of each convolution block after inputting face image to VGG16-DONN structure. (a) Conv1; (b) Conv2; (c) Conv3; (d) Conv4; (e) Conv5
    Original image and output graphs of two network structures through partial convolution kernel in the first convolution layer.(a) Dog image; (b) VGG16-ECNN; (c) VGG16-DONN
    VGG16-DONN for CDP
    OWM algorithm flow chart
    Training results of VGG16-DONN and VGG16-ECNN. (a) Training accuracy and validation accuracy of VGG16-DONN and VGG16-ECNN; (b) training loss and validation loss of VGG16-DONN and VGG16-ECNN
    Training/validation accuracy and time obtained by varying different parameters. (a) Weights of different layers trained by VGG16-ECNN structure; (b) weights of different layers trained by VGG16-DONN structure; (c) VGG16-DONN structure changes optimizer; (d) VGG16-DONN structure changes learning rate
    • Table 1. Time of each convolutional layer of VGG16-ECNN

      View table

      Table 1. Time of each convolutional layer of VGG16-ECNN

      Convolution layerTime /sProportion of total time /%
      Conv1_10.9222.3
      Conv1_20.338.0
      Conv2_10.245.8
      Conv2_20.276.6
      Conv3_10.225.3
      Conv3_20.256.1
      Conv3_30.256.1
      Conv4_10.245.8
      Conv4_20.276.6
      Conv4_30.276.6
      Conv5_10.286.8
      Conv5_20.297.0
      Conv5_30.297.0
      Total time4.12100.0
    • Table 2. Classification accuracy and time of VGG16-DONN and AlexNet-DONN for cat and dog dataset

      View table

      Table 2. Classification accuracy and time of VGG16-DONN and AlexNet-DONN for cat and dog dataset

      Network structureClassification accuracy /%Time /min
      VGG16-DONN88.53320
      AlexNet-DONN83.811117
    • Table 3. Accuracy of 40 kinds of face attributes obtained by VGG16-DONN and VGG16-ECNN combined with CDP module

      View table

      Table 3. Accuracy of 40 kinds of face attributes obtained by VGG16-DONN and VGG16-ECNN combined with CDP module

      AttributeVGG16-DONNVGG16-ECNNAttributeVGG16-DONNVGG16-ECNN
      5_o_Clock_Shadow90.2090.70Mouth_Slightly_Open62.0065.30
      Arched_Eyebrows70.8071.70Mustache96.8096.30
      Attractive68.0071.60Narrow_Eyes87.0086.10
      Bags_Under_Eyes79.6078.60No_Beard86.0086.00
      Bald98.2098.20Oval_Face71.0073.40
      Bangs83.2086.00Pale_Skin95.8095.70
      Big_Lips67.2066.20Pointy_Nose71.8071.00
      Big_Nose81.8080.30Receding_Hairline92.0092.90
      Black_Hair75.8076.00Rosy_Cheeks93.2093.40
      Blond_Hair87.4086.90Sideburns94.6095.30
      Blurry96.2094.60Smiling67.0075.00
      Brown_Hair81.2081.10Straight_Hair79.2077.50
      Bushy_Eyebrows85.0086.40Wavy_Hair66.6073.60
      Chubby95.0095.90Wearing_Earrings79.4078.10
      Double_Chin97.4095.70Wearing_Hat96.0095.80
      Eyeglasses94.6093.70Wearing_Lipstick75.2081.60
      Goatee94.8095.60Wearing_Necklace84.2085.70
      Gray_Hair97.2097.50Wearing_Necktie91.0093.30
      Heavy_Makeup73.8079.00Young77.6076.90
      High_Cheekbones62.8071.20Male77.6084.90
    Tools

    Get Citation

    Copy Citation Text

    Xingya Zhao, Zhiwei Yang, Jian Dai, Tian Zhang, Kun Xu. VGG16-Based Diffractive Optical Neural Network and Context-Dependent Processing[J]. Acta Optica Sinica, 2022, 42(19): 1920001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Optics in Computing

    Received: Jan. 13, 2022

    Accepted: Apr. 15, 2022

    Published Online: Oct. 18, 2022

    The Author Email: Zhang Tian (ztian@bupt.edu.cn)

    DOI:10.3788/AOS202242.1920001

    Topics