Opto-Electronic Advances, Volume. 7, Issue 4, 230182(2024)

Efficient stochastic parallel gradient descent training for on-chip optical processor

Yuanjian Wan1...2,†, Xudong Liu1,2,†, Guangze Wu1,2, Min Yang1,2, Guofeng Yan1,2, Yu Zhang1,2, and Jian Wang12,* |Show fewer author(s)
Author Affiliations
  • 1Wuhan National Laboratory for Optoelectronics and School of Optical and Electronic Information, Huazhong University of Science and Technology, Wuhan 430074, China
  • 2Optics Valley Laboratory, Wuhan 430074, China
  • show less
    Figures & Tables(9)
    (a) Conceptual diagram of the on-chip optical processor for optical switching and channel descrambling in MDM communication systems. (b) Schematic configuration of the integrated reconfigurable optical processor. θ and ϕ mean the phase shift of the phase shifters. MDM: mode-division multiplexing; MUX: multiplexer; DEMUX: demultiplexer.
    Flow chart of Stochastic Parallel Gradient Descent (SPGD) algorithm.
    Training results in electronic computer for optical switching, optical channel descrambling, and optical channel descrambling and switching. (a) Emulated light power distributions and (b) normalized light intensity distributions after training when the switching state is I1−O2, I2−O1, I3−O5, I4−O6, I5−O3, I6−O4. (d, e) Normalized light intensity distributions (d) before and (e) after training when randomly generating a set of phases in the part (1) of our chip to emulate crosstalk. (g, h) Normalized light intensity distributions (g) before and (h) after training with crosstalk when the switching state is: I1−O5, I2−O3, I3−O2, I4−O4, I5−O1, I6−O6. (c, f, i) The evaluation function changing with iteration rounds.
    (a) Schematic of experimental configuration. (b) Microscopy image of optical processor. VSA: voltage source array; PD: photodetector array.
    Online training results for optical switching at a wavelength of 1550 nm. (a) The evaluation function changing with iteration rounds when the switching state is I1−O3, I2−O1, I3−O4, I4−O6, I5−O2, I6−O5. The insets figures show the light power distributions when the round of iteration equals 50, 300, and 600, respectively. (b) The measured light power distributions after training. (c) The normalized light intensity distributions of measured results. (d, e) The measured light power distributions and normalized light intensity distributions when the switching state is I1−O3, I2−O6, I3−O4, I4−O2, I5−O1, I6−O5.
    Online training results for optical channel descrambling at a wavelength of 1550 nm. (a) The evaluation function changing with iteration rounds. The insets show the light power distributions when the round of iteration equals 1, 300, and 600, respectively. (b) The light power distributions before training. (c) The light power distributions after training. (d, e) The results of training when generating another matrix U˜.
    Online training results for optical channel descrambling and switching at a wavelength of 1550 nm. (a) The evaluation function changing with iteration rounds when the switching state is I1−O4, I2−O1, I3−O5, I4−O6, I5−O3, I6−O2. The insets show the light power distributions when the round of iteration equals 1, 100, and 400, respectively. (b) The light power distributions before training. (c) The light power distributions after training. (d, e) The results of training when generating another matrix U˜ and the switching state is I1−O5, I2−O3, I3−O1, I4−O6, I5−O2, I6−O4.
    Experimental setup and measured results for optical channel descrambling. (a) Experimental setup for the 6×6 optical descrambling systems. (b) The measured BER performance for back-to-back, optimization without crosstalk, before optimization with crosstalk, and after optimization with crosstalk systems. (c) The measured constellation chart at the back-to-back. (d) The measured constellation chart without crosstalk. (e) The measured constellation chart before optimization with crosstalk. (f) The measured constellation chart after optimization with crosstalk. PC: polarization controller; AWG: arbitrary waveform generator; EDFA: erbium-doped fiber amplifier; VOA: variable optical attenuator; OSC: oscilloscope; DSP: digital signal processing.
    • Table 1. Performance of different algorithms.

      View table
      View in Article

      Table 1. Performance of different algorithms.

      AlgorithmNumbers of updateMatrix sizes
      N=6N=10N=16N=32
      GDN(N−1)×T69038701320093248
      GAM×T10489046.6739732171200
      PSOM×T1024591231056116145
      SPGDT297.91092.64752.618053.1
    Tools

    Get Citation

    Copy Citation Text

    Yuanjian Wan, Xudong Liu, Guangze Wu, Min Yang, Guofeng Yan, Yu Zhang, Jian Wang. Efficient stochastic parallel gradient descent training for on-chip optical processor[J]. Opto-Electronic Advances, 2024, 7(4): 230182

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Research Articles

    Received: Sep. 30, 2023

    Accepted: Feb. 4, 2024

    Published Online: Jul. 19, 2024

    The Author Email: Wang Jian (JWang)

    DOI:10.29026/oea.2024.230182

    Topics