Deep reinforcement with spectrum series learning control for a mode-locked fiber laser

Zhan Li; Shuaishuai Yang; Qi Xiao; Tianyu Zhang; Yong Li; Lu Han; Dean Liu; Xiaoping Ouyang; Jianqiang Zhu

doi:10.1364/PRJ.455493

Photonics Research, Volume. 10, Issue 6, 1491(2022)

Deep reinforcement with spectrum series learning control for a mode-locked fiber laser

Zhan Li^1,2, Shuaishuai Yang^1,3, Qi Xiao^1,2, Tianyu Zhang^1,2, Yong Li^1,2, Lu Han^1,2, Dean Liu^1,4、*, Xiaoping Ouyang^1,5、*, and Jianqiang Zhu¹

Author Affiliations

¹Key Laboratory of High Power Laser and Physics, Shanghai Institute of Optics and Fine Mechanics, Chinese Academy of Sciences, Shanghai 201800, China

²Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China

³Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China

⁴e-mail: liudean@siom.ac.cn

⁵e-mail: oyxp@siom.ac.cn

show less

Figures & Tables(9)

Fig. 1. GNLSE simulation result from the NPE-based mode-locking laser system. (a) Spectral evolution when EPC is in TL. (b) Spectral evolution when EPC is in TH. (c) Spectral evolution when EPC is in TL initially and then converted to TH after 400 round trips. (d) Light transmittance caused by NPE when EPC is in TL (orange line) and TH (purple line). (e) Spectrum output after 800 round trips when EPC is in TL (orange line), TH (purple line), and Tm (green line). (f) Temporal output after 800 round trips when EPC is in TL (orange line), TH (purple line), and Tm (green line).

Download full size

View in Article

Fig. 2. Feedback time-series spectrum control model.

Download full size

View in Article

Fig. 3. MDRL agent layout.

Download full size

View in Article

Fig. 4. MDRL environment layout. LD, laser diode; WDM, 980/1060 nm wavelength division multiplexer; YDF, ytterbium-doped fiber; C, coupler; SMF, single-mode fiber; P, polarizer; I, isolator; EPC, electrical polarization controller; SF, optical spectrum filter; D, diagnostic optical spectrum analyzer.

Download full size

View in Article

Fig. 5. Spectrum and time-wave evolution during MDRL search. (a) Spectrum evolution data from the spectrum analyzer. (b) Time-wave evolution data from the high-speed photodetector and oscilloscope. (c) Obtained reward at each search step. (d) Direct autocorrelation output (blue line) and autocorrelation output after dispersion compensation (orange square, purple line).

Download full size

View in Article

Fig. 6. Mode-locked state switch by MSP. (a) Mode-locked state switch by minimizing the difference between PMSP(Wt) (purple line) and PMSP(Wc). (b) Pump power control error LMSP(Wc) (blue line) and MSP predicted error (green dashed line). (c), (g) Typical spectrum and temporal output in FML state. (d), (h) Typical spectrum and temporal output in HML state. (e), (i) Typical spectrum and temporal output in QML state. (f), (j) Typical spectrum and temporal output in QS output.

Download full size

View in Article

Fig. 7. Algorithm performance. (a) Total search step from 100 random initial states to the mode-locked state using MDRL (purple solid circle), DDPG (orange solid square), and genetic algorithm (green solid triangle). (b) Search stability test at different temperatures with MDRL (purple), DDPG (orange), and genetic algorithm (green).

Download full size

View in Article

Fig. 8. Search stability test at different temperatures with MDRL (purple), DDPG (orange), and genetic algorithm (green).

Download full size

View in Article

Table 1. Time Consumption Comparison with Recent Works
View table
View in Article
Table 1. Time Consumption Comparison with Recent Works
Algorithm Average Time Average Search Step
Genetic algorithm [7] 30 min 6000
HLA [6] 3.1 s 3100
DDPG [18] 1.948 s
DDPG in this environment 5.8 s 116.1
MDRL in this environment 0.69 s 13.8

Tools

Get Citation

Copy Citation Text

Zhan Li, Shuaishuai Yang, Qi Xiao, Tianyu Zhang, Yong Li, Lu Han, Dean Liu, Xiaoping Ouyang, Jianqiang Zhu. Deep reinforcement with spectrum series learning control for a mode-locked fiber laser[J]. Photonics Research, 2022, 10(6): 1491

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites