A Reinforcement Learning based Adaptive and Efficient RWA in All Optical Networks

Zhaoyang LIU; Bitao PAN

doi:10.13756/j.gtxyj.2024.240024

Study On Optical Communications, Volume. 50, Issue 5, 24002401(2024)

A Reinforcement Learning based Adaptive and Efficient RWA in All Optical Networks

Zhaoyang LIU and Bitao PAN^*

School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China

show less

Abstract Get PDF(in Chinese)

【Objective】

Recent research efforts on Routing and Wavelength Assignment (RWA) for all optical networks are focused on Deep Reinforcement Learning (DRL) based algorithms. The DRL based RWA algorithms are mostly rely on the K Shortest Paths (KSP) routing to calculate candidate paths in advance, hence the DRL agent can choose possible actions from the precomputed paths. These KSP based models lack of flexibility and dynamicity, since they need to re-calculate the KSP for all the node pairs once the topology changes occur. To address this issue, this paper proposes an Adaptive and Efficient(ADE)-RWA algorithm based on DRL.

【Methods】

The key points and innovations of the ADE-RWA lie in that during the training process, the DRL agent takes actions in a step-by-step way instead of selecting from the precomputed K complete paths. Therefore, the routing strategies are dynamically adjustable in training even under the case of topology changes. It is because that the actions are open for the agent to take without concerning the limitations of the K fixed paths. Moreover, the ADE-RWA records the successfully assigned routes during the training in a LookUp Table (LUT). The algorithm turns to LUT checking for finding the available routes once the DRL training is converged, since at that time the LUT has acquired enough information for the RWA from the DRL training. The LUT based routing can effectively reduce the computational costs and improve the efficiency of RWA. In addition, the DRL training phase and LUT routing phase are real-time switchable. The algorithm turns to the DRL training phase when a link failure caused topology change occurs, and turns back to LUT checking when the model training is converged again.

【Results】

Experimental results show that compared with KSP-First Fit(FF)and Deep Reinforcement Learning Framework for Routing, Modulation and Spectrum Assignment (DeepRMSA), the blocking probability of ADE-RWA is reduced by 36% and 30% respectively. When a link failure occurs, the algorithm can quickly adapt to the changes in network topology.

【Conclusion】

The proposed DRL based RWA framework ADE-RWA can achieve adaptive routing and wavelength allocation under dynamic network conditions with low computational cost.

Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.

Keywords

digital twin DRL RWA wavelength-routed optical network

Tools

Get Citation

Copy Citation Text

Zhaoyang LIU, Bitao PAN. A Reinforcement Learning based Adaptive and Efficient RWA in All Optical Networks[J]. Study On Optical Communications, 2024, 50(5): 24002401

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Feb. 3, 2024

Accepted: --

Published Online: Oct. 15, 2024

The Author Email: PAN Bitao (bitao.pan@bupt.edu.cn)

DOI:10.13756/j.gtxyj.2024.240024

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology