Electronics Optics & Control, Volume. 29, Issue 2, 53(2022)

An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System

DAI Xiaoqing1 and ZHAO Xu2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less

    For the online implementation of infinite horizon optimal control for continuous linear systems, an online Q-learning algorithm is designed under the condition that the system dynamics are completely unknown.Based on the Hamiltonian function and the optimal cost function in the infinite horizon optimal control theory, the Q function of the continuous linear system is constructed.An Actor/Critic approximator structure is designed by using the integral reinforcement learning method.With asymptotic stability of the closed-loop system and convergence to the optimal solution, the parameters of the Q function are estimated online.The 6th-order linear system model of the turbocharged engine is numerically simulated, and the results show that both the Critic weight and the Actor weight asymptotically converge to the optimal value, and the model-free optimal control is realized.

    Tools

    Get Citation

    Copy Citation Text

    DAI Xiaoqing, ZHAO Xu. An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System[J]. Electronics Optics & Control, 2022, 29(2): 53

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Jan. 2, 2021

    Accepted: --

    Published Online: Mar. 4, 2022

    The Author Email:

    DOI:10.3969/j.issn.1671-637x.2022.02.012

    Topics