skip to main content
Tipo de recurso Mostra resultados com: Mostra resultados com: Índice

Hierarchical Sliding-Mode Surface-Based Adaptive Actor-Critic Optimal Control for Switched Nonlinear Systems With Unknown Perturbation

Zhang, Haoyan ; Zhao, Xudong ; Wang, Huanqing ; Zong, Guangdeng ; Xu, Ning

IEEE transaction on neural networks and learning systems, 2024-02, Vol.35 (2), p.1559-1571

United States: IEEE

Texto completo disponível

Citações Citado por
  • Título:
    Hierarchical Sliding-Mode Surface-Based Adaptive Actor-Critic Optimal Control for Switched Nonlinear Systems With Unknown Perturbation
  • Autor: Zhang, Haoyan ; Zhao, Xudong ; Wang, Huanqing ; Zong, Guangdeng ; Xu, Ning
  • Assuntos: Actor–critic (AC) neural networks (NNs) architecture ; adaptive optimal control ; Adaptive systems ; Artificial neural networks ; Control systems ; hierarchical sliding-mode surface (HSMS) ; Optimal control ; Perturbation methods ; switched nonlinear systems ; Switches ; Uncertainty ; unknown perturbation
  • É parte de: IEEE transaction on neural networks and learning systems, 2024-02, Vol.35 (2), p.1559-1571
  • Notas: ObjectType-Article-1
    SourceType-Scholarly Journals-1
    ObjectType-Feature-2
    content type line 23
  • Descrição: This article studies the hierarchical sliding-mode surface (HSMS)-based adaptive optimal control problem for a class of switched continuous-time (CT) nonlinear systems with unknown perturbation under an actor-critic (AC) neural networks (NNs) architecture. First, a novel perturbation observer with a nested parameter adaptive law is designed to estimate the unknown perturbation. Then, by constructing an especial cost function related to HSMS, the original control issue is further converted into the problem of finding a series of optimal control policies. The solution to the HJB equation is identified by the HSMS-based AC NNs, where the actor and critic updating laws are developed to implement the reinforcement learning (RL) strategy simultaneously. The critic update law is designed via the gradient descent approach and the principle of standardization, such that the persistence of excitation (PE) condition is no longer needed. Based on the Lyapunov stability theory, all the signals of the closed-loop switched nonlinear systems are strictly proved to be bounded in the sense of uniformly ultimate boundedness (UUB). Finally, the simulation results are presented to verify the validity of the proposed adaptive optimal control scheme.
  • Editor: United States: IEEE
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.