Document Type
|
:
|
BL
|
Record Number
|
:
|
889987
|
Main Entry
|
:
|
Song, Ruizhuo
|
Title & Author
|
:
|
Adaptive dynamic programming : : single and multiple controllers /\ Ruizhuo Song, Qinglai Wei, Qing Li.
|
Publication Statement
|
:
|
Singapore :: Springer,, [2019]
|
Series Statement
|
:
|
Studies in systems, decision and control ;; volume 166
|
Page. NO
|
:
|
1 online resource
|
ISBN
|
:
|
9789811317125
|
|
:
|
: 9789811317132
|
|
:
|
: 9811317127
|
|
:
|
: 9811317135
|
|
:
|
9789811317118
|
|
:
|
9811317119
|
Notes
|
:
|
9.3.2 Off-Policy IRL for Nonzero Disturbance
|
|
:
|
Includes index.
|
Contents
|
:
|
Intro; Preface; Contents; About the Authors; Symbols; 1 Introduction; 1.1 Optimal Control; 1.1.1 Continuous-Time LQR; 1.1.2 Discrete-Time LQR; 1.2 Adaptive Dynamic Programming; 1.3 Review of Matrix Algebra; References; 2 Neural-Network-Based Approach for Finite-Time Optimal Control; 2.1 Introduction; 2.2 Problem Formulation and Motivation; 2.3 The Data-Based Identifier; 2.4 Derivation of the Iterative ADP Algorithm with Convergence Analysis; 2.5 Neural Network Implementation of the Iterative Control Algorithm; 2.6 Simulation Study; 2.7 Conclusions; References
|
|
:
|
3 Nearly Finite-Horizon Optimal Control for Nonaffine Time-Delay Nonlinear Systems3.1 Introduction; 3.2 Problem Statement; 3.3 The Iteration ADP Algorithm and Its Convergence; 3.3.1 The Novel ADP Iteration Algorithm; 3.3.2 Convergence Analysis of the Improved Iteration Algorithm; 3.3.3 Neural Network Implementation of the Iteration ADP Algorithm; 3.4 Simulation Study; 3.5 Conclusion; References; 4 Multi-objective Optimal Control for Time-Delay Systems; 4.1 Introduction; 4.2 Problem Formulation; 4.3 Derivation of the ADP Algorithm for Time-Delay Systems
|
|
:
|
4.4 Neural Network Implementation for the Multi-objective Optimal Control Problem of Time-Delay Systems4.5 Simulation Results; 4.6 Conclusions; References; 5 Multiple Actor-Critic Optimal Control via ADP; 5.1 Introduction; 5.2 Problem Statement; 5.3 SIANN Architecture-Based Classification; 5.4 Optimal Control Based on ADP; 5.4.1 Model Neural Network; 5.4.2 Critic Network and Action Network; 5.5 Simulation Study; 5.6 Conclusions; References; 6 Optimal Control for a Class of Complex-Valued Nonlinear Systems; 6.1 Introduction; 6.2 Motivations and Preliminaries
|
|
:
|
6.3 ADP-Based Optimal Control Design6.3.1 Critic Network; 6.3.2 Action Network; 6.3.3 Design of the Compensation Controller; 6.3.4 Stability Analysis; 6.4 Simulation Study; 6.5 Conclusion; References; 7 Off-Policy Neuro-Optimal Control for Unknown Complex-Valued Nonlinear Systems; 7.1 Introduction; 7.2 Problem Statement; 7.3 Off-Policy Optimal Control Method; 7.3.1 Convergence Analysis of Off-Policy PI Algorithm; 7.3.2 Implementation Method of Off-Policy Iteration Algorithm; 7.3.3 Implementation Process; 7.4 Simulation Study; 7.5 Conclusion; References
|
|
:
|
8 Approximation-Error-ADP-Based Optimal Tracking Control for Chaotic Systems8.1 Introduction; 8.2 Problem Formulation and Preliminaries; 8.3 Optimal Tracking Control Scheme Based ... ; 8.3.1 Description of Approximation-Error ADP Algorithm; 8.3.2 Convergence Analysis of The Iterative ADP Algorithm; 8.4 Simulation Study; 8.5 Conclusion; References; 9 Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems with Disturbances; 9.1 Introduction; 9.2 Problem Statement; 9.3 Off-Policy Actor-Critic Integral Reinforcement Learning; 9.3.1 On-Policy IRL for Nonzero Disturbance
|
Abstract
|
:
|
This book presents a class of novel optimal control methods and games schemes based on adaptive dynamic programming techniques. For systems with one control input, the ADP-based optimal control is designed for different objectives, while for systems with multi-players, the optimal control inputs are proposed based on games. In order to verify the effectiveness of the proposed methods, the book analyzes the properties of the adaptive dynamic programming methods, including convergence of the iterative value functions and the stability of the system under the iterative control laws. Further, to substantiate the mathematical analysis, it presents various application examples, which provide reference to real-world practices.
|
Subject
|
:
|
Dynamic programming.
|
Subject
|
:
|
Dynamic programming.
|
Dewey Classification
|
:
|
519.7/03
|
LC Classification
|
:
|
T57.83.S66 2019
|
Added Entry
|
:
|
Li, Qing
|
|
:
|
Wei, Qinglai
|