Adaptive Optimal Control for Prescribed Constraints Robotic Manipulators Based On Approximate Dynamic Programming
Abstract
the optimal control problem in the 2-Degree-of-Freedom (2-DOF) Robot Manipulator (RM) dynamic system. Prescribed constraints are introduced from realistic applications and bound the system states for safe operation. Using a space transformation system, the constrained state
space is mapped onto an unconstrained one, facilitating the control design process. Moreover, the online ADP algorithm with critic neural
network is implemented in the transformed RM system to find the optimal control strategy, with an auxiliary term introduced to ensure initial
admissible control requirements. By using the concurrent learning method, the persistence of excitation condition is relaxed and the control
performance is enhanced. Finally, simulations of the 2-DOF RM are conducted and the simulation results validate the feasibility of the refined
ADP control algorithm in optimal control of RM systems.
Keywords
Full Text:
PDFReferences
[1] Bellman, Richard, and Robert E. Kalaba. Dynamic programming and modern control theory. Vol. 81. New York: Academic Press, 1965.
[2] F. L. Lewis and D. Vrabie, Reinforcement learning and adaptive dynamic programming for feedback control, IEEE Circuits Syst.
Mag., vol. 9, no. 3, pp. 3250, Aug. 2009.
[3] C. Mu and K. Wang, Single-network ADP for near optimal control of continuous-time zero-sum games without using initial stabilising
control laws, IET Control Theory Appl., vol. 12, no. 18, pp. 24492458, 2018.
[4] T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems, " Proceedings of the 2010 American Control Conference, 2010, pp. 1568-1573.
[5] Q. Wei, D. Liu, Q. Lin, and R. Song, Adaptive dynamic programming for discrete-time zero-sum games, IEEE Trans. Neural Netw.
Learn. Syst., vol. 29, no. 4, pp. 957969, Apr. 2018.
[6] Y. Ouyang, L. Dong and C. Sun, "Critic Learning-Based Control for Robotic Manipulators With Prescribed Constraints, " in IEEE
Transactions on Cybernetics, vol. 52, no. 4, pp. 2274-2283, April 2022.
[7] B. D. O. Anderson, "Exponential convergence and persistent excitation, " 1982 21st IEEE Conference on Decision and Control, 1982,
pp. 12-17.
[8] G. Chowdhary and E. Johnson, "Concurrent learning for convergence in adaptive control without persistency of excitation, " 49th IEEE
Conference on Decision and Control (CDC), 2010, pp. 3674-3679.
[9] S. Xue, B. Luo and D. Liu, "Event-Triggered Adaptive Dynamic Programming for Unmatched Uncertain Nonlinear Continuous-Time
Systems, " in IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 7, pp. 2939-2951, July 2021.
[10] Q. Zhang, D. Zhao and Y. Zhu, "Event-Triggered H? Control for Continuous-Time Nonlinear System via Concurrent Learning, " in
IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 47, no. 7, pp. 1071-1081, July 2017.
[11] C. Li, F. Liu, Y. Wang and M. Buss, "Concurrent Learning-Based Adaptive Control of an Uncertain Robot Manipulator with Guaranteed
Safety and Performance, " in IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no. 5, pp. 3299-3313, May 2022.
DOI: http://dx.doi.org/10.70711/aitr.v2i4.4868
Refbacks
- There are currently no refbacks.