pisco_log
banner

Adaptive Optimal Control for Prescribed Constraints Robotic Manipulators Based On Approximate Dynamic Programming

Xiang Wang, Zhinan Peng

Abstract


In this paper, an approximate dynamic programming (ADP) approach integrated with concurrent learning is proposed to cope with
the optimal control problem in the 2-Degree-of-Freedom (2-DOF) Robot Manipulator (RM) dynamic system. Prescribed constraints are introduced from realistic applications and bound the system states for safe operation. Using a space transformation system, the constrained state
space is mapped onto an unconstrained one, facilitating the control design process. Moreover, the online ADP algorithm with critic neural
network is implemented in the transformed RM system to find the optimal control strategy, with an auxiliary term introduced to ensure initial
admissible control requirements. By using the concurrent learning method, the persistence of excitation condition is relaxed and the control
performance is enhanced. Finally, simulations of the 2-DOF RM are conducted and the simulation results validate the feasibility of the refined
ADP control algorithm in optimal control of RM systems.

Keywords


Robot Manipulator; Development of robot; Approximate Dynamic Programming

Full Text:

PDF

Included Database


References


[1] Bellman, Richard, and Robert E. Kalaba. Dynamic programming and modern control theory. Vol. 81. New York: Academic Press, 1965.

[2] F. L. Lewis and D. Vrabie, Reinforcement learning and adaptive dynamic programming for feedback control, IEEE Circuits Syst.

Mag., vol. 9, no. 3, pp. 3250, Aug. 2009.

[3] C. Mu and K. Wang, Single-network ADP for near optimal control of continuous-time zero-sum games without using initial stabilising

control laws, IET Control Theory Appl., vol. 12, no. 18, pp. 24492458, 2018.

[4] T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems, " Proceedings of the 2010 American Control Conference, 2010, pp. 1568-1573.

[5] Q. Wei, D. Liu, Q. Lin, and R. Song, Adaptive dynamic programming for discrete-time zero-sum games, IEEE Trans. Neural Netw.

Learn. Syst., vol. 29, no. 4, pp. 957969, Apr. 2018.

[6] Y. Ouyang, L. Dong and C. Sun, "Critic Learning-Based Control for Robotic Manipulators With Prescribed Constraints, " in IEEE

Transactions on Cybernetics, vol. 52, no. 4, pp. 2274-2283, April 2022.

[7] B. D. O. Anderson, "Exponential convergence and persistent excitation, " 1982 21st IEEE Conference on Decision and Control, 1982,

pp. 12-17.

[8] G. Chowdhary and E. Johnson, "Concurrent learning for convergence in adaptive control without persistency of excitation, " 49th IEEE

Conference on Decision and Control (CDC), 2010, pp. 3674-3679.

[9] S. Xue, B. Luo and D. Liu, "Event-Triggered Adaptive Dynamic Programming for Unmatched Uncertain Nonlinear Continuous-Time

Systems, " in IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 7, pp. 2939-2951, July 2021.

[10] Q. Zhang, D. Zhao and Y. Zhu, "Event-Triggered H? Control for Continuous-Time Nonlinear System via Concurrent Learning, " in

IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 47, no. 7, pp. 1071-1081, July 2017.

[11] C. Li, F. Liu, Y. Wang and M. Buss, "Concurrent Learning-Based Adaptive Control of an Uncertain Robot Manipulator with Guaranteed

Safety and Performance, " in IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no. 5, pp. 3299-3313, May 2022.




DOI: http://dx.doi.org/10.70711/aitr.v2i4.4868

Refbacks

  • There are currently no refbacks.