Wei He
Reinforcement learning control of a flexible two-link manipulator: an experimental investigation
He, Wei; Gao, Hejia; Zhou, Chen; Yang, Chenguang; Li, Zhijun
Abstract
This article discusses the control design and experiment validation of a flexible two-link manipulator (FTLM) system represented by ordinary differential equations (ODEs). A reinforcement learning (RL) control strategy is developed that is based on actor-critic structure to enable vibration suppression while retaining trajectory tracking. Subsequently, the closed-loop system with the proposed RL control algorithm is proved to be semi-global uniform ultimate bounded (SGUUB) by Lyapunov's direct method. In the simulations, the control approach presented has been tested on the discretized ODE dynamic model and the analytical claims have been justified under the existence of uncertainty. Eventually, a series of experiments in a Quanser laboratory platform are investigated to demonstrate the effectiveness of the presented control and its application effect is compared with PD control.
Journal Article Type | Article |
---|---|
Acceptance Date | Jan 1, 2020 |
Online Publication Date | Mar 5, 2020 |
Publication Date | Dec 1, 2021 |
Deposit Date | Mar 29, 2020 |
Publicly Available Date | Apr 20, 2020 |
Journal | IEEE Transactions on Systems, Man, and Cybernetics: Systems |
Print ISSN | 2168-2216 |
Publisher | Institute of Electrical and Electronics Engineers |
Peer Reviewed | Peer Reviewed |
Volume | 51 |
Issue | 12 |
Pages | 7326-7336 |
DOI | https://doi.org/10.1109/TSMC.2020.2975232 |
Keywords | Control and Systems Engineering; Human-Computer Interaction; Electrical and Electronic Engineering; Software; Computer Science Applications |
Public URL | https://uwe-repository.worktribe.com/output/5827665 |
Files
RL-FTLS-plain
(3.8 Mb)
PDF
Copyright Statement
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
You might also like
Head-raising of snake robots based on a predefined spiral curve method
(2018)
Journal Article
Enhanced teleoperation performance using hybrid control and virtual fixture
(2019)
Journal Article
Efficient 3D object recognition via geometric information preservation
(2019)
Journal Article
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search