CLC number:
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 0000-00-00
Cited: 0
Clicked: 1061
Yang YANG, Fanming HUANG, Dong YUE. Reinforcement learning-based privacy-preserving consensus tracking control of nonstrict-feedback discrete-time multi-agent systems[J]. Frontiers of Information Technology & Electronic Engineering, 1998, -1(-1): .
@article{title="Reinforcement learning-based privacy-preserving consensus tracking control of nonstrict-feedback discrete-time multi-agent systems",
author="Yang YANG, Fanming HUANG, Dong YUE",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="-1",
number="-1",
pages="",
year="1998",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.2300532"
}
%0 Journal Article
%T Reinforcement learning-based privacy-preserving consensus tracking control of nonstrict-feedback discrete-time multi-agent systems
%A Yang YANG
%A Fanming HUANG
%A Dong YUE
%J Journal of Zhejiang University SCIENCE C
%V -1
%N -1
%P
%@ 2095-9184
%D 1998
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.2300532
TY - JOUR
T1 - Reinforcement learning-based privacy-preserving consensus tracking control of nonstrict-feedback discrete-time multi-agent systems
A1 - Yang YANG
A1 - Fanming HUANG
A1 - Dong YUE
J0 - Journal of Zhejiang University Science C
VL - -1
IS - -1
SP -
EP -
%@ 2095-9184
Y1 - 1998
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.2300532
Abstract: This paper investigates a privacy-preserving consensus tracking problem for a class of nonstrict-feedback discrete-time multi-agent systems (MASs). An improved Liu cryptosystem is developed to alleviate the errors between encryption and decryption on the plaintext, which ensures satisfactory recovery of the plaintext information. A reinforcement learning (RL) technique is then employed to compensate for unknown dynamics and errors between true signals and decrypted ones. Based on the backstepping and graph theory, a RL-based privacy-preserving consensus tracking control strategy is further designed. By virtue of graph theory and Lyapunov stability theory, it is shown that the consensus tracking errors and all signals in the MAS are ultimately bounded. Finally, simulation examples are presented for verification of the effectiveness of the control strategy.
Open peer comments: Debate/Discuss/Question/Opinion
<1>