Publishing Service

Polishing & Checking

Journal of Zhejiang University SCIENCE A

ISSN 1673-565X(Print), 1862-1775(Online), Monthly

Three-degree-of-freedom motion posture stabilization control of platform based on DTW-LSTM-MATD3 under high and low frequency disturbances of ships

Abstract: In the complex and variable deep-sea environment, the compensation control of ship motion ensures the safety and efficiency of equipment installation and transportation in offshore wind farms. However, the ship motion posture compensation control system is severely affected by uncertainties, which significantly impact the accuracy of compensation control. In this paper, we propose a ship three-degree-of-freedom (3-DoF) motion posture stabilization control method based on the DTW-LSTM-MATD3 algorithm. We use the multi-agent twin delayed deep deterministic policy gradient (MATD3) to control a platform with six electric cylinders to achieve stable control. However, owing to random noise affecting the ship’s motion posture, we use a dynamic time warping (DTW) algorithm to distinguish between high-frequency noise and low-frequency tracking signals. Further, we embed a long short-term memory (LSTM) network into the MATD3 network to better align the Critic network’s training with the true Q-value. We use a combined reward function to enhance the agent’s exploration capability in complex dynamic environments. Finally, verification was conducted under sixth-level, abrupt sea conditions with high-frequency noise, as well as under real abrupt sea conditions, and a generalization test was also carried out. Simulation results show that the proposed DTW-LSTM-MATD3 method has great compensation control ability.

Key words: Compensation control; Multi-agent twin delayed deep deterministic policy gradient (MATD3) algorithm; Dynamic time warping (DTW) algorithm; Long short-term memory (LSTM) network

Chinese Summary  <2> 基于DTW-LSTM-MATD3的船舶高低频干扰下平台三自由度运动姿态稳定控制

作者:张琴,周静宜,顾邦平,胡雄
机构:上海海事大学,物流工程学院,中国上海,201306
目的:在复杂多变的深海环境中,船舶运动姿态补偿控制系统受到不确定性的严重影响,显著降低补偿控制的精度。本文旨在提升船舶运动补偿控制精度,确保海上风电场设备安装与运输的安全性和效率。
创新点:1.通过动态规整算法(DTW),区分高频噪声和低频跟踪信号;2.将长短期记忆(LSTM)网络嵌入到多智能体双延迟深度确定性策略梯度(MATD3)算法中,更好地使critic网络的训练与真实Q值接近;3.采用组合奖励函数提高智能体的探索能力。
方法:1.针对复杂海况下的船舶三自由度运动,构建船舶补偿系统的强化学习环境;2.运用DTW算法区分噪声信号,采用MATD3算法训练模型,并结合LSTM网络和组合奖励函数,提高传统MATD3的补偿效率;3.通过仿真模拟,补偿系统在六级海况、突变海况及真实含噪声情况下均实现更高的补偿效率,验证所提方法的有效性。
结论:1.DTW算法能确定高低频噪声信号分界点,使补偿系统实现高频抗噪和低频信号的跟踪运动;2. MATD3算法中采用LSTM神经网络和组合奖励函数,能提升智能体的训练效果和决策能力;3.运用所设计方法能够提高船舶三自由度补偿的泛化性和补偿效率。

关键词组:补偿控制;MATD3算法;DTW算法;LSTM网络


Share this article to: More

Go to Contents

References:

<Show All>

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





DOI:

10.1631/jzus.A2500146

CLC number:

Download Full Text:

Click Here

Downloaded:

697

Download summary:

<Click Here> 

Downloaded:

5

Clicked:

983

Cited:

0

On-line Access:

2026-03-25

Received:

2025-04-26

Revision Accepted:

2025-11-10

Crosschecked:

2026-03-25

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE