Affiliation(s): 1Naval Aeronautic University, Yantai 264001, China
2Shandong Key Laboratory of Sea and Air Information Perception and Processing Technology, Naval Aeronautic University, Yantai 264001, China
3PLA 91001 Unit, Beijing 100000, China
Zhongyang MAO1,2, Zhilin ZHANG1,2, Faping LU1,2, Xiguo LIU1,2, Zhichao XU1,2,Yaozong PAN1,2, Jiafang KANG1,2, Yang YOU3. Dynamic joint resource allocation in maritime wireless communication networks: a meta-reinforcement learning approach based on knowledge embedding[J]. Frontiers of Information Technology & Electronic Engineering,in press.https://doi.org/10.1631/FITEE.2500007
@article{title="Dynamic joint resource allocation in maritime wireless communication networks: a meta-reinforcement learning approach based on knowledge embedding", author="Zhongyang MAO1,2, Zhilin ZHANG1,2, Faping LU1,2, Xiguo LIU1,2, Zhichao XU1,2,Yaozong PAN1,2, Jiafang KANG1,2, Yang YOU3", journal="Frontiers of Information Technology & Electronic Engineering", year="in press", publisher="Zhejiang University Press & Springer", doi="https://doi.org/10.1631/FITEE.2500007" }
%0 Journal Article %T Dynamic joint resource allocation in maritime wireless communication networks: a meta-reinforcement learning approach based on knowledge embedding %A Zhongyang MAO1 %A 2 %A Zhilin ZHANG1 %A 2 %A Faping LU1 %A 2 %A Xiguo LIU1 %A 2 %A Zhichao XU1 %A 2 %A Yaozong PAN1 %A 2 %A Jiafang KANG1 %A 2 %A Yang YOU3 %J Frontiers of Information Technology & Electronic Engineering %P %@ 2095-9184 %D in press %I Zhejiang University Press & Springer doi="https://doi.org/10.1631/FITEE.2500007"
TY - JOUR T1 - Dynamic joint resource allocation in maritime wireless communication networks: a meta-reinforcement learning approach based on knowledge embedding A1 - Zhongyang MAO1 A1 - 2 A1 - Zhilin ZHANG1 A1 - 2 A1 - Faping LU1 A1 - 2 A1 - Xiguo LIU1 A1 - 2 A1 - Zhichao XU1 A1 - 2 A1 - Yaozong PAN1 A1 - 2 A1 - Jiafang KANG1 A1 - 2 A1 - Yang YOU3 J0 - Frontiers of Information Technology & Electronic Engineering SP - EP - %@ 2095-9184 Y1 - in press PB - Zhejiang University Press & Springer ER - doi="https://doi.org/10.1631/FITEE.2500007"
Abstract: As human exploration of the ocean expands, the demand for continuous, high-quality, and ubiquitous maritime communication is steadily increasing. However, the dynamic nature of the marine environment and resource constraints present significant challenges for traditional heuristic resource allocation methods, complicating the balance between high-quality communication and limited network resources. This results in suboptimal system throughput and an over-reliance on specific problem structures. To address these issues, in this paper we introduce a joint resource allocation method based on knowledge embedding. The proposed approach includes an action distribution alignment module designed to improve resource utilization by preventing unreasonable action-output combinations. Furthermore, by integrating knowledge embedding with meta-reinforcement learning techniques, a physical guidance loss function is formulated, which effectively reduces the sample size required for model training, thereby enhancing the algorithm's generalization capabilities. Simulation results show that the proposed method achieves an increase in average system throughput of 31.19% compared to the MAML-PPO algorithm and 80.91% compared to the RL2 algorithm, across various channel environments.
Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article
Reference
Open peer comments: Debate/Discuss/Question/Opinion
Open peer comments: Debate/Discuss/Question/Opinion
<1>