|
|
Frontiers of Information Technology & Electronic Engineering
ISSN 2095-9184 (print), ISSN 2095-9230 (online)
2025 Vol.26 No.12 P.2397-2420
SPID: a deep reinforcement learning-based solution framework for siting low-altitude takeoff and landing facilities#
Abstract: Siting low-altitude takeoff and landing platforms (vertiports) is a fundamental challenge for developing urban air mobility (UAM). This study formulates this issue as a variant of the capacitated facility location problem, incorporating flight range and service capacity constraints, and proposes SPID, a deep reinforcement learning (DRL)-based solution framework that models the problem as a Markov decision process. To handle dynamic coverage, the designed DRL framework-based SPID uses a multi-head attention mechanism to capture spatiotemporal patterns, followed by integrating dynamic and static information into a unified input state vector. Afterward, a gated recurrent unit (GRU) is used to generate the query vector, thereby enhancing sequential decision-making. The action network within the DRL network is regulated by a loss function that integrates service distance costs with unmet demand penalties, enabling end-to-end optimization. Subsequent experimental results demonstrate that SPID significantly enhances solution efficiency and robustness compared with traditional methods under flight and capacity constraints. Especially, across the social performance metrics emphasized in this study, SPID outperforms the suboptimal solutions produced by traditional clustering and graph neural network (GNN)-based methods by up to approximately 29%. This improvement comes with an increase in distance-based cost that is kept within 10%. Overall, we demonstrate an efficient, scalable approach for vertiport siting, supporting rapid decision-making in large-scale UAM scenarios.
Key words: Low-altitude planning; Vertiport siting; Deep reinforcement learning; Algorithm exploration
南京航空航天大学民航学院,中国南京市,211106
摘要:低空起降平台(垂直起降机场)的选址是发展城市空中交通(UAM)面临的核心挑战。本研究将该问题转化为带容量约束的设施选址问题,整合飞行距离与服务容量限制,并提出基于深度强化学习(DRL)的SPID解决方案框架,通过马尔可夫决策过程对问题进行建模。为处理动态覆盖需求,基于DRL框架设计的SPID采用多头注意力机制捕捉时空模式,将动态与静态信息整合为统一输入状态向量。随后通过门控循环单元(GRU)生成查询向量,从而增强序列决策能力。DRL网络中的动作网络通过损失函数进行调控,该函数整合了服务距离成本与未满足需求惩罚,实现端到端优化。后续实验结果表明,在飞行与容量约束条件下,SPID相较传统方法显著提升了解决方案的效率与鲁棒性。尤其在本文重点关注的社会绩效指标维度,SPID相较传统聚类法和基于图神经网络(GNN)的方法所产生的次优解,性能提升高达约29%。该提升伴随距离相关成本的增加,但增幅控制在10%以内。总体而言,我们为垂直起降机场选址提供了高效且可扩展的解决方案,为大规模的城市空中交通场景提供快速决策支持。
关键词组:
References:
Open peer comments: Debate/Discuss/Question/Opinion
<1>
DOI:
10.1631/FITEE.2500534
CLC number:
TP181
Download Full Text:
Downloaded:
490
Clicked:
539
Cited:
0
On-line Access:
2026-01-09
Received:
2025-07-28
Revision Accepted:
2025-11-26
Crosschecked:
2026-01-11