|
|
Frontiers of Information Technology & Electronic Engineering
ISSN 2095-9184 (print), ISSN 2095-9230 (online)
2025 Vol.26 No.3 P.385-399
Significance extraction based on data augmentation for reinforcement learning
Abstract: Deep reinforcement learning has shown remarkable capabilities in visual tasks, but it does not have a good generalization ability in the context of interference signals in the input images; this approach is therefore hard to be applied to trained agents in a new environment. To enable agents to distinguish between noise signals and important pixels in images, data augmentation techniques and the establishment of auxiliary networks are proven effective solutions. We introduce a novel algorithm, namely, saliency-extracted Q-value by augmentation (SEQA), which encourages the agent to explore unknown states more comprehensively and focus its attention on important information. Specifically, SEQA masks out interfering features and extracts salient features and then updates the mask decoder network with critic losses to encourage the agent to focus on important features and make correct decisions. We evaluate our algorithm on the DeepMind Control generalization benchmark (DMControl-GB), and the experimental results show that our algorithm greatly improves training efficiency and stability. Meanwhile, our algorithm is superior to state-of-the-art reinforcement learning methods in terms of sample efficiency and generalization in most DMControl-GB tasks.
Key words: Deep reinforcement learning; Visual tasks; Generalization; Data augmentation; Significance; DeepMind Control generalization benchmark
1浙江大学医学院附属口腔医院, 浙江大学口腔医学院, 浙江省口腔疾病临床医研究中心, 浙江省口腔生物医学研究重点实验室, 浙江大学癌症研究院, 中国杭州市, 310006
2广西口腔颌面修复与重建研究重点实验室, 中国南宁市, 530021
摘要:包括骨质疏松症、骨关节炎、类风湿性关节炎、骨折和牙周炎在内的骨相关疾病,显著影响了人类健康。琥珀酸作为三羧酸循环中的一种代谢中间产物,已被发现不仅在代谢中起作用,还能作为细胞功能的调节因子发挥作用。应激状态下,琥珀酸在线粒体中积累,作为信号分子调节细胞功能。值得注意的是,琥珀酸可通过稳定缺氧诱导因子1α(HIF-1α)促进血管生成和炎症发展。此外,琥珀酸还可通过与琥珀酸受体1(SUCNR1)作用介导多种病理生理过程,如免疫反应、炎症、癌症转移和骨稳态等。琥珀酸作为信号分子的多重作用取决于其在细胞中的位置和浓度。近期的代谢组学分析发现,骨相关疾病中琥珀酸水平升高,提示其可能与这些疾病相关。本综述旨在阐明琥珀酸对不同骨相关疾病的影响,并基于其作用机制探讨潜在的治疗靶点和相关药物分子。
关键词组:
References:
Open peer comments: Debate/Discuss/Question/Opinion
<1>
DOI:
10.1631/FITEE.2400406
CLC number:
TP391.4
Download Full Text:
Downloaded:
2759
Download summary:
<Click Here>Downloaded:
722Clicked:
1627
Cited:
0
On-line Access:
2025-04-03
Received:
2024-05-17
Revision Accepted:
2024-09-18
Crosschecked:
2025-04-07