Deep search:Searching for "markov decision process" in 'ABSTRACT and KEYWORDGot 8 items.
index Title
1Modified reward function on abstract features in inverse reinforcement learning
Author(s):Shen-yi Chen, Hui Qian, Jia Fan, Z...  Clicked:10539  Download:3731  Cited:4  <Full Text>
Journal of Zhejiang University Science C  2010 Vol.11 No.9 P.718-723  DOI:10.1631/jzus.C0910486
2Convergence analysis of an incremental approach to online inverse reinforcement learning
Author(s):Zhuo-jun Jin, Hui Qian, Shen-yi Ch...  Clicked:10052  Download:4159  Cited:0  <Full Text>
Journal of Zhejiang University Science C  2011 Vol.12 No.1 P.17-24  DOI:10.1631/jzus.C1010010
3NIG-AP: a new method for automated penetration testing
Author(s):Tian-yang Zhou, Yi-chao Zang, Jun-...  Clicked:6951  Download:4268  Cited:0  <Full Text>  <PPT> 2129
Frontiers of Information Technology & Electronic Engineering  2019 Vol.20 No.9 P.1277-1288  DOI:10.1631/FITEE.1800532
4Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wirel...
Author(s):Xiaoyu LIU, Chi XU, Haibin YU, Pen...  Clicked:7421  Download:13273  Cited:0  <Full Text>  <PPT> 2157
Frontiers of Information Technology & Electronic Engineering  2022 Vol.23 No.1 P.47-60  DOI:10.1631/FITEE.2100331
5Joint power control and passive beamforming optimization in RIS-assisted anti-jamming communication
Author(s):Yang LIU, Kui XU, Xiaochen XIA, We...  Clicked:4211  Download:3826  Cited:0  <Full Text>  <PPT> 1065
Frontiers of Information Technology & Electronic Engineering  2023 Vol.24 No.12 P.1791-1802  DOI:10.1631/FITEE.2200646
6Multi-agent reinforcement learning behavioral control for nonlinear second-order systems
Author(s):Zhenyi ZHANG, Jie HUANG, Congjie PAN  Clicked:2904  Download:2939  Cited:0  <Full Text>  <PPT> 870
Frontiers of Information Technology & Electronic Engineering  2024 Vol.25 No.6 P.869-886  DOI:10.1631/FITEE.2300394
7PPDO: a privacy-preservation-aware delay optimization task-offloading algorithm for collaborative edge comp...
Author(s):Chao JING, Jianwu XU  Clicked:3343  Download:3452  Cited:0  <Full Text>  <PPT> 838
Frontiers of Information Technology & Electronic Engineering  2025 Vol.26 No.1 P.27-41  DOI:10.1631/FITEE.2300741
8SPID: a deep reinforcement learning-based solution framework for siting low-altitude takeoff and landing fa...
Author(s):Xiaocheng LIU, Meilong LE, Yupu LI...  Clicked:670  Download:570  Cited:0  <Full Text>
Frontiers of Information Technology & Electronic Engineering  2025 Vol.26 No.12 P.2397-2420  DOI:10.1631/FITEE.2500534
Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2026 Journal of Zhejiang University-SCIENCE