Full Text:   <4162>

Summary:  <1642>

CLC number: TP18

On-line Access: 2024-08-27

Received: 2023-10-17

Revision Accepted: 2024-05-08

Crosschecked: 2020-06-04

Cited: 0

Clicked: 6179

Citations:  Bibtex RefMan EndNote GB/T7714

 ORCID:

Hao-nan Wang

https://orcid.org/0000-0002-0792-3858

-   Go to

Article info.
Open peer comments

Frontiers of Information Technology & Electronic Engineering  2020 Vol.21 No.12 P.1726-1744

http://doi.org/10.1631/FITEE.1900533


Deep reinforcement learning: a survey


Author(s):  Hao-nan Wang, Ning Liu, Yi-yun Zhang, Da-wei Feng, Feng Huang, Dong-sheng Li, Yi-ming Zhang

Affiliation(s):  Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology, Changsha 410000, China

Corresponding email(s):   wanghaonan14@nudt.edu.cn, liuning17a@nudt.edu.cn, zhangyiyun213@163.com, fengdawei@nudt.edu.cn, huangfeng@nudt.edu.cn, dsli@nudt.edu.cn, zhangyiming@nudt.edu.cn

Key Words:  Reinforcement learning, Deep reinforcement learning, Reinforcement learning applications



Abstract: 
Deep reinforcement learning (RL) has become one of the most popular topics in artificial intelligence research. It has been widely used in various fields, such as end-to-end control, robotic control, recommendation systems, and natural language dialogue systems. In this survey, we systematically categorize the deep RL algorithms and applications, and provide a detailed review over existing deep RL algorithms by dividing them into model-based methods, model-free methods, and advanced RL methods. We thoroughly analyze the advances including exploration, inverse RL, and transfer RL. Finally, we outline the current representative applications, and analyze four open problems for future research.

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2025 Journal of Zhejiang University-SCIENCE