Full Text:  <542>

CLC number: 

On-line Access: 2024-08-27

Received: 2023-10-17

Revision Accepted: 2024-05-08

Crosschecked: 0000-00-00

Cited: 0

Clicked: 956

Citations:  Bibtex RefMan EndNote GB/T7714

-   Go to

Article info.
Open peer comments

Frontiers of Information Technology & Electronic Engineering 

Accepted manuscript available online (unedited version)


Domain adaptation in reinforcement learning: a comprehensive and systematic study


Author(s):  Amirfarhad FARHADI, Mitra MIRZAREZAEE, Arash SHARIFI, Mohammad TESHNEHLAB

Affiliation(s):  Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Sattari Hwy, Iran; more

Corresponding email(s):  arash.sharifi@srbiau.ac.ir

Key Words:  Reinforcement learning; Domain adaptation; Machine learning; Survey


Share this article to: More <<< Previous Paper|Next Paper >>>

Amirfarhad FARHADI, Mitra MIRZAREZAEE, Arash SHARIFI, Mohammad TESHNEHLAB. Domain adaptation in reinforcement learning: a comprehensive and systematic study[J]. Frontiers of Information Technology & Electronic Engineering,in press.https://doi.org/10.1631/FITEE.2300668

@article{title="Domain adaptation in reinforcement learning: a comprehensive and systematic study",
author="Amirfarhad FARHADI, Mitra MIRZAREZAEE, Arash SHARIFI, Mohammad TESHNEHLAB",
journal="Frontiers of Information Technology & Electronic Engineering",
year="in press",
publisher="Zhejiang University Press & Springer",
doi="https://doi.org/10.1631/FITEE.2300668"
}

%0 Journal Article
%T Domain adaptation in reinforcement learning: a comprehensive and systematic study
%A Amirfarhad FARHADI
%A Mitra MIRZAREZAEE
%A Arash SHARIFI
%A Mohammad TESHNEHLAB
%J Frontiers of Information Technology & Electronic Engineering
%P
%@ 2095-9184
%D in press
%I Zhejiang University Press & Springer
doi="https://doi.org/10.1631/FITEE.2300668"

TY - JOUR
T1 - Domain adaptation in reinforcement learning: a comprehensive and systematic study
A1 - Amirfarhad FARHADI
A1 - Mitra MIRZAREZAEE
A1 - Arash SHARIFI
A1 - Mohammad TESHNEHLAB
J0 - Frontiers of Information Technology & Electronic Engineering
SP -
EP -
%@ 2095-9184
Y1 - in press
PB - Zhejiang University Press & Springer
ER -
doi="https://doi.org/10.1631/FITEE.2300668"


Abstract: 
Reinforcement learning (RL) has shown significant potential for dealing with complex decision-making problems. However, its performance relies heavily on the availability of a large amount of high-quality data. In many real-world situations, the data distribution in the target domain may differ significantly from that in the source domain, leading to a significant drop in the performance of RL algorithms. Domain adaptation (DA) strategies have been proposed to address this issue by transferring knowledge from a source domain to a target domain. However, there have been no comprehensive and in-depth studies to evaluate these approaches. In this paper we present a comprehensive and systematic study of DA in RL. We first introduce the basic concepts and formulations of DA in RL and then review the existing DA methods used in RL. Our main objective was to fill the existing literature gap regarding DA in RL. To achieve this, we conducted a rigorous evaluation of state-of-the-art DA approaches. We aimed to provide comprehensive insights into DA in RL and contribute to advancing knowledge in this field. The existing DA approaches are divided into seven categories based on application domains. The approaches in each category are discussed based on the important data adaptation metrics, and then their key characteristics are described. Finally, challenging issues and future research trends are highlighted to assist researchers in developing innovative improvements.

Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article

Reference

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2024 Journal of Zhejiang University-SCIENCE