2024_MARKO RUMAN
URL: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10752398
Title: Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function.
doi: 10.1109/ACCESS.2024.3497589
Open access: Yes
Working Group: WG1-WG2.
Resource Type: Research Paper.
Resource Format: PDF.
Author(s): MARKO RUMAN, TATIANA V. GUY
Description/Abstract: Deep reinforcement learning has demonstrated superhuman performance in complex decisionmaking tasks, but it struggles with generalization and knowledge reuse—key aspects of true intelligence. This article introduces a novel approach that modifies Cycle Generative Adversarial Networks specifically for reinforcement learning, enabling effective one-to-one knowledge transfer between two tasks. Our method enhances the loss function with two new components: model loss, which captures dynamic relationships between source and target tasks, and Q-loss, which identifies states significantly influencing the target decision policy. Tested on the 2-D Atari game Pong, our method achieved 100% knowledge transfer in identical tasks and either 100% knowledge transfer or a 30% reduction in training time for a rotated task, depending on the network architecture. In contrast, using standard Generative Adversarial Networks or Cycle Generative Adversarial Networks led to worse performance than training from scratch in the majority of cases. The results demonstrate that the proposed method ensured enhanced knowledge generalization in deep reinforcement learning.
Keywords:
Affiliation(s): A) Department of Adaptive Systems, Institute of Information Theory and Automation; B) Czech Academy of Sciences, 182 00 Prague, Czech Republic; C) Department of Information Engineering, Faculty of Economics and Management, Czech University of Life Sciences, 165 00 Prague, Czech Republic.
Publication/Creation Date: 13 November 2024.
Additional Information
Data last updated | December 18, 2024 |
---|---|
Metadata last updated | December 18, 2024 |
Created | December 18, 2024 |
Format | HTML |
License | Open Data Commons Attribution License |
Id | b4049262-188f-40f8-be5b-f988a751b442 |
Package id | a109b981-825e-44c5-bd01-26a9ae0dea23 |
State | active |