dc.rights.license | Creative Commons "Este é um artigo publicado em acesso aberto sob uma licença Creative Commons (CC BY-NC-ND 4.0). Fonte: https://www.sciencedirect.com/science/article/pii/S095741742030244X?via%3Dihub. Acesso em: 23 nov. 2021. | |
dc.contributor.author | GLATT, RUBEN | |
dc.contributor.author | SILVA, FELIPE LENO DA | |
dc.contributor.author | Reinaldo Bianchi | |
dc.contributor.author | COSTA, ANNA HELENA REALI | |
dc.date.accessioned | 2021-11-24T00:35:41Z | |
dc.date.accessioned | 2023-05-03T20:33:19Z | |
dc.date.available | 2021-11-24T00:35:41Z | |
dc.date.available | 2023-05-03T20:33:19Z | |
dc.date.issued | 2020-10-15 | |
dc.identifier.citation | GLATT, R.; S., F. L. DA; BIANCHI, R. A. DA C.; COSTA, A. H R. DECAF: Deep Case-based Policy Inference for Knowledge Transfer in Reinforcement Learning. EXPERT SYSTEMS WITH APPLICATIONS, v. 1, p. 113420, 2020. | |
dc.identifier.issn | 0957-4174 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12032/88596 | |
dc.description.abstract | Having the ability to solve increasingly complex problems using Reinforcement Learning (RL) has prompted researchers to start developing a greater interest in systematic approaches to retain and reuse knowledge over a variety of tasks. With Case-based Reasoning (CBR) there exists a general methodology that provides a framework for knowledge transfer which has been underrepresented in the RL literature so far. We for- mulate a terminology for the CBR framework targeted towards RL researchers with the goal of facilitating communication between the respective research communities. Based on this framework, we propose the Deep Case-based Policy Inference (DECAF) algorithm to accelerate learning by building a library of cases and reusing them if they are similar to a new task when training a new policy. DECAF guides the train- ing by dynamically selecting and blending policies according to their usefulness for the current target task, reusing previously learned policies for a more effective exploration but still enabling the adaptation to particularities of the new task. We show an empirical evaluation in the Atari game playing domain depicting the benefits of our algorithm with regards to sample efficiency, robustness against negative transfer, and performance increase when compared to state-of-the-art methods. | |
dc.relation.ispartof | EXPERT SYSTEMS WITH APPLICATIONS | |
dc.rights | Acesso Restrito | |
dc.subject | Deep Reinforcement Learning | |
dc.subject | Case-based Reasoning | |
dc.subject | Transfer Learning | |
dc.subject | Knowledge discovery | |
dc.subject | Knowledge management | |
dc.subject | Neural networks | |
dc.title | DECAF: Deep Case-based Policy Inference for Knowledge Transfer in Reinforcement Learning | pt_BR |
dc.type | Artigo | pt_BR |