DECAF: Deep Case-based Policy Inference for Knowledge Transfer in Reinforcement Learning

GLATT, RUBEN; SILVA, FELIPE LENO DA; Reinaldo Bianchi; COSTA, ANNA HELENA REALI

dc.rights.license	Creative Commons "Este é um artigo publicado em acesso aberto sob uma licença Creative Commons (CC BY-NC-ND 4.0). Fonte: https://www.sciencedirect.com/science/article/pii/S095741742030244X?via%3Dihub. Acesso em: 23 nov. 2021.
dc.contributor.author	GLATT, RUBEN
dc.contributor.author	SILVA, FELIPE LENO DA
dc.contributor.author	Reinaldo Bianchi
dc.contributor.author	COSTA, ANNA HELENA REALI
dc.date.accessioned	2021-11-24T00:35:41Z
dc.date.accessioned	2023-05-03T20:33:19Z
dc.date.available	2021-11-24T00:35:41Z
dc.date.available	2023-05-03T20:33:19Z
dc.date.issued	2020-10-15
dc.identifier.citation	GLATT, R.; S., F. L. DA; BIANCHI, R. A. DA C.; COSTA, A. H R. DECAF: Deep Case-based Policy Inference for Knowledge Transfer in Reinforcement Learning. EXPERT SYSTEMS WITH APPLICATIONS, v. 1, p. 113420, 2020.
dc.identifier.issn	0957-4174
dc.identifier.uri	https://hdl.handle.net/20.500.12032/88596
dc.description.abstract	Having the ability to solve increasingly complex problems using Reinforcement Learning (RL) has prompted researchers to start developing a greater interest in systematic approaches to retain and reuse knowledge over a variety of tasks. With Case-based Reasoning (CBR) there exists a general methodology that provides a framework for knowledge transfer which has been underrepresented in the RL literature so far. We for- mulate a terminology for the CBR framework targeted towards RL researchers with the goal of facilitating communication between the respective research communities. Based on this framework, we propose the Deep Case-based Policy Inference (DECAF) algorithm to accelerate learning by building a library of cases and reusing them if they are similar to a new task when training a new policy. DECAF guides the train- ing by dynamically selecting and blending policies according to their usefulness for the current target task, reusing previously learned policies for a more effective exploration but still enabling the adaptation to particularities of the new task. We show an empirical evaluation in the Atari game playing domain depicting the benefits of our algorithm with regards to sample efficiency, robustness against negative transfer, and performance increase when compared to state-of-the-art methods.
dc.relation.ispartof	EXPERT SYSTEMS WITH APPLICATIONS
dc.rights	Acesso Restrito
dc.subject	Deep Reinforcement Learning
dc.subject	Case-based Reasoning
dc.subject	Transfer Learning
dc.subject	Knowledge discovery
dc.subject	Knowledge management
dc.subject	Neural networks
dc.title	DECAF: Deep Case-based Policy Inference for Knowledge Transfer in Reinforcement Learning	pt_BR
dc.type	Artigo	pt_BR

Ficheros en el ítem

Ficheros	Tamaño	Formato	Ver

Este ítem aparece en la(s) siguiente(s) colección(ones)

Documentos - CUFEI

Mostrar el registro sencillo del ítem