dc.contributor.author | Bianchi R.A.C. | |
dc.contributor.author | Santos P.E. | |
dc.contributor.author | da Silva I.J. | |
dc.contributor.author | Celiberto L.A. | |
dc.contributor.author | Lopez de Mantaras R. | |
dc.date.accessioned | 2019-08-19T23:45:19Z | |
dc.date.available | 2019-08-19T23:45:19Z | |
dc.date.issued | 2018 | |
dc.identifier.citation | Bianchi, Reinaldo A. C.; Santos, Paulo E.; DA SILVA, ISAAC J.; CELIBERTO, LUIZ A.; LOPEZ DE MANTARAS, RAMON. Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, v. 1, p. 1, 2017. | |
dc.identifier.issn | 1573-0409 | |
dc.identifier.uri | https://repositorio.fei.edu.br/handle/FEI/1211 | |
dc.description.abstract | © 2017, Springer Science+Business Media B.V.Reinforcement Learning (RL) is a well-known technique for learning the solutions of control problems from the interactions of an agent in its domain. However, RL is known to be inefficient in problems of the real-world where the state space and the set of actions grow up fast. Recently, heuristics, case-based reasoning (CBR) and transfer learning have been used as tools to accelerate the RL process. This paper investigates a class of algorithms called Transfer Learning Heuristically Accelerated Reinforcement Learning (TLHARL) that uses CBR as heuristics within a transfer learning setting to accelerate RL. The main contributions of this work are the proposal of a new TLHARL algorithm based on the traditional RL algorithm Q(λ) and the application of TLHARL on two distinct real-robot domains: a robot soccer with small-scale robots and the humanoid-robot stability learning. Experimental results show that our proposed method led to a significant improvement of the learning rate in both domains. | |
dc.relation.ispartof | Journal of Intelligent and Robotic Systems: Theory and Applications | |
dc.rights | Acesso Restrito | |
dc.title | Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning | |
dc.type | Artigo | |