Recent advances in neurosciences and cognitive sciences show us that the human neocortex is not a slave to the experiences from our perception and that the memories stored in hippocampus are goal weighted during the replay of the experiences for the purpose of relearning from them. Temporal difference reinforcement learning systems that use neural networks as function approximators rely on an experience replay memory structure similar to the hippocampus. We bring forward this similarity and present a novel way of using a goal weighted prioritization of the memory that is biologically inspired. Furthermore, we introduce a novel prioritization criteria called Variety of Experience Index, or VEI, for weighting the selection of the experiences that are stored in the replay memory. Weighting the experiences based on two different extremes of VEI can behaviourally modify the agent's learning process, generating different types of learning agents that exhibit different personality traits along the dimension of Openness to Experience. (C) 2019 Elsevier B.V. All rights reserved.
|Titolo:||Towards learning agents with personality traits: Modeling Openness to Experience|
|Data di pubblicazione:||2019|
|Appare nelle tipologie:||01.1 Articolo in Rivista|