In this paper, we discuss situations arising with reinforcement learning algorithms, when the reinforcement is delayed. The decision to consider delayed reinforcement is typical in many applications, and we discuss some motivations for it. Then, we summarize Q-Learning, a popular algorithm to deal with delayed reinforcement, and its recent extensions to use it to learn fuzzy logic structures (Fuzzy Q-Learning). Moreover, we present how a reinforcement learning algorithm we have developed in the past (ELF - Evolutionary Learning of Fuzzy rules) implements an extension of the popular Q-Learning algorithm for the distribution of delayed reinforcement when the controller to be learnt is a Fuzzy Logic Controller (FLC). Finally, we present some examples of the application of ELF to learning FLCs that implement behaviors for an autonomous agent.

Delayed Reinforcement, Fuzzy Q-Learning and Fuzzy Logic Controllers

BONARINI, ANDREA
1996-01-01

Abstract

In this paper, we discuss situations arising with reinforcement learning algorithms, when the reinforcement is delayed. The decision to consider delayed reinforcement is typical in many applications, and we discuss some motivations for it. Then, we summarize Q-Learning, a popular algorithm to deal with delayed reinforcement, and its recent extensions to use it to learn fuzzy logic structures (Fuzzy Q-Learning). Moreover, we present how a reinforcement learning algorithm we have developed in the past (ELF - Evolutionary Learning of Fuzzy rules) implements an extension of the popular Q-Learning algorithm for the distribution of delayed reinforcement when the controller to be learnt is a Fuzzy Logic Controller (FLC). Finally, we present some examples of the application of ELF to learning FLCs that implement behaviors for an autonomous agent.
1996
Genetic Algorithms and Soft Computing
9783790809565
Reinforcement learning; fuzzy systems; Q-learning
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/666845
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact