PIROTTA, MATTEO
PIROTTA, MATTEO
DIPARTIMENTO DI ELETTRONICA, INFORMAZIONE E BIOINGEGNERIA
Mostra
records
Risultati 1 - 7 di 7 (tempo di esecuzione: 0.013 secondi).
Gaussian approximation for bias reduction in Q-learning
2021-01-01 D'Eramo, C.; Cini, A.; Nuara, A.; Pirotta, M.; Alippi, C.; Peters, J.; Restelli, M.
Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation
2016-01-01 Parisi, Simone; Pirotta, Matteo; Restelli, Marcello
On the use of the policy gradient and Hessian in inverse reinforcement learning
2020-01-01 Metelli, A. M.; Pirotta, M.; Restelli, M.
Policy gradient in Lipschitz Markov Decision Processes
2015-01-01 Pirotta, Matteo; Restelli, Marcello; Bascetta, Luca
Policy Search for the Optimal Control of Markov Decision Processes: A Novel Particle-Based Iterative Scheme
2016-01-01 Manganini, Giorgio; Pirotta, Matteo; Restelli, Marcello; Piroddi, Luigi; Prandini, Maria
Safe policy iteration: A monotonically improving approximate policy iteration approach
2021-01-01 Metelli, A. M.; Pirotta, M.; Calandriello, D.; Restelli, M.
Smoothing policies and safe policy gradients
2022-01-01 Papini, M.; Pirotta, M.; Restelli, M.