PAPINI, MATTEO

PAPINI, MATTEO  

DIPARTIMENTO DI ELETTRONICA, INFORMAZIONE E BIOINGEGNERIA  

Mostra records
Risultati 1 - 20 di 23 (tempo di esecuzione: 0.044 secondi).
Titolo Data di pubblicazione Autori File
Adaptive Batch Size for Safe Policy Gradients 1-gen-2017 PAPINI, MATTEOM. PirottaM. Restelli
Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration 1-gen-2020 M. PapiniA. BattistelloM. Restelli
Feature Selection via Mutual Information: New Theoretical Insights 1-gen-2019 Beraha M.Metelli A. M.Papini M.Tirinzoni A.Restelli M.
Gradient-Aware Model-Based Policy Search 1-gen-2020 Pierluca D'OroAlberto Maria MetelliAndrea TirinzoniMatteo PapiniMarcello Restelli
Importance Sampling Techniques for Policy Optimization 1-gen-2020 Metelli Alberto MariaPapini MatteoMontali NicoRestelli Marcello
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning 1-gen-2024 Alessandro MontenegroMarco MussiMatteo PapiniAlberto Maria Metelli
Learning Optimal Deterministic Policies with Stochastic Policy Gradients 1-gen-2024 Alessandro MontenegroMarco MussiAlberto Maria MetelliMatteo Papini
Leveraging Good Representations in Linear Contextual Bandits 1-gen-2021 Matteo PapiniAndrea TirinzoniMarcello RestelliAlessandro LazaricMatteo Pirotta
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits 1-gen-2022 Papini M. +
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs 1-gen-2024 Davide MaranAlberto Maria MetelliMatteo PapiniMarcello Restelli
No-Regret Reinforcement Learning in Smooth MDPs 1-gen-2024 Davide MaranAlberto Maria MetelliMatteo PapiniMarcello Restelli
Online Learning with Off-Policy Feedback 1-gen-2023 Gabbianelli G.Papini M. +
Online Learning with Off-Policy Feedback in Adversarial MDPs 1-gen-2024 F. BacchiocchiFE. StradiM. PapiniAM. MetelliN. Gatti
Optimistic Policy Optimization via Multiple Importance Sampling 1-gen-2019 Papini, MatteoMetelli, Alberto MariaLupo, LorenzoRestelli, Marcello
Policy Optimization as Online Learning with Mediator Feedback 1-gen-2021 Alberto Maria MetelliMatteo PapiniPierluca D'OroMarcello Restelli
Policy optimization via importance sampling 1-gen-2018 Metelli A. M.Papini M.Restelli M. +
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs 1-gen-2024 Davide MaranAlberto Maria MetelliMatteo PapiniMarcello Restelli
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection 1-gen-2021 Matteo PapiniAndrea TirinzoniMarcello RestelliAlessandro LazaricMatteo Pirotta +
Risk-Averse Trust Region Optimization for Reward-Volatility Reduction 1-gen-2020 L. BisiL. SabbioniE. VittoriM. PapiniM. Restelli
Sample complexity of variance-reduced policy gradient: weaker assumptions and lower bounds 1-gen-2024 Papini, MatteoMetelli, Alberto MariaRestelli, Marcello +