PAPINI, MATTEO

PAPINI, MATTEO  

DIPARTIMENTO DI ELETTRONICA, INFORMAZIONE E BIOINGEGNERIA  

Mostra records
Risultati 1 - 20 di 26 (tempo di esecuzione: 0.02 secondi).
Titolo Data di pubblicazione Autori File
Adaptive Batch Size for Safe Policy Gradients 1-gen-2017 PAPINI, MATTEOM. PirottaM. Restelli
Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration 1-gen-2020 M. PapiniA. BattistelloM. Restelli
Convergence Analysis of Policy Gradient Methods with Dynamic Stochasticity 1-gen-2025 A. MontenegroM. MussiM. PapiniA. M. Metelli
Feature Selection via Mutual Information: New Theoretical Insights 1-gen-2019 Beraha M.Metelli A. M.Papini M.Tirinzoni A.Restelli M.
Gradient-Aware Model-Based Policy Search 1-gen-2020 Pierluca D'OroAlberto Maria MetelliAndrea TirinzoniMatteo PapiniMarcello Restelli
Importance Sampling Techniques for Policy Optimization 1-gen-2020 Metelli Alberto MariaPapini MatteoMontali NicoRestelli Marcello
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning 1-gen-2024 Alessandro MontenegroMarco MussiMatteo PapiniAlberto Maria Metelli
Learning Optimal Deterministic Policies with Stochastic Policy Gradients 1-gen-2024 Alessandro MontenegroMarco MussiAlberto Maria MetelliMatteo Papini
Leveraging Good Representations in Linear Contextual Bandits 1-gen-2021 Matteo PapiniAndrea TirinzoniMarcello RestelliAlessandro LazaricMatteo Pirotta
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits 1-gen-2022 Papini M. +
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs 1-gen-2024 Davide MaranAlberto Maria MetelliMatteo PapiniMarcello Restelli
No-Regret Reinforcement Learning in Smooth MDPs 1-gen-2024 Davide MaranAlberto Maria MetelliMatteo PapiniMarcello Restelli
Online Learning with Off-Policy Feedback 1-gen-2023 Gabbianelli G.Papini M. +
Online Learning with Off-Policy Feedback in Adversarial MDPs 1-gen-2024 F. BacchiocchiFE. StradiM. PapiniAM. MetelliN. Gatti
Optimistic Policy Optimization via Multiple Importance Sampling 1-gen-2019 Papini, MatteoMetelli, Alberto MariaLupo, LorenzoRestelli, Marcello
Policy Gradient Methods with Adaptive Policy Spaces 1-gen-2024 Gianmarco TedeschiMatteo PapiniAlberto Maria MetelliMarcello Restelli
Policy Optimization as Online Learning with Mediator Feedback 1-gen-2021 Alberto Maria MetelliMatteo PapiniPierluca D'OroMarcello Restelli
Policy optimization via importance sampling 1-gen-2018 Metelli A. M.Papini M.Restelli M. +
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs 1-gen-2024 Davide MaranAlberto Maria MetelliMatteo PapiniMarcello Restelli
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection 1-gen-2021 Matteo PapiniAndrea TirinzoniMarcello RestelliAlessandro LazaricMatteo Pirotta +