PAPINI, MATTEO

DIPARTIMENTO DI ELETTRONICA, INFORMAZIONE E BIOINGEGNERIA  

Results 1 - 18 of 18 (execution time: 0.04 seconds).
Title | Publication date | Authors
Adaptive Batch Size for Safe Policy Gradients | 1-Jan-2017 | PAPINI, MATTEO; M. Pirotta; M. Restelli
Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration | 1-Jan-2020 | M. Papini; A. Battistello; M. Restelli
Feature Selection via Mutual Information: New Theoretical Insights | 1-Jan-2019 | Beraha M.; Metelli A. M.; Papini M.; Tirinzoni A.; Restelli M.
Gradient-Aware Model-Based Policy Search | 1-Jan-2020 | Pierluca D'Oro; Alberto Maria Metelli; Andrea Tirinzoni; Matteo Papini; Marcello Restelli
Importance Sampling Techniques for Policy Optimization | 1-Jan-2020 | Metelli Alberto Maria; Papini Matteo; Montali Nico; Restelli Marcello
Learning Optimal Deterministic Policies with Stochastic Policy Gradients | 1-Jan-2024 | Alessandro Montenegro; Marco Mussi; Alberto Maria Metelli; Matteo Papini
Leveraging Good Representations in Linear Contextual Bandits | 1-Jan-2021 | Matteo Papini; Andrea Tirinzoni; Marcello Restelli; Alessandro Lazaric; Matteo Pirotta
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits | 1-Jan-2022 | Papini M. +
Online Learning with Off-Policy Feedback | 1-Jan-2023 | Gabbianelli G.; Papini M. +
Optimistic Policy Optimization via Multiple Importance Sampling | 1-Jan-2019 | Papini, Matteo; Metelli, Alberto Maria; Lupo, Lorenzo; Restelli, Marcello
Policy Optimization as Online Learning with Mediator Feedback | 1-Jan-2021 | Alberto Maria Metelli; Matteo Papini; Pierluca D'Oro; Marcello Restelli
Policy optimization via importance sampling | 1-Jan-2018 | Metelli A. M.; Papini M.; Restelli M. +
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection | 1-Jan-2021 | Matteo Papini; Andrea Tirinzoni; Marcello Restelli; Alessandro Lazaric; Matteo Pirotta +
Risk-Averse Trust Region Optimization for Reward-Volatility Reduction | 1-Jan-2020 | L. Bisi; L. Sabbioni; E. Vittori; M. Papini; M. Restelli
Sample complexity of variance-reduced policy gradient: weaker assumptions and lower bounds | 1-Jan-2024 | Papini, Matteo; Metelli, Alberto Maria; Restelli, Marcello +
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees | 1-Jan-2022 | Tirinzoni A.; Papini M.; Lazaric A.; Pirotta M. +
Smoothing policies and safe policy gradients | 1-Jan-2022 | Papini M.; Pirotta M.; Restelli M.
Stochastic Variance-Reduced Policy Gradient | 1-Jan-2018 | PAPINI, MATTEO; CANONACO, GIUSEPPE; Pirotta, Matteo; Restelli, Marcello +