MUSSI, MARCO
DIPARTIMENTO DI ELETTRONICA, INFORMAZIONE E BIOINGEGNERIA
A Reinforcement Learning controller optimizing costs and battery State of Health in smart grids
2024-01-01 Mussi, Marco; Pellegrino, Luigi; Pindaro, Oscar Francesco; Restelli, Marcello; Trovò, Francesco
A voltage dynamic-based state of charge estimation method for batteries storage systems
2021-01-01 Mussi, M.; Pellegrino, L.; Restelli, M.; Trovò, Francesco
An online state of health estimation method for lithium-ion batteries based on time partitioning and data-driven model identification
2022-01-01 Mussi, M.; Pellegrino, L.; Restelli, M.; Trovò, Francesco
ARLO: A framework for Automated Reinforcement Learning
2023-01-01 Mussi, Marco; Lombarda, Davide; Metelli, Alberto Maria; Trovò, Francesco; Restelli, Marcello
Autoregressive Bandits
2024-01-01 Bacchiocchi, Francesco; Genalti, Gianmarco; Maran, Davide; Mussi, Marco; Restelli, Marcello; Gatti, Nicola; Metelli, Alberto Maria
Best Arm Identification for Stochastic Rising Bandits
2024-01-01 Mussi, Marco; Montenegro, Alessandro; Trovò, Francesco; Restelli, Marcello; Metelli, Alberto Maria
Convergence Analysis of Policy Gradient Methods with Dynamic Stochasticity
2025-01-01 Montenegro, A.; Mussi, M.; Papini, M.; Metelli, A. M.
Dynamic Pricing with Volume Discounts in Online Settings
2023-01-01 Mussi, M.; Genalti, G.; Nuara, A.; Trovò, F.; Restelli, M.; Gatti, N.
Dynamical Linear Bandits
2023-01-01 Mussi, Marco; Metelli, Alberto Maria; Restelli, Marcello
Factored-Reward Bandits with Intermediate Observations
2024-01-01 Mussi, M.; Drago, S.; Restelli, M.; Metelli, A. M.
Factored-Reward Bandits with Intermediate Observations: Regret Minimization and Best Arm Identification
2025-01-01 Mussi, Marco; Drago, Simone; Restelli, Marcello; Metelli, Alberto Maria
Generalizing the Regret: an Analysis of Lower and Upper Bounds
2025-01-01 Mussi, M.; Metelli, A. M.
Graph-Triggered Rising Bandits
2024-01-01 Genalti, G.; Mussi, M.; Gatti, N.; Restelli, M.; Castiglioni, M.; Metelli, A. M.
Human-AI interaction in safety-critical network infrastructures
2025-01-01 Mussi, M.; Metelli, A. M.; Restelli, M.; Losapio, G.; Bessa, R. J.; Boos, D.; Borst, C.; Leto, G.; Castagna, A.; Chavarriaga, R.; Dias, D.; Egli, A.; Eisenegger, A.; El Manyari, Y.; Fuxjager, A.; Geraldes, J.; Hamouche, S.; Hassouna, M.; Lemetayer, B.; Leyli-Abadi, M.; Liessner, R.; Lundberg, J.; Marot, A.; Meddeb, M.; Schiaffonati, V.; Schneider, M.; Stadelmann, T.; Usher, J.; Van Hoof, H.; Viebahn, J.; Waefler, T.; Zanotti, G.
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
2024-01-01 Montenegro, Alessandro; Mussi, Marco; Papini, Matteo; Metelli, Alberto Maria
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
2024-01-01 Montenegro, Alessandro; Mussi, Marco; Metelli, Alberto Maria; Papini, Matteo
Position: Constants are Critical in Regret Bounds for Reinforcement Learning
2025-01-01 Drago, Simone; Mussi, Marco; Metelli, Alberto Maria
Power Grid Control with Graph-Based Distributed Reinforcement Learning
2025-01-01 Fabrizio, Carlo; Losapio, Gianvito; Mussi, Marco; Metelli, Alberto Maria; Restelli, Marcello
Pricing the Long Tail by Explainable Product Aggregation and Monotonic Bandits
2022-01-01 Mussi, M.; Genalti, G.; Trovò, F.; Nuara, A.; Gatti, N.; Restelli, M.
Sleeping Reinforcement Learning
2025-01-01 Drago, Simone; Mussi, Marco; Metelli, Alberto Maria