RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

We study the problem of designing mechanisms for information acquisition scenarios. This setting models strategic interactions between an uninformed receiver and a set of informed senders. In our model the senders receive information about the underlying state of nature and communicate their observation (either truthfully or not) to the receiver, which, based on this information, selects an action. Our goal is to design mechanisms maximizing the receiver's utility while incentivizing the senders to report truthfully their information. First, we provide an algorithm that efficiently computes an optimal incentive compatible (IC) mechanism. Then, we focus on the online problem in which the receiver sequentially interacts in an unknown game, with the objective of minimizing the cumulative regret w.r.t. the optimal IC mechanism, and the cumulative violation of the incentive compatibility constraints. We investigate two different online scenarios, i.e., the full and bandit feedback settings. For the full feedback problem, we propose an algorithm that guarantees Õ(√T) regret and violation, while for the bandit feedback setting we present an algorithm that attains Õ(Tα) regret and Õ(T1-α/2) violation for any α ∈ [1/2,1]. Finally, we complement our results providing a tight lower bound.

Online Mechanism Design for Information Acquisition

Cacciamani F.;Castiglioni Matteo;Gatti N.

2023-01-01

Abstract

We study the problem of designing mechanisms for information acquisition scenarios. This setting models strategic interactions between an uninformed receiver and a set of informed senders. In our model the senders receive information about the underlying state of nature and communicate their observation (either truthfully or not) to the receiver, which, based on this information, selects an action. Our goal is to design mechanisms maximizing the receiver's utility while incentivizing the senders to report truthfully their information. First, we provide an algorithm that efficiently computes an optimal incentive compatible (IC) mechanism. Then, we focus on the online problem in which the receiver sequentially interacts in an unknown game, with the objective of minimizing the cumulative regret w.r.t. the optimal IC mechanism, and the cumulative violation of the incentive compatibility constraints. We investigate two different online scenarios, i.e., the full and bandit feedback settings. For the full feedback problem, we propose an algorithm that guarantees Õ(√T) regret and violation, while for the bandit feedback setting we present an algorithm that attains Õ(Tα) regret and Õ(T1-α/2) violation for any α ∈ [1/2,1]. Finally, we complement our results providing a tight lower bound.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2023
			
	Titolo del libro
	
				Proceedings of Machine Learning Research
			
	Titolo della collana
	
				PROCEEDINGS OF MACHINE LEARNING RESEARCH
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1260586

Citazioni

ND

1

0

social impact