IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

This thesis covers some algorithmic aspects of online machine learning and optimization. In Chapter 1 we design algorithms with state-of-the-art regret guarantees for the problem dynamic pricing. In Chapter 2 we move on to an asynchronous online learning setting in which only some of the agents in the network are active at each time step. We show that when information is shared among neighbors, knowledge about the graph structure might have a significantly different impact on learning rates depending on how agents are activated. In Chapter 3 we investigate the online problem of multivariate non-concave maximization under weak assumptions on the regularity of the objective function. In Chapter 4 we introduce a new performance measure and design an efficient algorithm to learn optimal policies in repeated A/B testing.

ALGORITHMS, LEARNING, AND OPTIMIZATION / T.r. Cesari ; supervisor: N. A. CESA BIANCHI ; phd coordinator: P. Boldi. DIPARTIMENTO DI INFORMATICA "Giovanni Degli Antoni", 2020 Jan 31. 32. ciclo, Anno Accademico 2019. [10.13130/cesari-tommaso-renato_phd2020-01-31].

ALGORITHMS, LEARNING, AND OPTIMIZATION

T.R. Cesari

2020

Abstract

This thesis covers some algorithmic aspects of online machine learning and optimization. In Chapter 1 we design algorithms with state-of-the-art regret guarantees for the problem dynamic pricing. In Chapter 2 we move on to an asynchronous online learning setting in which only some of the agents in the network are active at each time step. We show that when information is shared among neighbors, knowledge about the graph structure might have a significantly different impact on learning rates depending on how agents are activated. In Chapter 3 we investigate the online problem of multivariate non-concave maximization under weak assumptions on the regularity of the objective function. In Chapter 4 we introduce a new performance measure and design an efficient algorithm to learn optimal policies in repeated A/B testing.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di discussione
	
				31-gen-2020
			
	Settori scientifico-disciplinari della tesi
	
				Settore INF/01 - Informatica
			
	Parole chiave
	
				machine learning theory; online learning; online optimization; cooperative learning; dynamic pricing; posted price
			
	Tutor afferenti all'Ateneo
	
				CESA BIANCHI, NICOLO' ANTONIO
			
	Supervisori e coordinatori afferenti all'Ateneo
	
				CESA BIANCHI, NICOLO' ANTONIO
BOLDI, PAOLO
			
	Tipologia
	
				Doctoral Thesis
			
	Citazione
	
				ALGORITHMS, LEARNING, AND OPTIMIZATION / T.r. Cesari ; supervisor: N. A. CESA BIANCHI ; phd coordinator: P. Boldi. DIPARTIMENTO DI INFORMATICA "Giovanni Degli Antoni", 2020 Jan 31. 32. ciclo, Anno Accademico 2019. [10.13130/cesari-tommaso-renato_phd2020-01-31].
			
	Appare nelle tipologie:
	
				Tesi di dottorato

File in questo prodotto:

File	Dimensione	Formato
phd_unimi_R11657.pdf accesso aperto Descrizione: Articolo principale Tipologia: Tesi di dottorato completa Dimensione 1.11 MB Formato Adobe PDF Visualizza/Apri	1.11 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/699354

Citazioni

ND

ND

ND

social impact