Research

Control frequency adaptation via action persistence in batch reinforcement learning

Control frequency adaptation via action persistence in batch reinforcement learning Authors: Alberto Maria Metelli, Flavio Mazzolini, Lorenzo Bisi, Luca Sabbioni, Marcello Restelli Conference: ICML 2020 Abstract: The choice of the control frequency of a system has a relevant impact on the ability of reinforcement learning algorithms to learn a highly performing policy. In this paper, […]
Read More

Driving exploration by maximum distribution in gaussian process bandits

Driving exploration by maximum distribution in gaussian process bandits Authors: Alessandro Nuara, Francesco Trovò, Dominic Crippa, Nicola Gatti, Marcello Restelli Conference: AAMAS 2020 Abstract: The problem of finding optimal solutions of stochastic functions over continuous domains is common in several real-world applications, such as, e.g., advertisement allocation, dynamic pricing, and power control in wireless networks. […]
Read More

Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces

Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces Authors: Alberto Marchesi, Francesco Trovò, Nicola Gatti Conference: AAMAS 2020 Abstract: We tackle the problem of learning equilibria in simulation-based games. In such games, the players’ utility functions cannot be described analytically, as they are given through a black-box simulator that can […]
Read More

A combinatorial-bandit algorithm for the online joint bid/budget optimization of pay-per-click advertising campaigns

A combinatorial-bandit algorithm for the online joint bid/budget optimization of pay-per-click advertising campaigns Authors: Alessandro Nuara, Francesco Trovo, Nicola Gatti, Marcello Restelli Conference: AAAI 2018 Abstract: Pay-per-click advertising includes various formats (eg, search, contextual, and social) with a total investment of more than 140 billion USD per year. An advertising campaign is composed of some […]
Read More

Dealing with interdependencies and uncertainty in multi-channel advertising campaigns optimization

Dealing with interdependencies and uncertainty in multi-channel advertising campaigns optimization Authors: Alessandro Nuara, Nicola Sosio, Francesco TrovÃ, Maria Chiara Zaccardi, Nicola Gatti, Marcello Restelli Conference: WWW 2019 Abstract: In 2017, Internet ad spending reached 209 billion USD worldwide, while, e.g., TV ads brought in 178 billion USD. An Internet advertising campaign includes up to thousands […]
Read More

Targeting optimization for internet advertising by learning from logged bandit feedback

Targeting optimization for internet advertising by learning from logged bandit feedback Authors: Margherita Gasparini, Alessandro Nuara, Francesco Trovò, Nicola Gatti, Marcello Restelli Conference: IJCNN 2018 Abstract: In the last two decades, online advertising has become the most effective way to sponsor a product or an event. The success of this advertising format is mainly due […]
Read More

A characterization of quasi-perfect equilibria

A characterization of quasi-perfect equilibria Authors: Nicola Gatti, Mario Gilli, Alberto Marchesi Journal: Games and Economic Behavior Abstract: We provide a characterization of quasi-perfect equilibria in n-player games, showing that any quasi-perfect equilibrium can be obtained as limit point of a sequence of Nash equilibria of a certain class of perturbed games in sequence form, […]
Read More

Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving

Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving Authors: Nicola Gatti, Mario Gilli, Alberto MarchesiAmarildo Likmeta, Alberto Maria Metelli, Andrea Tirinzoni, Riccardo Giol, Marcello Restelli, Danilo Romano Journal: Robotics and Autonomous Systems Abstract: The design of high-level decision-making systems is a topical problem in the field of autonomous driving. […]
Read More