Alessandro Lazaric
Alessandro Lazaric
Research Scientist, Facebook Artificial Intelligence Research
Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα inria.fr - Αρχική σελίδα
Τίτλος
Παρατίθεται από
Παρατίθεται από
Έτος
Best arm identification: A unified approach to fixed budget and fixed confidence
V Gabillon, M Ghavamzadeh, A Lazaric
NIPS-Twenty-Sixth Annual Conference on Neural Information Processing Systems, 2012
2282012
Transfer in reinforcement learning: a framework and a survey
A Lazaric
Reinforcement Learning, 143-173, 2012
2242012
Transfer of samples in batch reinforcement learning
A Lazaric, M Restelli, A Bonarini
Proceedings of the 25th international conference on Machine learning, 544-551, 2008
1562008
Linear thompson sampling revisited
M Abeille, A Lazaric
Artificial Intelligence and Statistics, 176-184, 2017
1382017
Reinforcement learning in continuous action spaces through sequential monte carlo methods
A Lazaric, M Restelli, A Bonarini
Advances in neural information processing systems 20, 833-840, 2007
1372007
Risk-aversion in multi-armed bandits
A Sani, A Lazaric, R Munos
arXiv preprint arXiv:1301.1936, 2013
1222013
Bayesian multi-task reinforcement learning
A Lazaric, M Ghavamzadeh
ICML-27th International Conference on Machine Learning, 599-606, 2010
1102010
Best-arm identification in linear bandits
M Soare, A Lazaric, R Munos
Advances in Neural Information Processing Systems 27, 828-836, 2014
1032014
Finite-sample analysis of least-squares policy iteration
A Lazaric, M Ghavamzadeh, R Munos
Journal of Machine Learning Research 13, 3041-3074, 2012
972012
Multi-bandit best arm identification
V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck
972011
Analysis of a classification-based policy iteration algorithm
A Lazaric, M Ghavamzadeh, R Munos
ICML-27th International Conference on Machine Learning, 607-614, 2010
852010
Upper-confidence-bound algorithms for active learning in multi-armed bandits
A Carpentier, A Lazaric, M Ghavamzadeh, R Munos, P Auer
International Conference on Algorithmic Learning Theory, 189-203, 2011
812011
Finite-sample analysis of LSTD
A Lazaric, M Ghavamzadeh, R Munos
ICML-27th International Conference on Machine Learning, 615-622, 2010
782010
Truthful learning mechanisms for multi-slot sponsored search auctions with externalities
N Gatti, A Lazaric, M Rocco, F Trovò
Artificial Intelligence 227, 93-139, 2015
762015
Reinforcement learning of POMDPs using spectral methods
K Azizzadenesheli, A Lazaric, A Anandkumar
Conference on Learning Theory, 193-256, 2016
722016
Sequential transfer in multi-armed bandit with finite set of models
MG Azar, A Lazaric, E Brunskill
arXiv preprint arXiv:1307.6887, 2013
712013
Learning near optimal policies with low inherent bellman error
A Zanette, A Lazaric, M Kochenderfer, E Brunskill
International Conference on Machine Learning, 10978-10989, 2020
632020
Efficient bias-span-constrained exploration-exploitation in reinforcement learning
R Fruit, M Pirotta, A Lazaric, R Ortner
International Conference on Machine Learning, 1578-1586, 2018
632018
LSTD with random projections
M Ghavamzadeh, A Lazaric, O Maillard, R Munos
Advances in Neural Information Processing Systems 23, 721-729, 2010
632010
Reinforcement distribution in fuzzy Q-learning
A Bonarini, A Lazaric, F Montrone, M Restelli
Fuzzy sets and systems 160 (10), 1420-1443, 2009
622009
Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.
Άρθρα 1–20