Csaba Szepesvari

Cited by

	All	Since 2019
Citations	34681	23030
h-index	79	70
i10-index	245	193

Citations per year

Year	Citations
2003	115
2004	96
2005	131
2006	96
2007	216
2008	323
2009	381
2010	531
2011	768
2012	833
2013	927
2014	1116
2015	1148
2016	1359
2017	1308
2018	1741
2019	2429
2020	3377
2021	4246
2022	4671
2023	4873
2024	3307

Public access

74 articles available
0 articles not available

Based on funding mandates

Co-authors

Name	Affiliation	Verified email domain
Tor Lattimore	DeepMind	google.com
Yasin Abbasi Yadkori	Google DeepMind	google.com
Rémi Munos	Google DeepMind	inria.fr
Branislav Kveton	Amazon	amazon.com
Dale Schuurmans	University of Alberta, Google DeepMind	cs.ualberta.ca
Kocsis Levente	MTA SZTAKI	sztaki.hu
Richard S. Sutton	Keen, Amii, and University of Alberta	richsutton.com
Dávid Pál	Staff Machine Learning Engineer, Instacart	instacart.com
Mohammad Ghavamzadeh	Amazon	amazon.com
András Antos	Budapest University of Technology and Economics	cs.bme.hu
Amir-massoud Farahmand	University of Toronto	cs.toronto.edu
Zheng Wen	Google DeepMind	google.com
Shalabh Bhatnagar	Professor in the Department of Computer Science and Automation, Indian Institute of Science	iisc.ac.in
Lorincz, Andras	Eotvos Lorand University	inf.elte.hu
Hamid Maei	Netflix	netflix.com
Mengdi Wang	Center for Statistics & Machine Learning, ECE, Princeton University	princeton.edu
Nevena Lazic	DeepMind	google.com
Michael Littman	Brown University	brown.edu
Jincheng Mei	Research Scientist, Google Brain	google.com
Doina Precup	DeepMind and McGill University	cs.mcgill.ca


Csaba Szepesvari

DeepMind & University of Alberta

Verified email at cs.ualberta.ca

machine learning, learning theory, online learning, reinforcement learning, Markov Decision Processes


Title	Authors	Publication	Cited by	Year
Bandit based monte-carlo planning	L Kocsis, C Szepesvári	European conference on machine learning, 282-293, 2006	4298	2006
Bandit algorithms	T Lattimore, C Szepesvári	Cambridge University Press, 2020	2867	2020
Algorithms for Reinforcement Learning	C Szepesvari	Morgan and Claypool, 2010	2117*	2010
Improved algorithms for linear stochastic bandits	Y Abbasi-Yadkori, C Szepesvári, D Pál	Advances in Neural Information Processing Systems, 2312-2320, 2011	1983	2011
Convergence results for single-step on-policy reinforcement-learning algorithms	S Singh, T Jaakkola, ML Littman, C Szepesvári	Machine learning 38, 287-308, 2000	1003	2000
Exploration–exploitation tradeoff using variance estimates in multi-armed bandits	JY Audibert, R Munos, C Szepesvári	Theoretical Computer Science 410 (19), 1876-1902, 2009	778	2009
Fast gradient-descent methods for temporal-difference learning with linear function approximation	RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ...	Proceedings of the 26th annual international conference on machine learning …, 2009	716	2009
Finite-Time Bounds for Fitted Value Iteration.	R Munos, C Szepesvári	Journal of Machine Learning Research 9 (5), 2008	631	2008
Parametric bandits: The generalized linear case	S Filippi, O Cappe, A Garivier, C Szepesvári	Advances in neural information processing systems 23, 2010	535	2010
X-Armed Bandits.	S Bubeck, R Munos, G Stoltz, C Szepesvári	Journal of Machine Learning Research 12 (5), 2011	505	2011
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path	A Antos, C Szepesvári, R Munos	Machine Learning 71, 89-129, 2008	502	2008
Learning with a strong adversary	R Huang, B Xu, D Schuurmans, C Szepesvári	arXiv preprint arXiv:1511.03034, 2015	443	2015
Regret bounds for the adaptive control of linear quadratic systems	Y Abbasi-Yadkori, C Szepesvári	Proceedings of the 24th Annual Conference on Learning Theory, 1-26, 2011	426	2011
A generalized reinforcement-learning model: Convergence and applications	ML Littman, C Szepesvári	ICML 96, 310-318, 1996	348	1996
Convergent temporal-difference learning with arbitrary smooth function approximation	H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton	Advances in neural information processing systems 22, 2009	345	2009
Toward off-policy learning control with function approximation.	HR Maei, C Szepesvári, S Bhatnagar, RS Sutton	ICML 10, 719-726, 2010	337	2010
Tight regret bounds for stochastic combinatorial semi-bandits	B Kveton, Z Wen, A Ashkan, C Szepesvari	Artificial Intelligence and Statistics, 535-543, 2015	322	2015
The grand challenge of computer Go: Monte Carlo tree search and extensions	S Gelly, L Kocsis, M Schoenauer, M Sebag, D Silver, C Szepesvári, ...	Communications of the ACM 55 (3), 106-113, 2012	321	2012
Model-based reinforcement learning with value-targeted regression	A Ayoub, Z Jia, C Szepesvari, M Wang, L Yang	International Conference on Machine Learning, 463-474, 2020	319	2020
Apprenticeship learning using inverse reinforcement learning and gradient methods	G Neu, C Szepesvári	arXiv preprint arXiv:1206.5264, 2012	318	2012


Articles 1–20
