Nino Vieillard

Παρατίθεται από

	Όλα	Από το 2019
Παραθέσεις	1111	1108
h-index	11	11
i10-index	11	11

540

270

135

405

2019202020212022202320246 37 145 167 219 533

Συν-συγγραφείς

Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα univ-lorraine.fr
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα univ-lille.fr
Olivier BachemResearch Scientist, Google BrainΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Robert DadashiGoogle DeepMindΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Léonard HussenotGoogle DeepMindΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Rémi MunosDeepMindΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα inria.fr
Tadashi KozunoOmron Sinic XΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα sinicx.com
Piotr StanczykGoogleΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Shideh RezaeifarPhD Student, Department of Computer Science, University of GenevaΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα unige.ch
Sabela RamosSoftware Engineer. Google.Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Johan FerretResearch Scientist, Google DeepMindΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Geoffrey CideronGoogle DeepMindΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Rishabh AgarwalSenior Research Scientist, Google DeepMindΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Nikola MomchevGoogleΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Lior ShaniGoogle ResearchΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Roee AharoniGoogle ResearchΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Paul RoitPhD studentΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα cs.biu.ac.il
Idan SzpektorGoogle ResearchΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
Orgad KellerGoogle ResearchΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com
avinatan hassidimΗ διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα biu.ac.il

Παρακολούθηση

Nino Vieillard

Google DeepMind

Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα google.com

Reinforcement Learning


Τίτλος Ταξινόμηση με βάση τις αναφορές Ταξινόμηση κατά έτος Ταξινόμηση κατά τίτλο	Παρατίθεται από Παρατίθεται από	Έτος
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	463	2023
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2022	231	2022
Leverage the average: an analysis of kl regularization in reinforcement learning N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist Advances in Neural Information Processing Systems 33, 12163-12174, 2020	99*	2020
Munchausen reinforcement learning N Vieillard, O Pietquin, M Geist Advances in Neural Information Processing Systems 33, 4235-4246, 2020	83	2020
Offline reinforcement learning as anti-exploration S Rezaeifar, R Dadashi, N Vieillard, L Hussenot, O Bachem, O Pietquin, ... Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 8106-8114, 2022	41	2022
Offline reinforcement learning with pseudometric learning R Dadashi, S Rezaeifar, N Vieillard, L Hussenot, O Pietquin, M Geist International Conference on Machine Learning, 2307-2318, 2021	35	2021
Momentum in reinforcement learning N Vieillard, B Scherrer, O Pietquin, M Geist International Conference on Artificial Intelligence and Statistics, 2529-2538, 2020	33	2020
On-policy distillation of language models: Learning from self-generated mistakes R Agarwal, N Vieillard, Y Zhou, P Stanczyk, SR Garea, M Geist, ... The Twelfth International Conference on Learning Representations, 2024	32*	2024
Factually consistent summarization via reinforcement learning with textual entailment feedback P Roit, J Ferret, L Shani, R Aharoni, G Cideron, R Dadashi, M Geist, ... arXiv preprint arXiv:2306.00186, 2023	27	2023
Deep conservative policy iteration N Vieillard, O Pietquin, M Geist Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 6070-6077, 2020	27	2020
On connections between constrained optimization and reinforcement learning N Vieillard, O Pietquin, M Geist arXiv preprint arXiv:1910.08476, 2019	18	2019
Warm: On the benefits of weight averaged reward models A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ... arXiv preprint arXiv:2401.12187, 2024	8	2024
Implicitly regularized rl with implicit q-values N Vieillard, M Andrychowicz, A Raichuk, O Pietquin, M Geist arXiv preprint arXiv:2108.07041, 2021	6	2021
Kl-entropy-regularized rl with a generative model is minimax optimal T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ... arXiv preprint arXiv:2205.14211, 2022	5	2022
Regularization and variance-weighted regression achieves minimax optimality in linear MDPs: theory and practice T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ... International Conference on Machine Learning, 17135-17175, 2023	2	2023
Training reinforcement learning agents using augmented temporal difference learning MF Geist, N Vieillard, OC Pietquin US Patent App. 17/347,264, 2021	1	2021

Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.

Άρθρα 1–16

Παραθέσεις ανά έτος

Διπλότυπες αναφορές

Συγχωνευμένες αναφορές

Προσθήκη από κοινού συγγραφέωνΣυν-συγγραφείς

Παρακολούθηση

Παρατίθεται από

Συν-συγγραφείς