Romain Laroche
Microsoft Research
Verified email at polytechnique.org
Title
Cited by
Year
Hybrid reward architecture for reinforcement learning
H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang
Advances in Neural Information Processing Systems 30, 2017
Cited by: 222; Year: 2017
Safe policy improvement with baseline bootstrapping
R Laroche, P Trichelair, R Tachet des Combes
International Conference on Machine Learning, 3652-3661, 2019
Cited by: 142; Year: 2019
Contextual bandit for active learning: Active thompson sampling
D Bouneffouf, R Laroche, T Urvoy, R Féraud, R Allesiardo
International Conference on Neural Information Processing, 405-412, 2014
Cited by: 76; Year: 2014
Counting to explore and generalize in text-based games
X Yuan, MA Côté, A Sordoni, R Laroche, R Tachet des Combes, M Hausknecht, ...
arXiv preprint arXiv:1806.11525, 2018
Cited by: 45; Year: 2018
Learning dynamic belief graphs to generalize on text-based games
A Adhikari, X Yuan, MA Côté, M Zelinka, MA Rondeau, R Laroche, ...
Advances in Neural Information Processing Systems 33, 3045-3057, 2020
Cited by: 44; Year: 2020
Transfer reinforcement learning with shared dynamics
R Laroche, M Barlier
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by: 43; Year: 2017
Score-based inverse reinforcement learning
L El Asri, B Piot, M Geist, R Laroche, O Pietquin
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2016
Cited by: 38; Year: 2016
Transfer Learning for User Adaptation in Spoken Dialogue Systems.
A Genevay, R Laroche
AAMAS, 975-983, 2016
Cited by: 36; Year: 2016
Human-machine dialogue as a stochastic game
M Barlier, J Perolat, R Laroche, O Pietquin
16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015
Cited by: 33; Year: 2015
Nastia: Negotiating appointment setting interface
L El Asri, R Lemonnier, R Laroche, O Pietquin, H Khouzaimi
Proceedings of the Ninth International Conference on Language Resources and …, 2014
Cited by: 30; Year: 2014
Reward function learning for dialogue management
L El Asri, R Laroche, O Pietquin
STAIRS 2012, 95-106, 2012
Cited by: 28; Year: 2012
Optimising turn-taking strategies with reinforcement learning
H Khouzaimi, R Laroche, F Lefevre
Proceedings of the 16th Annual Meeting of the Special Interest Group on …, 2015
Cited by: 27; Year: 2015
Reinforcement learning algorithm selection
R Laroche, R Feraud
arXiv preprint arXiv:1701.08810, 2017
Cited by: 26; Year: 2017
Reward shaping for statistical optimisation of dialogue management
L El Asri, R Laroche, O Pietquin
International Conference on Statistical Language and Speech Processing, 93-101, 2013
Cited by: 26; Year: 2013
Hybridisation of expertise and reinforcement learning in dialogue systems
R Laroche, G Putois, P Bretier, B Bouchon-Meunier
Tenth Annual Conference of the International Speech Communication Association, 2009
Cited by: 26; Year: 2009
Learning dynamic knowledge graphs to generalize on text-based games
A Adhikari, X Yuan, MA Côté, M Zelinka, MA Rondeau, R Laroche, ...
Cited by: 25; Year: 2020
Multi-advisor reinforcement learning
R Laroche, M Fatemi, J Romoff, H van Seijen
arXiv preprint arXiv:1704.00756, 2017
Cited by: 25; Year: 2017
Safe policy improvement with soft baseline bootstrapping
K Nadjahi, R Laroche, R Tachet des Combes
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2019
Cited by: 23; Year: 2019
Task completion transfer learning for reward inference
L El Asri, R Laroche, O Pietquin
Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
Cited by: 22; Year: 2014
Ordinal regression for interaction quality prediction
L El Asri, H Khouzaimi, R Laroche, O Pietquin
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 20; Year: 2014