Sungsu Lim
Title
Cited by
Year
Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces
S Lim, A Joseph, L Le, Y Pan, M White
NeurIPS 2018, Deep Reinforcement Learning Workshop, https://arxiv.org/abs …, 2018
Cited by: 18*; Year: 2018
Maximizing Information Gain in Partially Observable Environments via Prediction Rewards
Y Satsangi, S Lim, S Whiteson, F Oliehoek, M White
AAMAS 2020, 2020
Cited by: 13; Year: 2020
Actor-Expert: A Framework for using Q-learning in Continuous Action Spaces
S Lim
University of Alberta, 2019
Cited by: 13; Year: 2019
Greedification operators for policy optimization: Investigating forward and reverse KL divergences
A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White
The Journal of Machine Learning Research 23 (1), 11474-11552, 2022
Cited by: 11; Year: 2022
An Empirical and Conceptual Categorization of Value-based Exploration Methods
N Yasui, S Lim, C Linke, A White, M White
Cited by: 1; Year: 2019
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement
S Neumann, S Lim, A Joseph, Y Pan, A White, M White
arXiv preprint arXiv:1810.09103, 2018
Cited by: 1; Year: 2018
Maximizing Information Gain in Partially Observable Environments via Prediction Rewards
S Lim, Y Satsangi, S Whiteson, FA Oliehoek, M White
2020
GREEDY ACTOR-CRITIC: A NEW CONDITIONAL CROSS-ENTROPY METHOD FOR POLICY IMPROVEMENT
S Neumann, S Lim, A Joseph, Y Pan, A White, M White