Adithya M Devraj
Title
Cited by
Cited by
Year
Zap Q-learning
AM Devraj, SP Meyn
Proceedings of the 31st International Conference on Neural Information …, 2017
482017
Fastest convergence for Q-learning
AM Devraj, SP Meyn
arXiv preprint arXiv:1707.03770, 2017
302017
Learning techniques for feedback particle filter design
A Radhakrishnan, A Devraj, S Meyn
2016 IEEE 55th Conference on Decision and Control (CDC), 5453-5459, 2016
152016
Explicit mean-square error bounds for monte-carlo and linear stochastic approximation
S Chen, A Devraj, A Busic, S Meyn
International Conference on Artificial Intelligence and Statistics, 4173-4183, 2020
112020
Differential TD learning for value function approximation
AM Devraj, SP Meyn
Decision and Control (CDC), 2016 IEEE 55th Conference on, 6347-6354, 2016
112016
Power allocation in energy harvesting sensors with ARQ: A convex optimization approach
AM Devraj, MK Sharma, CR Murthy
2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP …, 2014
82014
Zap Q-Learning With Nonlinear Function Approximation
S Chen, AM Devraj, F Lu, A Bušić, SP Meyn
arXiv preprint arXiv:1910.05405, 2019
72019
Zap Q-Learning-A User's Guide
AM Devraj, A Bušić, S Meyn
2019 Fifth Indian Control Conference (ICC), 10-15, 2019
62019
Differential temporal difference learning
AM Devraj, I Kontoyiannis, SP Meyn
IEEE Transactions on Automatic Control, 2020
52020
Q-learning with uniformly bounded variance: Large discounting is not a barrier to fast learning
AM Devraj, SP Meyn
arXiv preprint arXiv:2002.10301, 2020
52020
Fundamental design principles for reinforcement learning algorithms
AM Devraj, A Bušic, S Meyn
Handbook on Reinforcement Learning and Control. Springer, 2020
52020
Optimal matrix momentum stochastic approximation and applications to q-learning
AM Devraj, A Bušić, S Meyn
arXiv preprint arXiv:1809.06277, 2018
52018
Model-free primal-dual methods for network optimization with application to real-time optimal power flow
Y Chen, A Bernstein, A Devraj, S Meyn
2020 American Control Conference (ACC), 3140-3147, 2020
42020
Stochastic variance reduced primal dual algorithms for empirical composition optimization
AM Devraj, J Chen
arXiv preprint arXiv:1907.09150, 2019
42019
Reinforcement learning for control of building HVAC systems
NS Raman, AM Devraj, P Barooah, SP Meyn
2020 American Control Conference (ACC), 2326-2332, 2020
32020
Zap meets momentum: Stochastic approximation algorithms with optimal convergence rate
AM Devraj, A Bušic, S Meyn
arXiv preprint arXiv:1809.06277, 2018
32018
Zap Q-Learning for´ optimal stopping
S Chen, AM Devraj, A Bušić, S Meyn
2020 American Control Conference (ACC), 3920-3925, 2020
12020
Geometric ergodicity in a weighted Sobolev space
A Devraj, I Kontoyiannis, S Meyn
The Annals of Probability 48 (1), 380-403, 2020
12020
Zap Q-Learning for Optimal Stopping Time Problems
S Chen, AM Devraj, A Bušić, SP Meyn
arXiv preprint arXiv:1904.11538, 2019
12019
A Bit Better? Quantifying Information for Bandit Learning
AM Devraj, B Van Roy, K Xu
arXiv preprint arXiv:2102.09488, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–20