Dhawal Gupta

Cited by

	All	Since 2019
Citations	157	157
h-index	7	7
i10-index	5	5

Citations per year: 2020: 7, 2021: 20, 2022: 43, 2023: 47, 2024: 40

Public access

2 articles

1 article

available

not available

Based on funding mandates

Co-authors

Dr. Pushpak Bhattacharyya: Professor of Computer Science and Engineering, IIT Bombay (verified email at cse.iitb.ac.in)
Tulika Saha: Lecturer/Assistant Professor, Dept. of Computer Science, University of Liverpool, United Kingdom (verified email at liverpool.ac.uk)
Dr. Sriparna Saha: Associate Professor, Department of Computer Science and Engineering, Indian Institute of Technology (verified email at iitp.ac.in)
Philip Thomas: University of Massachusetts Amherst (verified email at cs.umass.edu)
Martha White: University of Alberta (verified email at ualberta.ca)
Andrew Patterson: University of Alberta (verified email at ualberta.ca)
Sina Ghiassian: Research Scientist, Spotify (verified email at ualberta.ca)
Scott M. Jordan: Postdoctoral Fellow, University of Alberta (verified email at ualberta.ca)
Yinlam Chow: Research Scientist, Google Research (verified email at google.com)
James Kostas: PhD Student, University of Massachusetts Amherst (verified email at umass.edu)
Yash Chandak: Postdoctoral Scholar, Stanford University (verified email at stanford.edu)
Matthew Kyle Schlegel: University of Alberta (verified email at ualberta.ca)

Dhawal Gupta

Graduate Student, University of Massachusetts, Amherst

Verified email at umass.edu

Reinforcement Learning, Machine Learning, Robotics, Optimal Control


Title	Authors	Venue	Cited by	Year
Gradient Temporal-Difference Learning with Regularized Corrections	S Ghiassian, A Patterson, S Garg, D Gupta, A White, M White	International Conference on Machine Learning, 3524-3534, 2020	49	2020
Emotion Aided Dialogue Act Classification for Task-Independent Conversations in a Multi-modal Framework	T Saha, D Gupta, S Saha, P Bhattacharyya	Cognitive Computation, 1-13, 2020	25	2020
Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning	T Saha, D Gupta, S Saha, P Bhattacharyya	Expert Systems with Applications 162, 113650, 2020	19	2020
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF	S Sun, D Gupta, M Iyyer	arXiv preprint arXiv:2309.09055, 2023	13	2023
A Mixture-of-Expert Approach to RL-based Dialogue Management	Y Chow, A Tulepbergenov, O Nachum, MK Ryu, M Ghavamzadeh, ...	arXiv preprint arXiv:2206.00059, 2022	11	2022
A hierarchical approach for efficient multi-intent dialogue policy learning	T Saha, D Gupta, S Saha, P Bhattacharyya	Multimedia Tools and Applications, 1-26, 2020	9	2020
Reinforcement Learning Based Dialogue Management Strategy	T Saha, D Gupta, S Saha, P Bhattacharyya	International Conference on Neural Information Processing, 359-372, 2018	9	2018
Structural Credit Assignment in Neural Networks using Reinforcement Learning	D Gupta, G Mihucz, MK Schlegel, JE Kostas, PS Thomas, M White	Thirty-Fifth Conference on Neural Information Processing Systems, 2021	7	2021
Behavior Alignment via Reward Function Optimization	D Gupta, Y Chandak, SM Jordan, PS Thomas, BC da Silva	arXiv preprint arXiv:2310.19007, 2023	6	2023
A unified dialogue management strategy for multi-intent dialogue conversations in multiple languages	T Saha, D Gupta, S Saha, P Bhattacharyya	Transactions on Asian and Low-Resource Language Information Processing 20 (6 …, 2021	4	2021
Bayesian Optimization Based Terrestrial Gait Tuning for a 12-DOF Alligator-Inspired Robot With Active Body Undulation	K Agrawal, K Jain, D Gupta, R Srivastav, A Agnihotri, A Thakur	ASME 2018 International Design Engineering Technical Conferences and …, 2018	4	2018
Coagent Networks: Generalized and Scaled	JE Kostas, SM Jordan, Y Chandak, G Theocharous, D Gupta, M White, ...	arXiv preprint arXiv:2305.09838, 2023	1	2023
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data	K Choudhary, D Gupta, PS Thomas	arXiv preprint arXiv:2406.05646, 2024		2024
From Past to Future: Rethinking Eligibility Traces	D Gupta, SM Jordan, S Chaudhari, B Liu, PS Thomas, BC da Silva	arXiv preprint arXiv:2312.12972, 2023		2023
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management	D Gupta, Y Chow, M Ghavamzadeh, C Boutilier	arXiv preprint arXiv:2302.10850, 2023		2023
Applicability of Momentum in the Methods of Temporal Learning	D Gupta			2020
A Generic Dialogue Manager using Reinforcement Learning in a Multilingual Multi-intent Multi-domain Setting	D Gupta			2019
Utility of accelerated temporal difference methods over gradient based optimizers	D Gupta			
Investigating the Utility of Off-Policy Data in PPO Algorithm	Y Yuan, D Gupta			
