Han Shen
Title
Cited by
Year
Towards understanding asynchronous advantage actor-critic: Convergence and linear speedup
H Shen, K Zhang, M Hong, T Chen
IEEE Transactions on Signal Processing, 2023
Cited by 27*, 2023
Adaptive temporal difference learning with linear function approximation
T Sun, H Shen, T Chen, D Li
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (12), 8812 …, 2021
Cited by 27, 2021
Mitigating gradient bias in multi-objective learning: A provably convergent approach
HD Fernando, H Shen, M Liu, S Chaudhury, K Murugesan, T Chen
The Eleventh International Conference on Learning Representations, 2022
Cited by 26*, 2022
On penalty-based bilevel gradient descent method
H Shen, T Chen
International Conference on Machine Learning, 2023
Cited by 21, 2023
Byzantine-resilient decentralized policy evaluation with linear function approximation
Z Wu, H Shen, T Chen, Q Ling
IEEE Transactions on Signal Processing 69, 3839-3853, 2021
Cited by 21, 2021
Alternating projected SGD for equality-constrained bilevel optimization
Q Xiao, H Shen, W Yin, T Chen
International Conference on Artificial Intelligence and Statistics, 987-1023, 2023
Cited by 17, 2023
A single-timescale analysis for stochastic approximation with multiple coupled sequences
H Shen, T Chen
Advances in Neural Information Processing Systems 35, 17415-17429, 2022
Cited by 10, 2022
Alternating implicit projected SGD and its efficient variants for equality-constrained bilevel optimization
Q Xiao, H Shen, W Yin, T Chen
arXiv preprint arXiv:2211.07096, 2022
Cited by 4, 2022
A Method for Bilevel Optimization with Convex Lower-Level Problem
H Shen, S Paternain, G Liu, R Kompella, T Chen
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
H Shen, Z Yang, T Chen
arXiv preprint arXiv:2402.06886, 2024
2024
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization
AFM Saif, X Cui, H Shen, S Lu, B Kingsbury, T Chen
arXiv preprint arXiv:2401.06980, 2024
2024
Distributed Offline Policy Optimization Over Batch Data
H Shen, S Lu, X Cui, T Chen
International Conference on Artificial Intelligence and Statistics, 4443-4472, 2023
2023