Yu Bai

Cited by

	All	Since 2019
Citations	2304	2242
h-index	23	23
i10-index	37	37

740

370

185

555

2017201820192020202120222023202417 45 86 166 358 491 739 399

Public access

View all

13 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Song MeiAssistant Professor at UC BerkeleyVerified email at berkeley.edu
Huan WangSalesforce ResearchVerified email at yale.edu
Caiming XiongSalesforce ResearchVerified email at salesforce.com
Chi JinAssistant Professor, Princeton UniversityVerified email at princeton.edu
Yu-Xiang WangAssociate Professor of Computer Science, UC Santa BarbaraVerified email at cs.ucsb.edu
Tiancheng YuTwo SigmaVerified email at mit.edu
Nan JiangAssistant Professor of Computer Science, UIUCVerified email at illinois.edu
Jason D. LeeAssociate Professor of Electrical Engineering and Computer Science, Princeton UniversityVerified email at princeton.edu
Tengyang XieUniversity of Wisconsin-Madison, Microsoft ResearchVerified email at cs.wisc.edu
Andrea MontanariProfessor of Statistics and Mathematics, Stanford UniversityVerified email at stanford.edu
Minshuo ChenPrinceton UniversityVerified email at princeton.edu
Qinghua LiuPrinceton UniversityVerified email at princeton.edu
Fan ChenMassachusetts Institute of TechnologyVerified email at mit.edu
Ming YinPrinceton UniversityVerified email at princeton.edu
Ziang SongStanford UniversityVerified email at stanford.edu
Tuo ZhaoAssistant Professor, Georgia TechVerified email at gatech.edu
Sham M KakadeHarvard UniversityVerified email at seas.harvard.edu
Edo Libertypinecone.ioVerified email at edoliberty.com
Andrej RisteskiCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Tengyu MAStanford UniversityVerified email at stanford.edu

Yu Bai

Research Scientist, Salesforce Research

Verified email at salesforce.com - Homepage

Machine Learning Statistics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The landscape of empirical risk for nonconvex losses S Mei, Y Bai, A Montanari The Annals of Statistics 46 (6A), 2747-2774, 2018	347	2018
Provable self-play algorithms for competitive reinforcement learning Y Bai, C Jin International conference on machine learning, 551-560, 2020	163	2020
Policy finetuning: Bridging sample-efficient offline and online reinforcement learning T Xie, N Jiang, H Wang, C Xiong, Y Bai Advances in neural information processing systems 34, 27395-27407, 2021	142	2021
A sharp analysis of model-based reinforcement learning with self-play Q Liu, T Yu, Y Bai, C Jin International Conference on Machine Learning, 7001-7010, 2021	137	2021
Near-Optimal Reinforcement Learning with Self-Play Y Bai, C Jin, T Yu Advances in Neural Information Processing Systems, 2020, 2020	131	2020
Proxquant: Quantized neural networks via proximal operators Y Bai, YX Wang, E Liberty International Conference on Learning Representations (ICLR) 2019, 2018	120	2018
Beyond linearization: On quadratic and higher-order approximation of wide neural networks Y Bai, JD Lee International Conference on Learning Representations (ICLR) 2020, 2019	116	2019
Provably Efficient Q-Learning with Low Switching Cost Y Bai, T Xie, N Jiang, YX Wang Advances in Neural Information Processing Systems, 2019, 2019	98	2019
When can we learn general-sum Markov games with a large number of players sample-efficiently? Z Song, S Mei, Y Bai International Conference on Learning Representations (ICLR) 2022, 2021	86	2021
Near-optimal provable uniform convergence in offline policy evaluation for reinforcement learning M Yin, Y Bai, YX Wang International Conference on Artificial Intelligence and Statistics, 1567-1575, 2021	86*	2021
Approximability of discriminators implies diversity in GANs Y Bai, T Ma, A Risteski International Conference on Learning Representations (ICLR) 2019, 2018	85	2018
Near-optimal offline reinforcement learning via double variance reduction M Yin, Y Bai, YX Wang Advances in neural information processing systems 34, 7677-7688, 2021	69	2021
How important is the train-validation split in meta-learning? Y Bai, M Chen, P Zhou, T Zhao, J Lee, S Kakade, H Wang, C Xiong International Conference on Machine Learning, 543-553, 2021	69	2021
Sample-efficient learning of Stackelberg equilibria in general-sum games Y Bai, C Jin, H Wang, C Xiong Advances in Neural Information Processing Systems 34, 25799-25811, 2021	65	2021
Transformers as statisticians: Provable in-context learning with in-context algorithm selection Y Bai, F Chen, H Wang, C Xiong, S Mei Advances in neural information processing systems 36, 2024	62	2024
Subgradient descent learns orthogonal dictionaries Y Bai, Q Jiang, J Sun International Conference on Learning Representations (ICLR) 2019, 2018	58	2018
Towards understanding hierarchical learning: Benefits of neural representations M Chen, Y Bai, JD Lee, T Zhao, H Wang, C Xiong, R Socher Advances in Neural Information Processing Systems, 2020, 2020	50	2020
The role of coverage in online reinforcement learning T Xie, DJ Foster, Y Bai, N Jiang, SM Kakade arXiv preprint arXiv:2210.04157, 2022	44	2022
Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification Y Bai, S Mei, H Wang, C Xiong International Conference on Machine Learning, 566-576, 2021	41	2021
Unified algorithms for rl with decision-estimation coefficients: No-regret, pac, and reward-free learning F Chen, S Mei, Y Bai arXiv preprint arXiv:2209.11745, 2022	30	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors