Yihao Feng

Cited by

	All	Since 2019
Citations	758	709
h-index	15	14
i10-index	18	17

260

130

195

2017201820192020202120222023202415 32 32 69 90 123 242 152

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Yihao Feng

Salesforce AI Research

Verified email at salesforce.com - Homepage

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Action-depedent Control Variates for Policy Optimization via Stein's Identity H Liu, Y Feng, Y Mao, D Zhou, J Peng, Q Liu arXiv preprint arXiv:1710.11198, 2017	92	2017
Learning to draw samples with amortized stein variational gradient descent Y Feng, D Wang, Q Liu arXiv preprint arXiv:1707.06626, 2017	79	2017
Doubly robust bias reduction in infinite horizon off-policy estimation Z Tang, Y Feng, L Li, D Zhou, Q Liu ICLR 2020, 2020	73*	2020
A kernel loss for solving the bellman equation Y Feng, L Li, Q Liu Advances in Neural Information Processing Systems 32, 2019	63	2019
Dynamic pricing and information disclosure for fresh produce: An artificial intelligence approach C Yang, Y Feng, A Whinston Production and Operations Management 31 (1), 155-171, 2022	60	2022
Unicontrol: A unified diffusion model for controllable visual generation in the wild C Qin, S Zhang, N Yu, Y Feng, X Yang, Y Zhou, H Wang, JC Niebles, ... arXiv preprint arXiv:2305.11147, 2023	41	2023
Incremental few-shot text classification with multi-round new classes: Formulation, dataset and system C Xia, W Yin, Y Feng, P Yu arXiv preprint arXiv:2104.11882, 2021	41	2021
Accountable off-policy evaluation with kernel bellman statistics Y Feng, T Ren, Z Tang, Q Liu International Conference on Machine Learning, 3102-3111, 2020	38	2020
Bolaa: Benchmarking and orchestrating llm-augmented autonomous agents Z Liu, W Yao, J Zhang, L Xue, S Heinecke, R Murthy, Y Feng, Z Chen, ... arXiv preprint arXiv:2308.05960, 2023	36	2023
Hive: Harnessing human feedback for instructional visual editing S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ... arXiv preprint arXiv:2303.09618, 2023	34	2023
Unsupervised out-of-domain detection via pre-trained transformers K Xu, T Ren, S Zhang, Y Feng, C Xiong arXiv preprint arXiv:2106.00948, 2021	31	2021
Retroformer: Retrospective large language agents with policy gradient optimization W Yao, S Heinecke, JC Niebles, Z Liu, Y Feng, L Xue, R Murthy, Z Chen, ... arXiv preprint arXiv:2308.02151, 2023	27	2023
Two methods for wild variational inference Q Liu, Y Feng arXiv preprint arXiv:1612.00081, 2016	25	2016
Libero: Benchmarking knowledge transfer for lifelong robot learning B Liu, Y Zhu, C Gao, Y Feng, Q Liu, Y Zhu, P Stone Advances in Neural Information Processing Systems 36, 2024	18	2024
ARShop: A Cloud-based Augmented Reality System for Shopping C Wang, Y Feng, AKH Tung, Y Zheng Proceedings of the VLDB Endowment 10 (12), 1845-1848, 2017	18	2017
Fantastic rewards and how to tame them: A case study on reward learning for task-oriented dialogue systems Y Feng, S Yang, S Zhang, J Zhang, C Xiong, M Zhou, H Wang arXiv preprint arXiv:2302.10342, 2023	12	2023
Non-asymptotic confidence intervals of off-policy evaluation: Primal and dual bounds Y Feng, Z Tang, N Zhang, Q Liu arXiv preprint arXiv:2103.05741, 2021	12	2021
A regularized implicit policy for offline reinforcement learning S Yang, Z Wang, H Zheng, Y Feng, M Zhou arXiv preprint arXiv:2202.09673, 2022	10	2022
A unified framework for alternating offline model training and policy learning S Yang, S Zhang, Y Feng, M Zhou Advances in Neural Information Processing Systems 35, 17216-17232, 2022	8	2022
Regularizing a model-based policy stationary distribution to stabilize offline reinforcement learning S Yang, Y Feng, S Zhang, M Zhou International Conference on Machine Learning, 24980-25006, 2022	8	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by