Stephen McAleer

Cited by

	All	Since 2019
Citations	2373	2369
h-index	18	18
i10-index	26	26

860

430

215

645

20192020202120222023202460 181 322 520 844 439

Public access

View all

15 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Pierre BaldiProfessor, University of California, IrvineVerified email at ics.uci.edu
Yaodong YangBOYA (博雅) Assistant Professor at Peking UniversityVerified email at pku.edu.cn
Roy FoxAssistant Professor, UC IrvineVerified email at uci.edu
JB LanierUC IrvineVerified email at uci.edu
Alexander ShmakovUniversity of California IrvineVerified email at uci.edu
Forest AgostinelliAssistant Professor at the University of South CarolinaVerified email at cse.sc.edu
Jun WangProfessor, Computer Science, University College LondonVerified email at cs.ucl.ac.uk
Oliver SlumbersUniversity College LondonVerified email at ucl.ac.uk
Tuomas SandholmAngel Jordan University Professor of Computer Science, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Kevin A. WangBrown UniversityVerified email at kevinwang.us
Gabriele FarinaMassachusetts Institute of TechnologyVerified email at mit.edu
Shauharda KhadkaSenior Data & Applied Scientist at MicrosoftVerified email at microsoft.com
Somdeb MajumdarIntel CorpVerified email at intel.com
Kagan TumerOregon State UniversityVerified email at oregonstate.edu
Marc LanctotResearch Scientist, Google DeepMindVerified email at google.com
Ioannis PanageasAssistant Professor, University of California, IrvineVerified email at ics.uci.edu
Pieter AbbeelUC Berkeley | CovariantVerified email at cs.berkeley.edu
Alexander IhlerUniversity of California, IrvineVerified email at ics.uci.edu
Michael DennisGoogle DeepMindVerified email at cs.berkeley.edu
Karl TuylsResearch Scientist, Google DeepMind and Professor of computer science, University of LiverpoolVerified email at google.com

Stephen McAleer

Postdoc, CMU

Verified email at uci.edu - Homepage

Artificial Intelligence Reinforcement Learning Game Theory Search


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Highly accurate machine fault diagnosis using deep transfer learning S Shao, S McAleer, R Yan, P Baldi IEEE Transactions on Industrial Informatics 15 (4), 2446-2455, 2018	1020	2018
Solving the Rubik’s cube with deep reinforcement learning and search F Agostinelli, S McAleer, A Shmakov*, P Baldi Nature Machine Intelligence 1 (8), 356-363, 2019	207	2019
Mastering the game of stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	139	2022
Language Models can Solve Computer Tasks G Kim, P Baldi, S McAleer Neural Information Processing Systems (NeurIPS), 2023	132	2023
Solving the Rubik's Cube with Approximate Policy Iteration S McAleer, F Agostinelli, A Shmakov*, P Baldi International Conference on Learning Representations (ICLR), 2018	92*	2018
Pipeline PSRO: A scalable approach for finding approximate nash equilibria in large games S McAleer, J Lanier, R Fox, P Baldi 34th Conference on Neural Information Processing Systems (NeurIPS), 2020	69	2020
Llemma: An Open Language Model for Mathematics Z Azerbayev, H Schoelkopf, K Paster, M Dos Santos, S McAleer, AQ Jiang, ... International Conference on Learning Representations (ICLR), 2023	64	2023
Evolutionary reinforcement learning for sample-efficient multiagent coordination S Majumdar, S Khadka, S Miret, S McAleer, K Tumer International Conference on Machine Learning (ICML), 2020	61	2020
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ... 36th Conference on Neural Information Processing Systems (NeurIPS 2022 …, 2022	57	2022
AI Alignment: A Comprehensive Survey J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ... arXiv preprint arXiv:2310.19852, 2023	54	2023
XDO: A double oracle algorithm for extensive-form games S McAleer, J Lanier, P Baldi, R Fox Advances in Neural Information Processing Systems (NeurIPS), 2021	50	2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games R Fox, S McAleer, W Overman, I Panageas AISTATS 2022, 2021	45	2021
Neural auto-curricula in two-player zero-sum games X Feng, O Slumbers, Z Wan, B Liu, S McAleer, Y Wen, J Wang, Y Yang Advances in Neural Information Processing Systems (NeurIPS), 2021	44*	2021
Online Double Oracle LC Dinh, Y Yang, S McAleer, NP Nieves, O Slumbers, Z Tian, DH Mguni, ... Transactions on Machine Learning Research, 2021	29	2021
Deep-learning-based reconstruction of the neutrino direction and energy for in-ice radio detectors C Glaser, S McAleer, S Stjärnholm, P Baldi, SW Barwick Astroparticle Physics 145, 102781, 2023	26*	2023
White Paper: ARIANNA-200 high energy neutrino telescope A Anker, P Baldi, SW Barwick, D Bergman, H Bernhoff, DZ Besson, ... arXiv preprint arXiv:2004.09841, 2020	25	2020
Curiosity-Driven Multi-Criteria Hindsight Experience Replay J Lanier, S McAleer, P Baldi NeurIPS 2019 Deep RL Workshop, 2019	21	2019
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games S McAleer, JB Lanier, K Wang, P Baldi, R Fox, T Sandholm International Conference on Learning Representations (ICLR), 2022	18*	2022
A* search without expansions: Learning heuristic functions with deep q-networks F Agostinelli, A Shmakov, S McAleer, R Fox, P Baldi arXiv preprint arXiv:2102.04518, 2021	18	2021
Reducing variance in temporal-difference value estimation via ensemble of deep networks L Liang, Y Xu, S McAleer, D Hu, A Ihler, P Abbeel, R Fox International Conference on Machine Learning (ICML), 2022	17*	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors