| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| Gemini: a family of highly capable multimodal models | G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... | arXiv preprint arXiv:2312.11805, 2023 | 1488 | 2023 |
| Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... | arXiv preprint arXiv:2403.05530, 2024 | 358 | 2024 |
| Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains | Y Pan, Z Abbas, A White, A Patterson, M White | IJCAI'18, 2018 | 56 | 2018 |
| Loss of plasticity in continual deep reinforcement learning | Z Abbas, R Zhao, J Modayil, A White, MC Machado | Conference on Lifelong Learning Agents, 620-636, 2023 | 52 | 2023 |
| General value function networks | M Schlegel, A Jacobsen, Z Abbas, A Patterson, A White, M White | arXiv preprint arXiv:1807.06763, 2018 | 44 | 2018 |
| Selective Dyna-style Planning Under Limited Model Capacity | Z Abbas, S Sokota, EJ Talvitie, M White | ICML'20, 2020 | 39 | 2020 |
| Planning with expectation models | Y Wan, Z Abbas, A White, M White, RS Sutton | IJCAI'19, 2019 | 29 | 2019 |
| Many-Shot In-Context Learning | R Agarwal, A Singh, LM Zhang, B Bohnet, S Chan, A Anand, Z Abbas, ... | arXiv preprint arXiv:2404.11018, 2024 | 24 | 2024 |
| Investigating the properties of neural network representations in reinforcement learning | H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ... | Artificial Intelligence 330, 104100, 2024 | 22 | 2024 |
| From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning | B Rafiee, Z Abbas, S Ghiassian, R Kumaraswamy, R Sutton, E Ludvig, ... | arXiv preprint arXiv:2011.04590, 2020 | 8 | 2020 |
| Model-based reinforcement learning with non-linear expectation models and stochastic environments | Y Wan, Z Abbas, M White, RS Sutton | FAIM Workshop on Prediction and Generative Modeling in Reinforcement …, 2018 | 6 | 2018 |
| Selective Dyna-style Planning Using Neural Network Models with Limited Capacity | Z Abbas | | 2* | 2020 |
| Towards model-free RL algorithms that scale well with unstructured data | J Modayil, Z Abbas | arXiv preprint arXiv:2311.02215, 2023 | 1 | 2023 |
| Incrementally Learning Functions of the Return | B Bennett, W Chung, Z Abbas, V Liu | arXiv preprint arXiv:1907.04651, 2019 | 1 | 2019 |