Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 775 | 2023 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 443 | 2023 |
Model-based value estimation for efficient model-free reinforcement learning V Feinberg, A Wan, I Stoica, MI Jordan, JE Gonzalez, S Levine arXiv preprint arXiv:1803.00101, 2018 | 380 | 2018 |
On the computational inefficiency of large batch sizes for stochastic gradient descent N Golmant, N Vemuri, Z Yao, V Feinberg, A Gholami, K Rothauge, ... arXiv preprint arXiv:1811.12941, 2018 | 79 | 2018 |
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 28 | 2024 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 6 | 2024 |
Sketchy: Memory-efficient Adaptive Regularization with Frequent Directions V Feinberg, X Chen, YJ Sun, R Anil, E Hazan Conference on Neural Information Processing Systems 2023, 2023 | 4 | 2023 |
Fishy: Layerwise Fisher Approximation for Higher-order Neural Network Optimization A Peirson, E Amid, Y Chen, V Feinberg, MK Warmuth, R Anil Has it Trained Yet? NeurIPS 2022 Workshop, 2022 | 1 | 2022 |
Large linear multi-output gaussian process learning V Feinberg, LF Cheng, K Li, BE Engelhardt arXiv preprint arXiv:1705.10813, 2017 | 1 | 2017 |
Chromatic Learning for Sparse Datasets V Feinberg, P Bailis arXiv preprint arXiv:2006.03779, 2020 | | 2020 |
Performance Optimization for CNNs on Modern Intel CPUs V Feinberg | | 2015 |