Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024 | 1592 | 2024 |
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1170 | 2023 |
The flan collection: Designing data and methods for effective instruction tuning S Longpre, L Hou, T Vu, A Webson, HW Chung, Y Tay, D Zhou, QV Le, ... ICML 2023, 2023 | 337 | 2023 |
Question rewriting for conversational question answering S Vakulenko, S Longpre, Z Tu, R Anantha WSDM 2021, 355-363, 2021 | 142 | 2021 |
Open-domain question answering goes conversational via question rewriting R Anantha, S Vakulenko, Z Tu, S Longpre, S Pulman, S Chappidi NAACL 2021, 2020 | 130 | 2020 |
Entity-based knowledge conflicts in question answering S Longpre, K Perisetla, A Chen, N Ramesh, C DuBois, S Singh EMNLP 2021, 2021 | 124 | 2021 |
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering S Longpre, Y Lu, J Daiber TACL 2021, Vol 9, 2020 | 108 | 2020 |
The bigscience roots corpus: A 1.6 tb composite multilingual dataset H Laurençon, L Saulnier, T Wang, C Akiki, A Villanova del Moral, ... Advances in Neural Information Processing Systems 35, 31809-31826, 2022 | 104 | 2022 |
H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, and Jason Wei. 2022. Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, E Li, X Wang, ... arXiv preprint arXiv:2210.11416, 2022 | 101 | 2022 |
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers? S Longpre, Y Wang, C DuBois Findings of the Association for Computational Linguistics: EMNLP 2020, 2020 | 85 | 2020 |
You reap what you sow: On the challenges of bias evaluation under multilingual settings Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Proceedings of BigScience Episode# 5--Workshop on Challenges & Perspectives …, 2022 | 66 | 2022 |
Huai hsin Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, E Li, X Wang, ... Le, and Jason Wei. Scaling instruction-finetuned language models. ArXiv, abs …, 2022 | 63 | 2022 |
Octopack: Instruction tuning code large language models N Muennighoff, Q Liu, A Zebaze, Q Zheng, B Hui, TY Zhuo, S Singh, ... arXiv preprint arXiv:2308.07124, 2023 | 58 | 2023 |
H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, and Jason Wei. Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... arXiv preprint arXiv:2210.11416 6 (7), 2022 | 48 | 2022 |
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering S Longpre, Y Lu, Z Tu, C DuBois Proceedings of the 2nd Workshop on Machine Reading for Question Answering …, 2019 | 48 | 2019 |
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity S Longpre, G Yauney, E Reif, K Lee, A Roberts, B Zoph, D Zhou, J Wei, ... arXiv preprint arXiv:2305.13169, 2023 | 43 | 2023 |
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP A Chen, P Gudipati, S Longpre, X Ling, S Singh ACL 2021, 2021 | 35 | 2021 |
A comparison of question rewriting methods for conversational passage retrieval S Vakulenko, N Voskarides, Z Tu, S Longpre Proceedings of the 43rd European Conference on IR Research, ECIR 2021, 2021 | 32 | 2021 |
Prometheus: Inducing fine-grained evaluation capability in language models S Kim, J Shin, Y Cho, J Jang, S Longpre, H Lee, S Yun, S Shin, S Kim, ... arXiv preprint arXiv:2310.08491, 2023 | 27 | 2023 |
How big data confers market power to big tech: Leveraging the perspective of data science C Santesteban, S Longpre The Antitrust Bulletin 65 (3), 459-485, 2020 | 26 | 2020 |