MeetingBank: A Benchmark Dataset for Meeting Summarization Y Hu, T Ganter, H Deilamsalehy, F Dernoncourt, H Foroosh, F Liu Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 14 | 2023 |
InFoBench: Evaluating Instruction Following Ability in Large Language Models Y Qin, K Song, Y Hu, W Yao, S Cho, X Wang, X Wu, F Liu, P Liu, D Yu arXiv preprint arXiv:2401.03601, 2024 | 5 | 2024 |
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 Y Hu, K Song, S Cho, X Wang, H Foroosh, F Liu Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 4* | 2023 |
Can Large Language Models do Analytical Reasoning? Y Hu, K Song, S Cho, X Wang, H Foroosh, D Yu, F Liu arXiv preprint arXiv:2403.04031, 2024 | | 2024 |
SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs Y Hu, K Song, S Cho, X Wang, H Foroosh, D Yu, F Liu arXiv preprint arXiv:2402.10979, 2024 | | 2024 |