Satyapriya Krishna

Cited by

	All	Since 2019
Citations	924	924
h-index	12	12
i10-index	12	12

Citations per year
Year	2021	2022	2023	2024
Citations	25	117	366	408

Public access (based on funding mandates)

4 articles available
0 articles not available

Co-authors

Himabindu Lakkaraju: Assistant Professor, Harvard University. Verified email at seas.harvard.edu
Rahul Gupta: Amazon Alexa. Verified email at amazon.com
Jwala Dhamala: Amazon AGI. Verified email at amazon.com
Chirag Agarwal: Postdoctoral Research Fellow, Harvard. Verified email at hbs.edu
Kai-Wei Chang: Associate Professor, UCLA. Verified email at kwchang.net
Yada Pruksachatkun: New York University. Verified email at nyu.edu
Martin Pawelczyk: Postdoc, Harvard University. Verified email at uni-tuebingen.de
Nari Johnson: Carnegie Mellon University. Verified email at andrew.cmu.edu
Sameer Singh: Associate Professor, UC Irvine. Verified email at uci.edu
Marinka Zitnik: Assistant Professor, Harvard University. Verified email at hms.harvard.edu
Eshika Saxena: Meta (FAIR), previously at Harvard University. Verified email at meta.com
Varun Kumar: AWS AI Labs. Verified email at umd.edu
Dylan Slack: Google. Verified email at google.com
Tessa Han: Harvard University. Verified email at g.harvard.edu
Alex Gu: MIT. Verified email at mit.edu
Isha Puri: PhD Student - AI/NLP@MIT, NSF Fellow, MIT Great Educators Fellow. Verified email at mit.edu
Tony Sun: Stanford University. Verified email at stanford.edu
Apurv Verma: NJIT, Bloomberg, Amazon, Georgia Institute of Technology. Verified email at njit.edu
Shahin Jabbari: Drexel University. Verified email at drexel.edu
Zhiwei Steven Wu: Carnegie Mellon University. Verified email at andrew.cmu.edu

Satyapriya Krishna

Harvard University

Verified email at g.harvard.edu

Trustworthy AI, Large Language Models, Explainable & Fair ML


Title	Authors	Venue	Cited by	Year
BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation	J Dhamala, T Sun, V Kumar, S Krishna, Y Pruksachatkun, KW Chang, ...	ACM FAccT Conference 2021, 2021	264	2021
The Disagreement Problem in Explainable Machine Learning: A Practitioner's Perspective	S Krishna, T Han, A Gu, J Pombra, S Jabbari, S Wu, H Lakkaraju	Transactions on Machine Learning Research, 2024, 2024	188	2024
OpenXAI: Towards a transparent evaluation of model explanations	C Agarwal, S Krishna, E Saxena, M Pawelczyk, N Johnson, I Puri, M Zitnik, ...	Advances in neural information processing systems 35, 15784-15799, 2022	129	2022
Explaining machine learning models with interactive natural language conversations using TalkToModel	D Slack, S Krishna, H Lakkaraju, S Singh	Nature Machine Intelligence, 1-11, 2023	65*	2023
Rethinking Stability for Attribution-based Explanations	C Agarwal, N Johnson, M Pawelczyk, S Krishna, E Saxena, M Zitnik, ...	ICLR 2022 Workshop on PAIR^2Struct: Privacy, Accountability …, 2022	42	2022
ADePT: Auto-encoder based differentially private text transformation	S Krishna, R Gupta, C Dupuy	Proceedings of the 16th Conference of the European Chapter of the …, 2021	40	2021
Post Hoc Explanations of Language Models Can Improve Language Models	S Krishna, J Ma, D Slack, A Ghandeharioun, S Singh, H Lakkaraju	Advances in Neural Information Processing Systems, 2023 36, 2023	39	2023
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal	U Gupta, J Dhamala, V Kumar, A Verma, Y Pruksachatkun, S Krishna, ...	Findings of the Association for Computational Linguistics: ACL 2022, 2022	36	2022
Does Robustness Improve Fairness? Approaching Fairness with Word Substitution Robustness Methods for Text Classification	Y Pruksachatkun, S Krishna, J Dhamala, R Gupta, KW Chang	Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021	31	2021
Black-Box Access is Insufficient for Rigorous AI Audits	S Casper, C Ezell, C Siegmann, N Kolt, TL Curtis, B Bucknall, A Haupt, ...	ACM FAccT Conference 2024, 2024	21	2024
Are Large Language Models Post Hoc Explainers?	N Kroeger, D Ley, S Krishna, C Agarwal, H Lakkaraju	arXiv preprint arXiv:2310.05797, 2023	17	2023
Eagle and Finch: RWKV with matrix-valued states and dynamic recurrence	B Peng, D Goldstein, Q Anthony, A Albalak, E Alcaide, S Biderman, ...	arXiv preprint arXiv:2404.05892, 2024	15	2024
Towards Bridging the Gaps between the Right to Explanation and the Right to be Forgotten	S Krishna, J Ma, H Lakkaraju	The Fortieth International Conference on Machine Learning (ICML), 2023, 2023	8	2023
Measuring Fairness of Text Classifiers via Prediction Sensitivity	S Krishna, R Gupta, A Verma, J Dhamala, Y Pruksachatkun, KW Chang	Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	8	2022
On the Intersection of Self-Correction and Trust in Language Models	S Krishna	arXiv preprint arXiv:2311.02801, 2023	5	2023
Towards Realistic Single-Task Continuous Learning Research for NER	J Payan, Y Merhav, H Xie, S Krishna, A Ramakrishna, M Sridhar, R Gupta	Findings of the Association for Computational Linguistics: EMNLP 2021, 2021	5	2021
On the Trade-offs between Adversarial Robustness and Actionable Explanations	S Krishna, C Agarwal, H Lakkaraju	arXiv preprint arXiv:2309.16452, 2023	3*	2023
Finetext: text classification via attention-based language model fine-tuning	Y Tao, S Gupta, S Krishna, X Zhou, O Majumder, V Khare	Amazon Machine Learning Conference (AMLC) 2020, 2019	3	2019
Understanding the Effects of Iterative Prompting on Truthfulness	S Krishna, C Agarwal, H Lakkaraju	Forty-first International Conference on Machine Learning, 2024, 2024	2	2024
Towards classification parity across cohorts	A Patel, R Gupta, M Harakere, S Krishna, A Alok, P Liu	ML-IRL Workshop at ICLR 2020, 2020	2	2020
