Maarten Sap (he/him)

Email Icon msap2@andrew.cmu.edu   Google Scholar Profile https://scholar.google.com/citations?user=gFN4QUYAAAAJ

Positions

Carnegie Mellon University: School of Computer Science
Assistant Professor - Language Technologies Institute 2022 – present
Affiliated Faculty - Human Compuer Interaction Institute 2024 – present
Allen Institute for AI
Visiting Research Scientist 2022 – present
Postdoctoral Researcher / Young Investigator 2021 – 2022
Research Intern 2018 – 2019
Microsoft Research
Research Intern 2019

Education

University of Washington: Paul G. Allen School of Computer Science & Engineering 2015 – 2022
PhD in Computer Science & Engineering, research focus on Natural Language Processing
advised by Yejin Choi & Noah Smith
Thesis: Positive AI with Social Commonsense Models
École Polytechnique Fédérale de Lausanne: School of Computer and Communication Sciences 2010 – 2014
BS in Communications and Information Systems

Advising

PhD & MLT students

Dan Chechelnitsky he/him (co-advised with Chrysoula Zerva) LTI PhD 09/2024–present
Mingqian Zheng she/her (co-advised with Carolyn Rosé) LTI PhD 09/2024–present
Jocelyn Shen she/her (co-advised with Cynthia Breazeal) MIT Media Lab 11/2023–present
Joel Mire he/him LTI MLT 09/2023–present
Karina Halevy she/her (co-advised with Mona Diab) LTI PhD 09/2023–present
Jimin Mun she/her LTI PhD 09/2022–present
Akhila Yerukola she/her LTI PhD 09/2022–present
Xuhui Zhou he/him LTI PhD 09/2022–present

Research Interns & Research Masters

Zhe Su he/him CMU MSML 09/2023–present
Kaitlyn Zhou she/her AI2 Research Intern 06/2023–present
Ashutosh Baheti he/him AI2 Research Intern 09/2022–07-2024
Yiming Zhang he/him (co-advised with Sherry Tongshuang Wu) UChicago MS 09/2022–09/2023
Athiya Deviyani she/her LTI MSAII 09/2022–09/2023
Julia Mendelsohn she/her AI2 Research Intern 06/2022–01/2023
Sebastin Santy he/him AI2 Research Intern 06/2022–01/2023

Undergraduates & Professional Masters

Neel Bhandari he/him CMU LTI MIIS 08/2024–present
Tiya Cao she/her CMU LTI MIIS 08/2024–present
Jenna Godsey she/her CMU BS 08/2024–present
Bruno Neira he/him CMU BS 08/2024–present
Kshitish Ghate he/him (co-advised with Mona Diab) CMU LTI MLT 08/2024–present
Sophie Feng she/her CMU BS 08/2024–present
Wenkai Li he/him (co-advised with Mona Diab) CMU LTI MIIS 02/2024–present
Devansh Jain he/him CMU LTI MIIS 09/2023–present
Priyanshu Kumar he/him CMU LTI MIIS 09/2023–present
Liwen Sun he/him CMU LTI MIIS 08/2024–12/2024
Zhenxiang Guan he/him CMU LTI MIIS 08/2024–12/2024
Jiarui Liu he/him (co-advised with Mona Diab) CMU LTI MLT 02/2024–09/2024
Abhinav Rao he/him CMU LTI MIIS 09/2023–07/2024
Vishwa Shah she/her CMU LTI MIIS 09/2023–07/2024
Sanketh Rangreji he/him CMU LTI MIIS 09/2023–12/2023
Anubha Kabra she/her CMU LTI MIIS 09/2023–12/2023
Sravani Nanduri she/her (co-advised with Liwei Jiang) UW CSE BS 09/2021–10/2022
Skyler Hallinan he/him UW CSE BS 01/2021–08/2022
Zhilin Wang he/him UW CLMS 01/2021–09/2021
Michelle Ma she/her (co-advised with Hannah Rashkin) UW CSE BS 09/2019–12/2020
Sam Gehman he/him UW CSE MS 09/2019–07/2020
Aishwarya Nirmal she/her UW CSE MS 01/2018–06/2019
Kenta Takatsu he/him Cornell BS 07/2018–03/2019
Zachary Horvitz he/him (co-advised with Antoine Bosselut) AI2 Research Intern 07/2018–03/2019
Sarah Yu she/her UW CSE BS 03/2018–06/2018
Lanhao Wu he/him (co-advised with Saadia Gabriel) UW CSE BS 03/2018–06/2018
Boyan Li he/him (co-advised with Saadia Gabriel) UW CSE BS 01/2018–06/2018
Amy Shah she/her (co-advised with Elizabeth Clark) UW CSE BS 09/2017–06/2018
Emily Allaway she/her (co-advised with Hannah Rashkin) UW CSE BS 07/2017–06/2018
Marcela Cindy Prasetio she/her (co-advised with Hannah Rashkin) UW CSE BS 01/2016–06/2017

Publications

Journal

  1. Liwei Jiang, Jena D Hwang, Chandra Bhagavatula, Ronan Le Bras, Jenny Liang, Jesse Dodge, Keisuke Sakaguchi, Maxwell Forbes, Jon Borchardt, Saadia Gabriel, Yulia Tsvetkov, Oren Etzioni, Maarten Sap, Regina Rini & Yejin Choi (2025) An Empirical Investigation of Machines’ Capabilities for Moral Judgment with the Delphi Experiment. Nature Machine Intelligence.
  2. Jocelyn Shen, Daniella DiPaola, Safinah Ali, Maarten Sap, Hae Won Park & Cynthia Breazeal (2024) Empathy Towards AI vs Human Experiences: The Role of Transparency in Mental Health and Social Support Chatbot Design. JMIR Mental Health.
  3. Maarten Sap, Anna Jafarpour, Yejin Choi, Noah A. Smith, James W. Pennebaker & Eric Horvitz (2022) Quantifying the narrative flow of imagined versus autobiographical stories. PNAS.
  4. Gregory Park, H Andrew Schwartz, Maarten Sap, Margaret L Kern, Evan Weingarten, Johannes C Eichstaedt, Jonah Berger, David J Stillwell, Michal Kosinski, Lyle H Ungar & Martin E P Seligman (2017) Living in the Past, Present, and Future: Measuring Temporal Orientation with Language. Journal of Personality.
  5. Margaret L Kern, Gregory Park, Johannes C Eichstaedt, H Andrew Schwartz, Maarten Sap, Laura K, Smith & Lyle H Ungar (2016) Gaining Insights From Social Media Language: Methodologies and Challenges. Psychological Methods.
  6. Johannes C Eichstaedt, H Andrew Schwartz, Margaret L Kern, Gregory Park, Darwin R Labarthe, Raina M Merchant, Sneha Jha, Megha Agrawal, Lukasz A Dziurzynski, Maarten Sap, Christopher Weeg, Emily Larson, Lyle H Ungar & Martin E P Seligman (2015) Psychological Language on Twitter Predicts County-level Heart Disease Mortality. Psychological Science 26(2). SAGE Publications. 159--169.
  7. Charlene A Wong, Maarten Sap, Hansen Andrew Schwartz, Robert Town, Tom Baker, Lyle Ungar & Raina M Merchant (2015) Twitter Sentiment Predicts Affordable Care Act Marketplace Enrollment. Journal of Medical Internet Research 17(2). JMIR Publications Inc..
  8. Raina M. Merchant, Yoonhee P. Ha, Charlene A. Wong, H. Andrew Schwartz, Maarten Sap, Lyle H. Ungar & David A. Asch (2014) The 2013 US Government Shutdown (#Shutdown) and Health: An Emerging Role for Social Media. American Journal of Public Health 2014. e1--e3.

Conference

  1. Joel Mire*, Zubin Trivadi Aysola*, Daniel Chechelnitsky, Nicholas Deas, Chrysoula Zerva & Maarten Sap (2025) Rejected Dialects: Biases Against African American Language in Reward Models. Findings of NAACL.
  2. Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Nouha Dziri, Dan Jurafsky & Maarten Sap (2025) Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance. NAACL.
  3. Zhe Su, Xuhui Zhou, Sanketh Rangreji, Anubha Kabra, Julia Mendelsohn, Faeze Brahman & Maarten Sap (2025) AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents. NAACL.
  4. Abhinav Rao*, Akhila Yerukola*, Vishwa Shah, Katharina Reinecke & Maarten Sap (2025) NormAd: A Benchmark for Measuring the Cultural Adaptability of Large Language Models. NAACL.
  5. Xianzhe Fan, Qing Xiao, Xuhui Zhou, Jiaxin Pei, Maarten Sap, Zhicong Lu & Hong Shen (2025) User-Driven Value Alignment: Understanding Users' Perceptions and Strategies for Addressing Biased and Discriminatory Statements in AI Companions. CHI.
  6. Jiaxin Ge, Zora Zhiruo Wang, Xuhui Zhou, Yi-Hao Peng, Sanjay Subramanian, Qinyue Tan, Maarten Sap, Alane Suhr, Daniel Fried, Graham Neubig & Trevor Darrell (2025) AutoPresent: Designing Structured Visuals from Scratch. CVPR.
  7. Joel Mire, Maria Antoniak, Elliott Ash, Andrew Piper & Maarten Sap (2024) The Empirical Variability of Narrative Perceptions of Social Media Texts. EMNLP.
  8. Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim & Maarten Sap (2024) Is This the Real Life? Is This Just Fantasy? The Misleading Success of Simulating Social Interactions With LLMs. EMNLP.
  9. Jocelyn Shen, Joel Mire, Hae Won Park, Cynthia Breazeal & Maarten Sap (2024) HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs. EMNLP.
  10. Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Nouha Dziri & Yejin Choi (2024) WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models. NeurIPS.
  11. Jimin Mun, Liwei Jiang, Jenny Liang, Inyoung Cheong, Nicole DeCario, Yejin Choi, Tadayoshi Kohno & Maarten Sap (2024) Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits. AIES.
  12. Devansh Jain, Priyanshu Kumar, Samuel Gehman, Xuhui Zhou, Thomas Hartvigsen & Maarten Sap (2024) PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models. COLM.
  13. Maria Antoniak, Joel Mire, Maarten Sap, Elliott Ash & Andrew Piper (2024) Where Do People Tell Stories Online? Story Detection Across Online Communities. ACL.
  14. Akhila Yerukola, Saujas Vadugur, Daniel Fried & Maarten Sap (2024) Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs. ACL.
  15. Kaitlyn Zhou, Jena D Hwang, Xiang Ren & Maarten Sap (2024) Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty. ACL.
  16. Ruiyi Wang, Haofei Yu, Wenxin Zhang, Zhengyang Qi, Maarten Sap, Graham Neubig, Yonatan Bisk & Hao Zhu (2024) SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents. ACL.
  17. Jimin Mun, Cathy Buerger, Jenny T. Liang, Joshua Garland & Maarten Sap (2024) Counterspeakers’ Perspectives: Unveiling Barriers and AI Needs in the Fight against Online Hate. CHI.
  18. Natalie Shapira, Mosh Levy, Hossein Seyed Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap & Vered Shwartz (2024) Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models. EACL.
  19. Xuhui Zhou, Hao Zhu, Leena Mathur, Ruohong Zhang, Haofei Yu, Zhengyang Qi, Louis-Philippe Morency, Yonatan Bisk, Daniel Fried, Graham Neubig & Maarten Sap (2024) SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents. ICLR.
  20. Niloofar Mireshghallah, Hyunwoo Kim, Xuhui Zhou, Yulia Tsvetkov, Maarten Sap, Reza Shokri & Yejin Choi (2024) Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory. ICLR.
  21. Ashutosh Baheti, Ximing Lu, Faeze Brahman, Le Ronan Bras, Maarten Sap & Mark Riedl (2024) Leftover-Lunch: Advantage-based Offline Reinforcement Learning for Language Models. ICLR.
  22. Taylor Sorensen, Liwei Jiang, Jena Hwang, Sydney Levine, Valentina Pyatkin, Peter West, Nouha Dziri, Ximing Lu, Kavel Rao, Chandra Bhagavatula, Maarten Sap, John Tasioulas & Yejin Choi (2024) Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties. AAAI.
  23. Akhila Yerukola, Xuhui Zhou, Elizabeth Clark & Maarten Sap (2023) ``Don't Take This Out of Context!'' On the Need for Contextual Models and Evaluations for Stylistic Rewriting. EMNLP.
  24. Yiming Zhang, Sravani U. Nanduri, Liwei Jiang, Tongshuang Wu & Maarten Sap (2023) BiasX: ``Thinking Slow'' in Toxic Language Annotation with Explanations of Implied Social Biases. EMNLP.
  25. Jocelyn Shen, Maarten Sap, Pedro Colon-Hernandez, Hae Won Park & Cynthia Breazeal (2023) Modeling Empathic Similarity in Personal Narratives. EMNLP.
  26. Jimin Mun, Emily Allaway, Akhila Yerukola, Laura Vianna, Sarah-Jane Leslie & Maarten Sap (2023) Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language. Findings of EMNLP.
  27. Hyunwoo Kim, Melanie Sclar, Xuhui Zhou, Ronan Le Bras, Gunhee Kim, Yejin Choi & Maarten Sap (2023) FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions. EMNLP.
  28. Hyunwoo Kim, Jack Hessel, Liwei Jiang, Peter West, Ximing Lu, Youngjae Yu, Pei Zhou, Ronan Le Bras, Malihe Alikhani, Gunhee Kim, Maarten Sap & Yejin Choi (2023) SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization. EMNLP.
  29. Xuhui Zhou, Hao Zhu, Akhila Yerukola, Thomas Davidson, Jena D. Hwang, Swabha Swayamdipta & Maarten Sap (2023) COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements. Findings of ACL.
  30. Julia Mendelsohn, Ronan Le Bras, Yejin Choi & Maarten Sap (2023) From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models. ACL.
  31. Sebastin Santy*, Jenny T. Liang*, Ronan Le Bras, Katharina Reinecke & Maarten Sap (2023) NLPositionality: Characterizing Design Biases of Datasets and Models. ACL.
  32. Skyler Hallinan, Alisa Liu, Yejin Choi & Maarten Sap (2023) Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts. ACL.
  33. Organizers Of QueerinAI, Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherl, Davide Locatelli, Eva Breznik, Filip Klubicka, Hang Yuan, J Hetvi, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, A Pranav, Raj Korpan, Ruchira Ray, Sarah Mathew, Sarthak Arora, St John, Tanvi An, Vishakha Agrawal, William Agnew, Yanan Long, Zijie J. Wang, Zeerak Talat, Avijit Ghosh, Nathaniel Dennler, Michael Noseworthy, Sharvani Jha, Emily Baylor, Aditya Joshi, Natalia Y. Bilenko, Andrew McNamara, Raphael Gontijo-Lopes, Alex Markham, Evyn Dǒng, Jackie Kay, Manu Saraswat, Nikhil Vytla & Luke Stark (2023) Queer In AI: A Case Study in Community-Led Participatory AI. FAccT.
  34. Hyunwoo Kim, Youngjae Yu, Liwei Jiang, Ximing Lu, Daniel Khashabi, Gunhee Kim, Yejin Choi & Maarten Sap (2022) ProsocialDialog: A Prosocial Backbone for Conversational Agents. EMNLP.
  35. Maarten Sap, Ronan Le Bras, Daniel Fried & Yejin Choi (2022) Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs. EMNLP.
  36. Zhijing Jin, Sydney Levine, Fernando Gonzalez Adauto, Ojasv Kamal, Maarten Sap, Mrinmaya Sachan, Rada Mihalcea, Joshua B. Tenenbaum & Bernhard Schölkopf (2022) When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment. NeurIPS.
  37. Maarten Sap, Swabha Swayamdipta, Laura Vianna, Xuhui Zhou, Yejin Choi & Noah A. Smith (2022) Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection. NAACL.
  38. Prithviraj Ammanabrolu, Liwei Jiang, Maarten Sap, Hanna Hajishirzi, Yejin Choi & Noah A. Smith (2022) Aligning to Social Norms and Values in Interactive Narratives. NAACL.
  39. Thomas Hartvigsen, Saadia Gabriel, Hamid Palangi, Maarten Sap, Dipankar Ray & Ece Kamar (2022) ToxiGen: Controlling Language Models to Generate Implied and Adversarial Toxicity. ACL.
  40. Saadia Gabriel, Skyler Hallinan, Maarten Sap, Pemi Nguyen, Franziska Roesner, Eunsol Choi & Yejin Choi (2022) Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines. ACL.
  41. Jesse Dodge, Maarten Sap, Ana Marasović, William Agnew, Gabriel Ilharco, Dirk Groeneveld, Margaret Mitchell & Matt Gardner (2021) Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus. EMNLP.
  42. Ashutosh Baheti, Maarten Sap, Alan Ritter & Mark Riedl (2021) Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts. EMNLP.
  43. Alisa Liu, Maarten Sap, Ximing Lu, Swabha Swayamdipta, Chandra Bhagavatula, Noah A. Smith & Yejin Choi (2021) DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts. ACL.
  44. Albert Xu, Eshaan Pathak, Eric Wallace, Suchin Gururangan, Maarten Sap & Dan Klein (2021) Detoxifying Language Models Risks Marginalizing Minority Voices. NAACL.
  45. Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Yejin Choi & Noah A. Smith (2021) Challenges in Automated Debiasing for Toxic Language Detection. EACL.
  46. Xinyao Ma*, Maarten Sap*, Hannah Rashkin & Yejin Choi (2020) PowerTransformer: Unsupervised Controllable Revision for Biased Language Correction. EMNLP.
  47. Sam Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi & Noah A Smith (2020) RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models. Findings of EMNLP.
  48. Maxwell Forbes, Jena D. Hwang, Vered Shwartz, Maarten Sap & Yejin Choi (2020) Social Chemistry 101: Learning to Reason about Social and Moral Norms. EMNLP.
  49. Maarten Sap, Eric Horvitz, Yejin Choi, Noah A Smith & James W. Pennebaker (2020) Recollection versus Imagination: Exploring Human Memory and Cognition via Neural Language Models. ACL.
  50. Maarten Sap, Saadia Gabriel, Lianhui Qin, Dan Jurafsky, Noah A Smith & Yejin Choi (2020) Social Bias Frames: Reasoning about Social and Power Implications of Language. ACL.
  51. Maarten Sap*, Hannah Rashkin*, Derek Chen, Ronan LeBras & Yejin Choi (2019) Social IQa: Commonsense Reasoning about Social Interactions. EMNLP.
  52. Maarten Sap, Dallas Card, Saadia Gabriel, Yejin Choi & Noah A Smith (2019) The Risk of Racial Bias in Hate Speech Detection. ACL.
  53. Antoine Bosselut, Hannah Rashkin, Maarten Sap, Chaitanya Malaviya, Asli Celikyilmaz & Yejin Choi (2019) COMET: Commonsense Transformers for Automatic Knowledge Graph Construction. ACL.
  54. Maarten Sap, Ronan LeBras, Emily Allaway, Chandra Bhagavatula, Nicholas Lourie, Hannah Rashkin, Brendan Roof, Noah A Smith & Yejin Choi (2019) ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning. AAAI.
  55. Hannah Rashkin, Antoine Bosselut, Maarten Sap, Kevin Knight & Yejin Choi (2018) Modeling Naive Psychology of Characters in Simple Commonsense Stories. ACL.
  56. Hannah Rashkin*, Maarten Sap*, Emily Allaway, Noah A. Smith & Yejin Choi (2018) Event2Mind: Commonsense Inference on Events, Intents, and Reactions. ACL.
  57. Maarten Sap, Marcella Cindy Prasetio, Ari Holtzman, Hannah Rashkin & Yejin Choi (2017) Connotation Frames of Power and Agency in Modern Films. EMNLP.
  58. Roy Schwartz, Maarten Sap, Ioannis Konstas, Li Zilles, Yejin Choi & Noah A Smith (2017) The Effect of Different Writing Tasks on Linguistic Style: A Case Study of the ROC Story Cloze Task. CoNLL.
  59. H. Andrew Schwartz, Gregory Park, Maarten Sap, Evan Weingarten, Johannes Eichstaedt, Margaret Kern, David Stillwell, Michal Kosinski, Jonah Berger, Martin Seligman & Lyle Ungar (2015) Extracting Human Temporal Orientation from Facebook Language. NAACL.
  60. Maarten Sap, Gregory Park, Johannes C. Eichstaedt, Margaret L. Kern, David J. Stillwell, Michal Kosinski, Lyle H. Ungar & Hansen Andrew Schwartz (2014) Developing Age and Gender Predictive Lexica over Social Media. EMNLP.

Workshop

  1. Emily Allaway, Nina Taneja, Sarah-Jane Leslie & Maarten Sap (2022) Towards Countering Essentialism through Social Bias Reasoning. EMNLP workshop on NLP for Positive Impact.
  2. Zhilin Wang, Anna Jafarpour & Maarten Sap (2022) Uncovering Surprising Event Boundaries in Narratives. Workshop on Narrative Understanding.
  3. Tal August, Maarten Sap, Elizabeth Clark, Katharina Reinecke & Noah A. Smith (2020) Exploring the Effect of Author and Reader Identity in Online Story Writing: the StoriesInTheWild Corpus. Workshop on Narrative Understanding, Storylines, and Events (NUSE)@ ACL.
  4. Roy Schwartz, Maarten Sap, Ioannis Konstas, Li Zilles, Yejin Choi & Noah A Smith (2017) Story Cloze task: UW NLP System. EACL Workshop LSD Sem. 52--55.
  5. Daniel Preotiuc-Pietro, Maarten Sap, H Andrew Schwartz & Lyle Ungar (2015) Mental Illness Detection at the World Well-Being Project for the CLPsych 2015 Shared Task. NAACL Workshop on CLPsych.
  6. Daniel Preotiuc-Pietro, Johannes Eichstaedt, Gregory Park, Maarten Sap, Laura Smith, Victoria Tobolsky, H Andrew Schwartz & Lyle Ungar (2015) The Role of Personality, Age and Gender in Tweeting about Mental Illnesses. NAACL Workshop on CLPsych.
  7. H Andrew Schwartz, Johannes Eichstaedt, Margaret L Kern, Gregory Park, Maarten Sap, David Stillwell, Michal Kosinski & Lyle Ungar (2014) Towards Assessing Changes in Degree of Depression through Facebook. ACL Workshop on CLPsych. 118--125.

Demo

  1. Xuhui Zhou, Zhe Su, Sophie Feng, Jiaxu Zhou, Jen-tse Huang, Svitlana Volkova, Tongshuang Sherry Wu, Anita Woolley, Hao Zhu & Maarten Sap (2025) SOTOPIA-S4: A User-Friendly System for Flexible, Customizable, and Large-Scale Social Simulation. NAACL System Demonstrations.
  2. Maria Antoniak, Anjalie Field, Jimin Mun, Melanie Walsh, Lauren F. Klein & Maarten Sap (2023) Riveter: Measuring Power and Social Dynamics Between Entities. ACL demonstrations.
  3. Hao Fang, Hao Cheng, Maarten Sap, Elizabeth Clark, Ariel Holtzman, Yejin Choi, Noah A Smith & Mari Ostendorf (2018) Sounding Board: A User-Centric and Content-Driven Social Chatbot. NAACL System Demonstrations.
  4. H Andrew Schwartz, Salvatore Giorgi, Maarten Sap, Patrick Crutchley, Lyle Ungar & Johannes Eichstaedt (2017) DLATK: Differential Language Analysis ToolKit. EMNLP System Demonstrations. 55--60.

Other

  1. Maarten Sap (2021) Positive AI with Social Commonsense Models.
  2. Hao Fang, Hao Cheng, Elizabeth Clark, Ariel Holtzman, Maarten Sap, Mari Ostendorf, Yejin Choi & Noah A Smith (2017) Sounding Board - University of Washington’s Alexa Prize Submission. Alexa Prize Proceedings.
  3. H Andrew Schwartz, Maarten Sap, Margaret L Kern, Johannes C Eichstaedt, Adam Kapelner, Megha Agrawal, Eduardo Blanco, Lukasz Dziurzynski, Gregory Park, David Stillwell, Michal Kosinski, Martin E P Seligman & Lyle H Ungar (2016) Predicting individual well-being through the language of social media. Biocomputing 2016: Proceedings of the Pacific Symposium. 516--527.

Preprint

  1. Runtao Zhou, Guangya Wan, Saadia Gabriel, Sheng Li, Alexander J Gates, Maarten Sap & Thomas Hartvigsen (2025) Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English. arXiv.
  2. Taeyoun Kim, Jacob Springer, Aditi Raghunathan & Maarten Sap (2025) Mitigating Bias in RAG: Controlling the Embedder. arXiv.
  3. Akhila Yerukola, Saadia Gabriel, Nanyun Peng & Maarten Sap (2025) Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures. arXiv.
  4. Shuyue Stella Li, Jimin Mun, Faeze Brahman, Jonathan S. Ilgen, Yulia Tsvetkov & Maarten Sap (2025) Aligning LLMs to Ask Good Questions: A Case Study in Clinical Reasoning. arXiv.
  5. Sanidhya Vijayvargiya, Xuhui Zhou, Akhila Yerukola, Maarten Sap & Graham Neubig (2025) Interactive Agents to Overcome Ambiguity in Software Engineering. arXiv.
  6. Jimin Mun, Wei Bin Au Yeong, Wesley Hanwen Deng, Jana Schaich Borg & Maarten Sap (2025) Diverse Perspectives on AI: Examining People's Acceptability and Reasoning of Possible AI Use Cases. arXiv.
  7. Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner, Liwei Jiang, Nouha Dziri, Anne G. E. Collins, Jana Schaich Borg, Maarten Sap, Yejin Choi & Sydney Levine (2024) SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation. arXiv.
  8. Wenkai Li, Jiarui Liu, Andy Liu, Xuhui Zhou, Mona Diab & Maarten Sap (2024) BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data. arXiv.
  9. Xuhui Zhou, Hyunwoo Kim, Faeze Brahman, Liwei Jiang, Hao Zhu, Ximing Lu, Frank Xu, Bill Yuchen Lin, Yejin Choi, Niloofar Mireshghallah, Ronan Le Bras & Maarten Sap (2024) HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions. arXiv.
  10. Jen-tse Huang, Jiaxu Zhou, Tailin Jin, Xuhui Zhou, Zixi Chen, Wenxuan Wang, Youliang Yuan, Maarten Sap & Michael R. Lyu (2024) On the Resilience of Multi-Agent Systems with Malicious Agents. arXiv.

Awards

Paper awards

Outstanding paper SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization EMNLP 2023
Outstanding Paper NLPositionality: Characterizing Design Biases of Datasets and Models ACL 2023
Best Paper Queer In AI: A Case Study in Community-Led Participatory AI FAccT 2023
Best Paper Social Bias Frames: Reasoning about Social and Power Implications of Language WeCNLP 2020
Best Short Paper Nomination The Risk of Racial Bias in Hate Speech Detection ACL 2019

Other awards and honors

Selected to speak at the National Academy of Engineering's Frontiers of Engineering Artificial Social Intelligence? On the challenges of Socially Aware and Ethically informed LLMs 2024
William Chan Memorial Dissertation Award Positive AI with Social Commonsense Models 2021
Amazon Alexa Prize Sounding Board: A User-Centric and Content-Driven Social Chatbot 2017

Thesis Committees

PhD Ashutosh Baheti Mark Rield, GATech 2024
PhD Kaixin Ma Eric Nyberg, CMU 2023
PhD Prakhar Gupta Jeff Bingham, CMU 2023
Ms Jocelyn Chen Cytnhia Breazeal, MIT 2023
PhD Chan Young Park Yulia Tsvetkov, UW 2023
PhD Paul Röttger Scott Hale, University of Oxford 2023

Teaching

Courses

11-430/830 Ethics, Safety, and Social Impact in NLP and LLMs Spring 2025
11-361 Data Science Seminar Fall 2024
11-830 Ethics, Social Biases, and Positive Impact in Language Technologies Spring 2024
11-361 Data Science Seminar Fall 2023
11-830 Computational Ethics Spring 2023

Guest lectures & Tutorials

Social intelligence of LLM agents 05-899 Guest lecture Fall 2024
Bias in Natural Language Processing 66-142 Guest lecture Spring 2024
Bias in Natural Language Processing 11-711 Guest Lecture Spring 2024
Toxicity in LLMs 11-667 Guest Lecture Fall 2023
Bias in Natural Language Processing 11-711 Guest Lecture Fall 2023
Bias in Natural Language Processing 05-899 Guest Lecture Spring 2023
Bias in Natural Language Processing 15-884 Guest Lecture Fall 2022
"Crowdsourcing Beyond Annotation" Tutorial EMNLP 2021
"Commonsense Reasoning in Natural Language Processing" Tutorial ACL 2020

Service

Workshops

Socially Responsible Language Modelling Research (SoLaR) co-organizer NeurIPS 2024
Pluralistic Alignment co-organizer NeurIPS 2024
Multimodal Content Moderation Workshop co-organizer CVPR 2024
Multimodal Content Moderation Workshop co-organizer CVPR 2023
NLP for Positive Impact Workshop steering committee EMNLP 2022
NLP for Positive Impact Workshop co-organizer ACL 2021

Committees

Belonging and Engagement in Language Technologies Institute (BELTI) committee CMU LTI 2022-present
PhD & MLT admissions committee CMU LTI 2022-present
Socio-cultural diversity and inclusion committee ACL 2020
Diversity committee UW CSE 2016–2020
Graduate student advisory council (G5PAC) UW CSE 01/2018–12/2020

Senior program committees

ACL rolling review 2020–present
AAAI 2021

Reviewing

Journals & conferences
ACL rolling review 2020–present
ACL 2019–present
FAccT 2024
PNAS 2024
EMNLP 2018–2023
Journal of Psycholinguistic Research 2023
Computing Survey 2023
Transactions of ACL 2020, 2022
AAAI 2020
ICWSM 2021
Dementia and Geriatric Cognitive Disorders Journal 2020
Computational Linguistic 2019, 2020
Humanities and Social Sciences Communications 2019
Journal of Artificial Intelligence Research 2019
IEEE Transactions on Cognitive and Developmental Systems 2019
Social Psychological and Personality Science 2018
Workshops
Workshop on NLP for Positive Impact 2022
Workshop on NLP for Causal Inference 2021
NAACL Student Research Workshop 2019
CLPsych workshop 2016–2018
Stylistic Variation workshop 2018

Panels & other service or outreach

Member of the Ethics Committee of AI2's OLMO project 09/2023–present
Responsible AI salon on generative AI at CMU 03/2023
Presentation to U.S. congressional appropriations committee about risks and implications of AI and LLMs 03/2023
Red-teaming GPT-4 for OpenAI 09/2022–12/2022

Talks

Artificial Social Intelligence? On the challenges of Socially Aware and Ethically informed LLMs
UCLA CS 269 Guest Lecture 02/2025
Cluster of Excellence "Science of Intelligence" (SCIoI) 01/2025
NeurIPS New In ML workshop (invited speaker) 12/2024
University of Pittsburgh CS colloquium 11/2024
Columbia NLP seminar 10/2024
NAE Frontiers of Engineering (invited talk) 09/2024
DSTA Faculty speaker series 09/2024
Aptima Brown Bag 07/2024
CMU Agent Workshop 2024 (invited speaker) 05/2024
UNC Chapel Hill Symposium on AI and Society 04/2024
How to Be a Smarter AI user
SxSW 03/2025
Rethinking the Role of AI in Counterspeech
First Workshop on Multilingual Counterspeech Generation at COLING 2025 (invited speaker) 01/2025
Developing Computational Analyses of the Social Aspects of Narratives
EMNLP Workshop on Narrative Understanding (invited speaker) 11/2024
Princeton Workshop on Narrative Possibilities (invited speaker) 06/2024
Towards Socially Aware AI with Pragmatic Competence
ICML workshop on Theory of Mind (invited speaker) 07/2023
The Pivotal Role of Social Context in Toxic Language Detection
ACL workshop on online abuse and harms (invited speaker) 07/2023
Dealing with meaning variation Workshop (invited speaker) 10/2023
Toward Prosocial NLP: Reasoning About And Responding to Toxicity in Language
MIT Media Lab Breazeal Group Meeting 11/2022
CMU S3D Computational Social Science Seminar 11/2022
Amazon Alexa Trust & Privacy 11/2022
University of Minnesota NLP seminar 10/2022
Detecting and Rewriting Social Biases in Language
Pinterest NLP seminar 09/2022
UIUC Responsible Data Science Seminar Series 02/2022
MilaNLP seminar at Università Bocconi 10/2021
PAN workshop at CLEF (invited speaker) 09/2021
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection
NAACL 07/2022
Text As Data (TADA) 10/2021
Positive AI with Social Commonsense Models
The Web Conf Workshop UserNLP: User-centered Natural Language Processing Workshop (invited speaker) 04/2022
AKBC Workshop on Commonsense Reasoning (invited speaker) 10/2021
University of Toronto Computer Science 04/2021
MIT EECS 03/2021
CMU LTI/MLD 03/2021
UChicago CS 03/2021
TTIC 02/2021
Emory CS 02/2021
Vanderbilt CS 02/2021
EPFL I&C 01/2021
Yale Data Science & Statistics seminar 01/2021
PowerTransformer: Unsupervised Controllable Revision for Biased Language Correction
EMNLP conference 11/2020
Social Bias Frames: Reasoning About Social and Power Dynamics
WeCNLP Summit 10/2020
ACL Conference 07/2020
Reasoning about Social Dynamics and Social Bias in Language
SRI seminar 01/2021
Georgia Tech NLP seminar 10/2020
Berkeley NLP seminar 02/2020
Stanford NLP seminar 02/2020
Social and Ethical Considerations in English Toxic Language Detection
NLP with Friends 08/2020
Recollection versus Imagination: Exploring Human Memory and Cognition via Neural Language Models
ACL Conference 07/2020
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
DARPA Communicating with Computers grant meeting 11/2019
Social IQa: Commonsense Reasoning about Social Interactions
EMNLP conference 11/2019
The Risk of Racial Bias in Hate Speech Detection
ACL Conference 07/2019
ICML Queer in AI workshop (invited speaker) 06/2019
ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning
AAAI conference 01/2019
AI2 seminar 01/2019
Event2Mind: Commonsense Inference on Events, Intents, and Reactions
DARPA Communicating with Computers grant meeting 07/2018
Detecting Implicit Bias in Text through Connotative Language
UW Social Psychology seminar 04/2018

News Coverage

NLPositionality: Characterizing Design Biases of Datasets and Models (2023)
ProsocialDialog: A Prosocial Backbone for Conversational Agents (2022)
Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs (2022)
Delphi: Towards Machine Ethics and Norms (2021)
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus (2021)
Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts (2021)
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts (2021)
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models (2020)
The Risk of Racial Bias in Hate Speech Detection (2019)
Connotation Frames of Power and Agency in Modern Films (2017)
Sounding Board - University of Washington’s Alexa Prize Submission (2017)
Miscellaneous