Maarten Sap

I am an assistant professor at CMU's LTI department with a courtesy appointment in HCII, and a part-time research scientist and AI safety lead at the Allen Institute for AI (AI2). My research focuses on (1) measuring and improving AI systems' social and interactional intelligence, (2) assessing and combatting social inequality, safety risks, and socio-cultural biases in human- or AI-generated language, and (3) building narrative language technologies for prosocial outcomes.

I received my PhD from the University of Washington where I was advised by Noah Smith and Yejin Choi.
[bio for talks]

Recent updates:

August 2025 🌟: Incredibly honored to be one of 7 US recipients of the 2025 Okawa Research Grant from the Okawa Foundation!

August 2025 🧑‍🎓: Welcoming my first postdoc, Vasudha Varadarajan, to the lab!

August 2025 👨🏼‍🏫: Excited to give a (virtual) talk about Responsible AI for Diverse Users and Cultures at the Gender Bias in NLP workshop at ACL 2025!

July 2025 🧠🛡️: Five papers were accepted to COLM 2025! Highlights include HAICOSYSTEM, a framework for sandboxing safety risks in human-AI interaction; ALFA, which aligns LLMs to ask better clinical questions; and PolyGuard, a multilingual moderation tool for unsafe content. Two other papers to be released soon :)

May 2025 🧑‍💻🏆: Super super excited to announce that our paper Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance received the Best Paper Runner Up award at NAACL 2025. Huge congratulations to Kaitlyn!

April 2025 🏜️🚂: Though I will not be attending NAACL 2025, my students and collaborators will be presenting some exciting papers: Joel Mire on Rejected Dialects: Biases Against African American Language in Reward Models, Akhila Yerukola on NormAd: A Framework for Measuring the Cultural Adaptability of Large Language Models; Kaitlyn Zhou on Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance; Xuhui Zhou on AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents.

April 2025 🦞👨🏼‍🏫: Excited to give a talk at the MIT CSAIL NLP seminar on the challenges of socially aware and culturally adaptable LLMs.

[older news]

My research group:

Dan Chechelnitsky

LTI PhD student
co-advised with Chrysoula Zerva

Joel Mire

LTI MLT student

Karina Halevy

LTI PhD student
co-advised with Mona Diab

Jimin Mun

LTI PhD student

Jocelyn Shen

MIT PhD student
co-advised with Cynthia Breazeal

Vasudha Varadarajan

LTI Postdoc

Akhila Yerukola

LTI PhD student

Mingqian Zheng

LTI PhD student
co-advised with Carolyn Rosé

Xuhui Zhou

LTI PhD student

Overarching Research Themes

Themes extracted and images generated with the OpenAI API; there may be inconsistencies.

Ethical AI and Human-Centric Design

My research group explores the balance between technological advancement and ethical responsibility in AI systems. We examine the societal implications of AI usage through papers like [Why (not) use AI? Analyzing People's Reasoning and Conditions for AI Acceptability](https://arxiv.org/abs/2502.07287), which investigates the factors influencing public acceptance of AI technologies. Additionally, our work on [HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions](http://arxiv.org/abs/2409.16427) provides a framework for testing safety risks inherent in human-AI interactions, emphasizing the need for responsibly designed AI systems. Lastly, [Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures](https://arxiv.org/abs/2502.17710) highlights how cultural nuances must be integrated into AI behavior to avoid potential harm.

Explorations in Narrative Dynamics

My research group explores the complex interplay of narratives and how they shape human experience and understanding. We delve into the nuances of storytelling as seen in our investigations, including [Quantifying the narrative flow of imagined versus autobiographical stories](https://www.pnas.org/doi/10.1073/pnas.2211715119), which analyzes how different types of narratives influence perception. Another pivotal work, [HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs](https://arxiv.org/abs/2405.17633), provides insights into how diverse narrative styles can evoke empathy in audiences. This line of research underscores the importance of narrative context in shaping emotional engagement.

AI Agents and Social Intelligence

My research group explores the development of AI agents that exhibit social intelligence and adaptivenes to human interaction dynamics. A key study, [SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents](https://arxiv.org/abs/2310.11667), presents a framework for assessing how well AI systems can engage socially, guiding future developments in this area. Complementing this, [AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents](https://aclanthology.org/2025.naacl-long.595/) investigates how trustworthiness can be balanced with utility in AI communication. Furthermore, our work on [Is This the Real Life? Is This Just Fantasy? The Misleading Success of Simulating Social Interactions With LLMs](http://arxiv.org/abs/2403.05020) critically assesses the reliability of AI in replicating genuine human social interactions, which has implications for applications in conversational agents and virtual environments.

Addressing Bias in Language Models

My research group explores the pressing issue of bias in AI language models and seeks to devise strategies to mitigate these biases. We highlight critical findings in [Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty](https://arxiv.org/abs/2401.06730), which reveals how AI's uncertainty can lead to miscommunication and user misunderstandings. Our work on [Rejected Dialects: Biases Against African American Language in Reward Models](https://arxiv.org/abs/2502.12858) investigates bias against various linguistic dialects, emphasizing the need for more inclusive AI training methodologies. Additionally, the paper [NormAd: A Framework for Measuring the Cultural Adaptability of Large Language Models](https://aclanthology.org/2025.naacl-long.120/) provides a new metric for assessing how well AI models align with diverse cultural contexts.

Maarten Sap

Recent updates:

My research group:

Overarching Research Themes

Ethical AI and Human-Centric Design

Explorations in Narrative Dynamics

AI Agents and Social Intelligence

Addressing Bias in Language Models

More about me