Hello,
 안녕하세요
I'm interested in building machines capable of reasoning and cognition
I'm a postdoctoral researcher working with Yejin Choi at ⚡. Prior to this, I received my PhD in Computer Science from Seoul National University, advised by Gunhee Kim.
My research goal is to enhance machine's ability to reason and process social information, such as theory of mind, and to develop systems that are socially competent and responsible. To achieve this, I explore two key directions: optimizing inference-time algorithms to make the most of existing models, and synthesizing large-scale data to address areas where models currently fall short.
Recent News
- Mar 2024 - Gave an invited talk at the UPenn NLP group seminar about theory of mind and AI.
- Feb 2024 - Gave an invited talk at Google about our recent work on contextual privacy.
- Jan 2024 - Confaide has been accepted at ICLR 2024 as spotlight.
- Dec 2023 - 🏆 SODA won the Outstanding Paper Award at EMNLP.
- Dec 2023 - I gave an invited talk at the BrainLink symposium.
- Nov 2023 - SODA and FANToM are selected for oral presentation at EMNLP.
- Oct 2023 - We released FANToM and ConfAIde benchmark. GPT-4 struggles with social reasoning!
- Aug 2023 - I won the Distinguished Doctoral Dissertation Award from Seoul National University CSE.
Publications (* equal contribution)
-
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Yuling Gu, Oyvind Tafjord, Hyunwoo Kim, Jared Moore, Ronan Le Bras, Peter Clark, Yejin Choi
arXiv 2024
-
HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions
Xuhui Zhou, Hyunwoo Kim*, Faeze Brahman*, Liwei Jiang, Hao Zhu, Ximing Lu, Frank Xu, Bill Yuchen Lin, Yejin Choi, Niloofar Mireshghallah, Ronan Le Bras, Maarten Sap
arXiv 2024
-
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models
Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim
EMNLP 2024
-
Is this the real life? Is this just fantasy?
The Misleading Success of Simulating Social Interactions With LLMs
Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim, Maarten Sap
EMNLP 2024
tldr -
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Aly M. Kassem*, Omar Mahmoud*, Niloofar Mireshghallah*, Hyunwoo Kim, Yulia Tsvetkov, Yejin Choi, Sherif Saad, Santu Rana
arXiv 2024
-
Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models
Anthony Sicilia, Hyunwoo Kim, Khyathi Raghavi Chandu, Malihe Alikhani, Jack Hessel
Findings of ACL 2024
-
Can LLMs Keep a Secret?
Testing Privacy Implications of Language Models via Contextual Integrity Theory
Hyunwoo Kim*, Niloofar Mireshghallah*, Xuhui Zhou, Yulia Tsvetkov, Maarten Sap, Reza Shokri, Yejin Choi
ICLR 2024 ✶ Spotlight
webpage | tldr -
FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Hyunwoo Kim, Melanie Sclar, Xuhui Zhou, Ronan Le Bras, Gunhee Kim, Yejin Choi, Maarten Sap
EMNLP 2023
webpage | tldr -
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
Hyunwoo Kim, Jack Hessel, Liwei Jiang, Peter West, Ximing Lu, Youngjae Yu, Pei Zhou, Ronan Le Bras, Malihe Alikhani, Gunhee Kim, Maarten Sap, and Yejin Choi
EMNLP 2023 ✶ Outstanding Paper
github | tldr -
ProsocialDialog: A Prosocial Backbone for Conversational Agents
Hyunwoo Kim*, Youngjae Yu*, Liwei Jiang, Ximing Lu, Daniel Khashabi, Gunhee Kim,
Yejin Choi, and Maarten Sap
EMNLP 2022
github | tldr -
Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes
Hyunwoo Kim, Byeongchang Kim, and Gunhee Kim
EMNLP 2021
Also accepted at 2021 NeurIPS Meaning in Context (MiC) workshop as a contributed talk
github | workshop -
KLUE: Korean Language Understanding Evaluation
Sungjoon Park*, Jihyung Moon*, Sungdong Kim*, Won Ik Cho*, ..., Hyunwoo Kim, ...,
Alice Oh**, Jung-Woo Ha**, Kyunghyun Cho** (31 authors)
NeurIPS Datasets and Benchmarks 2021
webpage -
How Robust are Fact Checking Systems on Colloquial Claims?
Byeongchang Kim*, Hyunwoo Kim*, Seokhee Hong, and Gunhee Kim
NAACL-HLT 2021
Also accepted to 2021 ICLR Neural Conversational AI (NeuCAIR) workshop
github | workshop -
Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness
Hyunwoo Kim, Byeongchang Kim, and Gunhee Kim
EMNLP 2020
Earlier version accepted at 2020 ICLR Bridging AI and Cognitive Science (BAICS) workshop as a contributed talk
github | slides | workshop -
Curiosity Bottleneck: Exploration by Distilling Task-Specific Novelty
Youngjin Kim, Hyunwoo Kim*, Wontae Nam*, Jihoon Kim, and Gunhee Kim
ICML 2019
github | slides -
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks
Byeongchang Kim, Hyunwoo Kim, and Gunhee Kim
NAACL-HLT 2019
github | slides
Honors & Awards
- Outstanding Paper Award, EMNLP, 2023
- Distinguished Doctoral Dissertation Award, Department of CSE, Seoul National University, 2023
- AI Star Scholarship, Yulchon Foundation, 2022
- Outstanding Researcher Fellowship, Department of CSE, Seoul National University, 2022
- Star Student Researcher Award, Department of CSE, Seoul National University, 2022
- Qualcomm Innovation Fellowship, Qualcomm, 2021
- NAVER Ph.D. Fellowship, NAVER, 2021
- Kwanjeong Scholarship, Kwanjeong Educational Foundation, 2019
- National Academic Excellence Scholarship, KOSAF, 2012
About Me
I always love to go taste delicious food. Please let me know if you know some great places to visit! I also enjoy watching Formula One. My favorite driver is Carlos Sainz. Smooooth operatorrrr.
In case you are looking for my original blog, where I used to write posts in Korean, it's here.
Photo credit: Sebastin Santy