Topic - Can coding agents autonomously implement AI research extensions?
CheckEval: a reliable LLM-as-a-Judge framework for evaluating text generation using checklists. Thanks to my collaborators ❤️
Can coding agents autonomously implement AI research extensions? Our [RExBench](https://arxiv.org/abs/2506.22598) paper is now available on arXiv!
Our [WritingPath](https://arxiv.org/abs/2404.13919) paper has been accepted to **NAACL 2025 (Industry Track)**!
Our [ContAccum](https://arxiv.org/abs/2406.12356v1) paper has been accepted to **NeurIPS 2024**!
Topic - Goal-Oriented Language Models and Evaluation
🤖💭🦔
About LLM-based Evaluation for Open-ended Generation