👩🏻🏫 Invited talks at SNU (Jan 2nd), HYU (Jan 8th), and KU (Jan 9th)
Topic - Towards a Science of Evaluation for Language Model
I am a postdoctoral associate at Boston University, working with Prof. Najoung Kim and Prof. Sebastian Schuster. I received my Ph.D. from Korea University, advised by Prof. Pilsung Kang. During my Ph.D., I was a research intern at NAVER and contributed to CLOVA for Writing. I completed my B.S. at HUFS, advised by Prof. Chungmok Lee.
My research focuses on LLM evaluation, aiming to discover, define, and measure the capabilities of language models. My long-term research vision is to establish a science of evaluation for language models. I like to think about what makes evaluation reliable and what it truly tells us about these models. I am also interested in LLM agents that autonomously solve complex problems in research and engineering, and how to reliably evaluate them.
PhD Industrial Management & Engineering
Korea University
BSc Industrial Management & Engineering and BA International Finance
HUFS
Topic - Towards a Science of Evaluation for Language Model
Topic - Can coding agents autonomously implement AI research extensions?
CheckEval: a reliable LLM-as-a-Judge framework for evaluating text generation using checklists. Thanks to my collaborators ❤️
Topic - Can coding agents autonomously implement AI research extensions?
For an up-to-date list of publications, check out my Google Scholar.
* denotes equal contribution; † denotes equal contribution in a senior role.