🎉 (New) Our paper "Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams" accepted to KDD 2026!
Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams
I am a postdoctoral associate at Boston University, working with Prof. Najoung Kim and Prof. Sebastian Schuster. I received my Ph.D. from Korea University, advised by Prof. Pilsung Kang. During my Ph.D, I was research intern at NAVER and contributed to CLOVA for Writing. I completed my B.S. at HUFS, advised by Prof. Chungmok Lee
My research focuses on LLM evaluation, aiming to discover, define, and measure the capabilities of language models. My long-term research vision is to establish a science of evaluation for language models. I like to think about what makes evaluation reliable and what it truly tells us about these models. I am also interested in LLM agents that autonomously solve complex problems in research and engineering, and how to reliably evaluate them.
PhD Industrial Management & Engineering
Korea University
BSc Industrial Management & Engineering and BA International Finance
HUFS
Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams
RExBench, Can coding agents autonomously implement AI research extensions?
Please consider submitting your work :)
Topic - Towards a Science of Evaluation for Language Model
For an up-to-date list of publications, check out my Google Scholar.
* denotes equal contribution, † denotes equal contribution as senior role.