Yukyung Lee ☕️
Yukyung Lee

Postdoctoral Associate

Boston University (tinlab.)

I am a postdoctoral associate at Boston University, working with Prof. Najoung Kim and Prof. Sebastian Schuster. I received my Ph.D. from Korea University, advised by Prof. Pilsung Kang. During my Ph.D, I was research intern at NAVER and contributed to CLOVA for Writing. I completed my B.S. at HUFS, advised by Prof. Chungmok Lee

My research focuses on LLM evaluation, aiming to discover, define, and measure the capabilities of language models. My long-term research vision is to establish a science of evaluation for language models. I like to think about what makes evaluation reliable and what it truly tells us about these models. I am also interested in LLM agents that autonomously solve complex problems in research and engineering, and how to reliably evaluate them.

Download CV
Interests
  • Language Model Evaluation
  • Benchmark Design
  • LLM Agent
  • Writing with AI
  • Anomaly Detection
Education
  • PhD Industrial Management & Engineering

    Korea University

  • BSc Industrial Management & Engineering and BA International Finance

    HUFS

Recent News
Recent Publications

For an up-to-date list of publications, check out my Google Scholar.

* denotes equal contribution, † denotes equal contribution as senior role.

(2026). Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams. preprint.
(2025). CheckEval: A reliable LLM-as-a-Judge framework for evaluating text generation using checklists. EMNLP 2025.
(2025). RExBench: Can coding agents autonomously implement AI research extensions?. preprint.
(2025). Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models. NAACL 2025 (Industry Track).
(2024). A Gradient Accumulation Method for Dense Retriever under Memory Constraint. NeurIPS 2024.