Publications

Preprints

(2026). CIRF: Tokenizing Chain-of-Thoughts into Reusable Functional Units for Efficient Latent Reasoning in Large Language Models. preprint.
PDF
(2026). Cliff Tokens: Identifying Single-Token Failure Triggers in LLM Mathematical Reasoning. preprint.
(2026). AI Writers Have a Consistent Stylometric Footprint, but AI Editors Do Not. preprint.

Publications

(2026). Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams. KDD 2026.
(2026). RExBench: Can coding agents autonomously implement AI research extensions?. ACL 2026.
(2025). CheckEval: A reliable LLM-as-a-Judge framework for evaluating text generation using checklists. EMNLP 2025.
(2025). Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models. NAACL 2025 (Industry Track).
PDF
(2024). A Gradient Accumulation Method for Dense Retriever under Memory Constraint. NeurIPS 2024.
PDF
(2024). DSTEA: Improving Dialogue State Tracking via Entity Adaptive Pre-training. In Knowledge-Based System (IF = 8.8) and KnowledgeNLP@KDD 2023.
PDF
(2024). RAPID: Training-free Retrieval-based Log Anomaly Detection with PLM considering Token-level information. Engineering Applications of Artificial Intelligence (IF = 8.0).
(2023). LAnoBERT: System log anomaly detection based on BERT masked language model. Applied Soft Computing (IF = 8.7).
PDF
(2023). Painsight: An Extendable Opinion Mining Framework for Detecting Pain Points Based on Online Customer Reviews. In WASSA@ACL 2023.
PDF
(2022). Oh My Mistake!: Toward Realistic Dialogue State Tracking including Turnback Utterances. In SereTOD@EMNLP 2022.
PDF
(2022). Mismatch between Multi-turn Dialogue and its Evaluation Metric in Dialogue State Tracking. In ACL 2022.
PDF
(2020). Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT. In EMNLP 2020 (Findings).
(2019). Drone Surveillance System Considering Dynamic POIs. In JKIIE.
PDF