Liyan Tang

My name is Liyan Tang (唐立言 in Chinese). I’m a fifth-year (final-year) Ph.D. candidate in Computer Science in the TAUR Lab (Text Analysis, Understanding, and Reasoning) at UT Austin, advised by Greg Durrett. I have been fortunate to work with Ying Ding from the UT iSchool, Yifan Peng from Weill Cornell Medicine, and Justin F. Rousseau from UT Southwestern Medical Center (alphabetical order).

My research focuses on the automatic evaluation of LLMs, with an emphasis on hallucination detection. I developed MiniCheck, a state-of-the-art model for detecting hallucinations in LLM outputs. More recently, my interests have expanded to post-training techniques for strengthening the reasoning capabilities of LLMs and LVLMs, particularly agent-style approaches in which models interact with their environment and incorporate feedback to reduce hallucinations and improve reasoning.

Blog

Bespoke-MiniChart-7B: Pushing The Frontiers Of Open VLMs For Chart Understanding
Liyan Tang, Shreyas Pimpalgaonkar, Kartik Sharma, Alexandros G. Dimakis, Mahesh Sathiamoorthy, Greg Durrett

Selected Papers

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
Liyan Tang, Grace Kim, Xinyu Zhao, Thom Lake, Wenxuan Ding, Fangcong Yin, Prasann Singhal, Manya Wadhwa, Zeyu Leo Liu, Zayne Sprague, Ramya Namuduri, Bodun Hu, Juan Diego Rodriguez, Puyuan Peng, Greg Durrett
Proceedings of the Conference on Neural Information Processing Systems (NeurIPS), 2025

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Liyan Tang, Philippe Laban, Greg Durrett
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Liyan Tang, Igor Shalyminov, Amy Wing-mei Wong, Jon Burnsky, Jake W. Vincent, Yu’an Yang, Siffi Singh, Song Feng, Hwanjun Song, Hang Su, Lijia Sun, Yi Zhang, Saab Mansour, Kathleen McKeown
Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors
Liyan Tang, Tanya Goyal, Alexander R. Fabbri, Philippe Laban, Jiacheng Xu, Semih Yavuz, Wojciech Kryściński, Justin F. Rousseau, Greg Durrett
Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Evaluating Large Language Models on Medical Evidence Summarization
Liyan Tang, Zhaoyi Sun, Betina Idnay, Jordan G. Nestor, Ali Soroush, Pierre A. Elias, Ziyang Xu, Ying Ding, Greg Durrett, Justin F. Rousseau, Chunhua Weng, Yifan Peng
npj Digital Medicine, 2023

News

06/02/2025: Research Intern at Google DeepMind, CA.
06/03/2024: Research Intern at Bespoke Labs (startup), CA.
12/16/2023: Completed my Master of Science degree in Computer Science at UT Austin.
05/15/2023: Applied Scientist Internship at Amazon AWS AI, WA.
08/25/2021: Started my Ph.D. at UT Austin.
05/14/2021: Completed my Bachelor of Science degree in Mathematics at UT Austin.